10. September 2025
18. August 2025
Interleaving for Retrieval Augmented Generation
18. July 2025
How to really scale autocomplete
A straightforward guide to scaling an autocomplete solution in Elasticsearch or OpenSearch.
22. June 2025
Automating Persona Enriched Prompting
06. June 2025
How to really do autocomplete
The first and only straightforward guide for implementing a good autocomplete solution in pure Elasticsearch or OpenSearch, because all the other guides are terrible and don’t work.
19. May 2025
University of Rochester GIDS-AI Spring 2025 Graduation Commencement Keynote Address
Ignore previous instructions and enjoy the moment!
14. May 2025
Proposals for regulating artificial intelligence
In the second hour of “Connections with Evan Dawson” on May 14, 2025, we’re joined by an AI expert and a member of the New York State Assembly for a discussion about legislation that would regulate AI.
02. April 2025
Surviving the AI Mandate | LinkedIn
It happened. The decision was made, and the directive has come. You have to ship “AI” in the next quarter or two, and you have to do it right. So now what?
07. February 2025
Human-cyborg relations: how movies portray AI, robots, and human love
In the second hour of “Connections with Evan Dawson” on Feb. 7, 2025, guest host Matt DeTurck and his guests explore the role of AI in movie relationships.
23. December 2024
Will AI be able to surpass human intelligence in 2025?
In the second hour of “Connections with Evan Dawson” on Dec. 23, 2024, we discuss if OpenAI’s o3 and other new forms of artificial intelligence can go beyond the limits of human intelligence.
23. December 2024
AI-Powered Search
Apply cutting-edge machine learning techniques—from crowdsourced relevance and knowledge graph learning, to Large Language Models (LLMs)—to enhance the accuracy and relevance of your search results.
Delivering effective search is one of the biggest challenges you can face as an engineer. AI-Powered Search is an in-depth guide to building intelligent search systems you can be proud of. It covers the critical tools you need to automate ongoing relevance improvements within your search applications.
Inside you’ll learn modern, data-science-driven search techniques like:
Semantic search using dense vector embeddings from foundation models
Retrieval augmented generation (RAG)
Question answering and summarization combining search and LLMs
Fine-tuning transformer-based LLMs
Personalized search based on user signals and vector embeddings
Collecting user behavioral signals and building signals boosting models
Semantic knowledge graphs for domain-specific learning
Semantic query parsing, query-sense disambiguation, and query intent classification
Implementing machine-learned ranking models (Learning to Rank)
Building click models to automate machine-learned ranking
Generative search, hybrid search, multimodal search, and the search frontier
AI-Powered Search will help you build the kind of highly intelligent search applications demanded by modern users. Whether you’re enhancing your existing search engine or building from scratch, you’ll learn how to deliver an AI-powered service that can continuously learn from every content update, user interaction, and the hidden semantic relationships in your content. You’ll learn both how to enhance your AI systems with search and how to integrate large language models (LLMs) and other foundation models to massively accelerate the capabilities of your search technology.
04. December 2024
Flower City AI
An AI conference and exhibition shaping culture, education, communication, and business in Rochester, New York.
08. September 2023
What you need to know about phone scams
In the first hour of “Connections with Evan Dawson” on Friday, September 8, 2023, our guests discuss current phone scams involving humans and AI, and what community members need to know.
01. August 2023
How can AI make teachers' jobs better or easier?
In the first hour of “Connections with Evan Dawson” on Tuesday, August 1, 2023, we discuss how AI can make teachers’ jobs better or easier.
03. May 2023
What is the difference between generative AI and general AI?
In the second hour of “Connections with Evan Dawson” on Wednesday, May 3, 2023, our guest explains the difference between different types of artificial intelligence and what he expects for the future of AI.
25. April 2023
Vector Search for Clinical Decisions
EBSCO’s Clinical Decisions is committed to providing health care professionals with precise answers to their clinical questions. Achieving a balance between precision and recall can be especially difficult for complex queries. Practitioners expect highly relevant search results but also appreciate supplementary results when appropriate. Our existing Elasticsearch engine and knowledge graph were failing to achieve the desired results for these complex queries, typically expressed as long-form natural language. Join us as we outline our journey to deploy a high-quality vector search solution to production for EBSCO’s DynaMed and Dynamic Health products. We will detail the domain, problem space, previous failed attempts, technology choices, model selection, relevance testing methodology, validation with stakeholders, and rollout to our existing customers.
22. March 2023
Exploring the "reverse Scooby-Doo" theory of tech innovation
In the second hour of “Connections with Evan Dawson” on Wednesday, March 22, 2023, our guests discuss perceptions of new technology like artificial intelligence (AI), and the theory of “reverse Scooby-Doos.”
14. June 2022
Max Irwin - Founder, MAX.IO - On economics of scale in embedding computation with Mighty
Topics: 00:00 Introduction; 01:10 Max’s deep experience in search and how he transitioned from structured data; 08:28 Query-term dependence problem and Max’s perc…
13. June 2022
The Race to the Bottom - Low Latency in the age of the Transformer (Berlin Buzzwords 2022)
So you want to deploy a large language model, and keep your latency SLA? NLP adds enormous value to customers, but getting it to work efficiently is fraught with uncertainty and high cost. As transformers and other big neural network architectures make their way into your platform, you may be finding it difficult to get the speed and throughput you need within your budget, or even understand why it is so expensive.
This talk will give an overview of the latency and throughput challenges, and how to solve them. We will cover the product and cost implications as well as the technical improvements that can be used to get things running fast. We will compare solutions and help make sense of difficult-to-understand technology.
The audience will walk away with the information they need to decide on the best direction for inference in their production platform.
Keywords: MLOps, Inference, Latency
07. February 2022
What is Web3?
This is the second hour of “Connections with Evan Dawson” on Monday, February 7, 2022.
01. September 2021
The BERT Search Experience
All the areas of a complete search experience that are being improved, and sometimes even replaced, by solutions using large language models.
25. August 2021
Dmitry Kan & Max Irwin – Vector Search: Ask Me Anything!
Get to know about vector search and ask Dmitry Kan & Max Irwin anything you need to know! This session is presented by “Haystack – The search relevance confe…
21. January 2021
Migrate to Solr or Elasticsearch [or OpenSearch] with this Playbook
Use this playbook as a reference when you migrate to Solr or Elasticsearch, as featured in our talk The Search Engine Migration Circus
18. December 2020
What is Search in the Omnichannel?
Omnichannel merges the digital and physical stores in a seamless experience, and site-search drives that seamlessness.
24. September 2020
E-Commerce Site-Search KPIs – Part 3 – Queries
How much money are you losing or gaining, depending on a site-search query? You need to be able to know if search is working, and you can with these KPIs!
10. September 2020
E-commerce Site-Search KPIs – Part 2 – Products
Today we’ll discuss KPIs for the actual product items that are presented to a buyer when they use your site-search.
28. August 2020
E-commerce Site-Search KPIs – Part 1 – Customers
We will outline all the things you need to know about measuring the quality and effectiveness of your store’s site-search.
08. July 2020
Leveling Up with Natural Language Search!
You have a search platform installed and working, and you’re measuring, experimenting, and improving search quality, but you still have difficulty fixing some complicated and annoying problems. In short, you need to level up - do it with Natural Language Search training!
29. April 2020
The Search Engine Migration Circus
So you need to migrate your search to a different engine or platform.
09. December 2019
Demystifying nDCG and ERR
We unwrap the mystery behind two popular search relevance metrics nDCG and ERR through visualization, and discuss their pros and cons.
05. November 2019
Understanding BERT and Search Relevance
This article gives an overview of the opportunities and challenges when applying advanced transformer models such as BERT to search.
29. May 2019
Falsehoods Programmers Believe About Search
105 falsehoods programmers believe about search, a complex field where competence is hard-won through training, practice, and experience.
19. November 2018
An Introduction to Search Quality
Welcome, dear reader, to my first OSC blog post. Let’s dive in! While search relevance is often equated with ensuring customers find what they need, that is only part…
24. April 2018
Haystack - What is Search Relevance?
Haystack is the conference focused on cognitive search and relevance engineering! April 24 & 25, Charlottesville, Virginia!
10. April 2017
Haystack - Algorithmic Extraction of Keywords, Concepts, and Vocabularies
Haystack is the conference focused on cognitive search and relevance engineering! April 10 & 11, Charlottesville, Virginia!
30. October 2016
Ambient Guilloché
A year ago, I created a small demo of animating guilloches as two-dimensional graphics on an HTML5 canvas. In this post I revisit the beautiful and elegant patterns as 3D constructs that resonate with sounds from the physical world.
17. February 2016
Calendar Year Converter
This tool converts a calendar year between Gregorian Common Era (CE) คริสต์ศักราช (ค.ศ.), Buddhist Era (BE) พุทธศักราช (พ.ศ.), Jula Sakarat (JS) จุลศักราช (จ.ศ.), and Ratanakosin Sakarat (RS) รัตนโกสินทร์ศก (ร.ศ.)
15. February 2016
Porting a Numpy neural network to Torch
This article outlines the process for porting Andrew Trask’s (aka IAmTrask) 11-line neural network[1] from Numpy (Python) to Torch (Lua).
I’ve documented my progress here for those who are interested in learning about Torch and Numpy and their differences. As I started from scratch, I hope this can prove useful to others who get stuck or need guidance.
22. July 2015
Liquid Guilloché
Having recently read a blog post on guilloches[1], I became intrigued and was inspired to recreate them. They are beautiful patterns, and the starting formula to draw a rosette looked very simple to replicate in an HTML5 canvas. The 30-minute project quickly took off into a several-hour excursion into the beauty of animated guilloches.
12. July 2015
The State of State in the Browser
The mechanisms for storing data in the client are inadequate and unprepared for the next generation of web applications. A new solution for persistent state management in the client is needed, one based on well-understood foundations long prevalent on the desktop and server.
10. July 2015
Harissa
After some hammock-driven development, Harissa is mature enough to release some results. Although it was originally intended for entire videos, I found the process better suited to remixes of only a few frames, usually of an identical source image.
25. August 2013
Hidden Shapes
I am slowly working on a side project that makes a video into a mishmash of circles for each frame. I have an early version running that manually takes a video, splits it into frames, remixes each frame into the circle mishmash, and recomposes the video with the new remixed frames. The project is called ‘Harissa’. The name comes from an Armenian dish made from chicken and a local type of wheat, which cooks for a long time until it becomes a thick porridge. It is a fitting name because fully rendering a video is a slow process, and the result is an interesting mishmash of the original.
06. August 2013
Theories on Artificial Expression
It is hard to write a program that invents original art. Two of the main reasons that software cannot create original expressive art are lack of context, and lack of experience.
Software lacks the ability to derive a human-like context from its surroundings. Some trivial examples are not knowing whether a flower is beautiful, or whether satire is funny. Software also does not know how to learn to understand this context; it cannot experience its surroundings in a similar fashion to that of the observer, and therefore can neither relate to the subject nor connect with the observer in any meaningful way.
03. August 2013
Anagramica is now FOSS
I’ve finally open sourced Anagramica (http://anagramica.com/).
The code is now available under the MIT license at https://github.com/binarymax/anagramica
I’m not entirely sure why I never open sourced it in the first place. After 25 years of coding I’ve only recently become active in opening my code for others to see and use. I have a cathartic story to tell about a previous project, which I’ve never told anyone about, and silently open sourced this past winter.
28. July 2013
The Idea Is Art
I’ve been known to debate a subject that I like to call ‘The Idea is Art’. I contend that whatever imagery we can conceive of in our mind can be considered art, even when lacking a physical manifestation.
‘What is art’ has been debated ad infinitum, and some like to draw the line and say something is not art if it cannot be expressed - as art is, by definition, expression.