Max Irwin






22. June 2025

Automating Persona Enriched Prompting

Approaching prompting from a domain, persona, and user context perspective. Add some PEP as a step!

For awhile now, I’ve been using a technique that I have developed which I call Persona Enriched Prompting (“PEP”). With this technique, you instruct the model to generate knowledge about the inferred domain and then generate UX personas for that domain contextual to the problem. Once this context is generated, only then do you continue on to final answer generation.

more

06. June 2025

How to really do autocomplete

The first and only straightforward guide for implementing a good autocomplete solution in pure Elasticsearch or Opensearch, because all the other guides are terrible and don’t work.

bonsai.io

23. December 2024

AI-Powered Search

Apply cutting-edge machine learning techniques—from crowdsourced relevance and knowledge graph learning, to Large Language Models (LLMs)—to enhance the accuracy and relevance of your search results.

Delivering effective search is one of the biggest challenges you can face as an engineer. AI-Powered Search is an in-depth guide to building intelligent search systems you can be proud of. It covers the critical tools you need to automate ongoing relevance improvements within your search applications.

Inside you’ll learn modern, data-science-driven search techniques like:

Semantic search using dense vector embeddings from foundation models Retrieval augmented generation (RAG) Question answering and summarization combining search and LLMs Fine-tuning transformer-based LLMs Personalized search based on user signals and vector embeddings Collecting user behavioral signals and building signals boosting models Semantic knowledge graphs for domain-specific learning Semantic query parsing, query-sense disambiguation, and query intent classification Implementing machine-learned ranking models (Learning to Rank) Building click models to automate machine-learned ranking Generative search, hybrid search, multimodal search, and the search frontier

AI-Powered Search will help you build the kind of highly intelligent search applications demanded by modern users. Whether you’re enhancing your existing search engine or building from scratch, you’ll learn how to deliver an AI-powered service that can continuously learn from every content update, user interaction, and the hidden semantic relationships in your content. You’ll learn both how to enhance your AI systems with search and how to integrate large language models (LLMs) and other foundation models to massively accelerate the capabilities of your search technology.

www.manning.com

04. December 2024

Flower City AI

An AI conference and exhibition shaping culture, education, communication, and business in Rochester, New York.

flowercity.ai

25. April 2023

Vector Search for Clinical Decisions

EBSCO’s Clinical Decisions is committed to providing health care professionals with precise answers to their clinical questions. Achieving a balance between precision and recall can be especially difficult for complex queries. Practitioners expect highly relevant search results but also appreciate supplementary results when appropriate. Our existing Elasticsearch engine and knowledge graph was failing to achieve the desired results for these complex queries, typically expressed as long-form natural language. Join us as we outline our journey to deploy a high-quality vector search solution to production for EBSCO’s DynaMed and Dynamic Health products. We will detail the domain, problem space, previous failed attempts, technology choices, model selection, relevance testing methodology, validation with stakeholders, and rollout to our existing customers.

haystackconf.com

13. June 2022

The Race to the Bottom - Low Latency in the age of the Transformer Berlin Buzzwords 2022

So you want to deploy a large language model, and keep your latency SLA? NLP adds enormous value to customers, but getting it to work efficiently is fraught with uncertainty and high cost. As transformers and other big neural network architectures make their way into your platform, you may be finding it difficult to get the speed and throughput you need within your budget, or even understand why it is so expensive.

This talk will give an overview of the latency and throughput challenges, and how to solve them. We will give an overview in the product and cost implications as well as the technical improvements that can be used to get things running fast. We will compare solutions and help make sense of difficult to understand technology.

The audience will walk away with the information they need to decide on the best direction for inference in their production platform.

Keywords: MLOps, Inference, Latency

pretalx.com

08. July 2020

Leveling Up with Natural Language Search!

You have a search platform installed and working, you’re measuring, experimenting, and improving search quality, but you’re still having difficulty fixing some complicated and annoying problems you still have. In short, you need to level up - do it with Natural Language Search training!

opensourceconnections.com

30. October 2016

Ambient Guilloché

A year ago, I created a small demo of animating guilloches as two dimensional graphics on an HTML5 canvas. In this post I revisit the beautiful and elegant patterns as 3d constructs that resonate with sounds from the physical world.

more

17. February 2016

Calendar Year Converter

This tool converts a calendar year between Gregorian Common Era (CE) คริสต์ศักราช (ค.ศ.), Buddhist Era (BE) พุทธศักราช (พ.ศ.), Jula Sakarat (JS) จุลศักราช (จ.ศ.), and Ratanakosin Sakarat (RS) รัตนโกสินทร์ศก (ร.ศ.)

more

15. February 2016

Porting a Numpy neural network to Torch

This article outlines the process for porting Andrew Trask’s (aka IAmTrask) 11-line neural network[1] from Numpy (Python) to Torch (Lua).

I’ve documented my progress here, for those who are interested in learning about Torch and Numpy and their differences. As I started from scratch I hope this can prove useful to others who get stuck or need guidance.

more

22. July 2015

Liquid Guilloché

Having recently read a blog post on guilloches[1], I became intrigued and the post inspired me to recreate them. They are beautiful patterns, and the starting formula to draw a rosette looked very simple to replicate in an HTML5 canvas. The 30 minute project quickly took off into a several hour excursion into the beauty of animated guilloches.

more

12. July 2015

The State of State in the Browser

The mechanisms for storing data in the client are inadequate and unprepared for the next generation of web applications. A new solution for persistent state management in the client is needed that is based on well understood foundations long prevalent on the desktop and server.

more

10. July 2015

Harissa

After some hammock driven development, Harissa is mature enough to release some results. Originally intended for entire videos, I found the process better suited for only several frame remixes, usually of an identical source image.

more

25. August 2013

Hidden Shapes

I am slowly working on a side-project that makes a video into a mishmash of circles for each frame. I have an early version running, that manually takes a video, splits it into frames, remixes each frame into the circle mishmash, and recomposes the video with the new remixed frames. The project is called ‘Harissa’. The name Harissa comes from an Armenian dish and is made from chicken and a local type of wheat. It cooks for a long time, until it is a thick porridge. It is a fitting name because fully rendering a video is a slow process, and the result is an interesting mishmash of the original.

more

06. August 2013

Theories on Artificial Expression

It is hard to write a program that invents original art. Two of the main reasons that software cannot create original expressive art are lack of context, and lack of experience.

Software lacks the ability to derive a human-like context from its surroundings. Some trivial examples are not knowing whether a flower is beautiful, or whether satire is funny. Software also does not know how to learn to understand this context, it cannot experience its surroundings in a similar fashion to that of the observer, and therefore cannot relate to the subject nor connect with the observer in any meaningful way.

more

03. August 2013

Anagramica is now FOSS

I’ve finally open sourced Anagramica (http://anagramica.com/)

The code is now available under the MIT license at https://github.com/binarymax/anagramica

I’m not entirely sure why I never open sourced it in the first place. After 25 years of coding I’ve only recently become active in opening my code for others to see and use. I have a cathartic story to tell about a previous project, which I’ve never told anyone about, and silently open sourced this past winter.

more

28. July 2013

The Idea Is Art

I’ve been known to debate about a subject, which I like to call ‘The Idea is Art’. I defend that whatever imagery we can conceive of in our mind can be considered art, even when lacking a physical manifestation.

‘What is art’ has been debated ad infinitum, and some like to draw the line and say something is not art if it cannot be expressed - as art is, by definition, expression.

more