diff --git a/_community_members/hajer.md b/_community_members/hajer.md
new file mode 100644
index 000000000..15a6057b2
--- /dev/null
+++ b/_community_members/hajer.md
@@ -0,0 +1,7 @@
+---
+short_name: hajer
+name: Hajer Bouafif
+photo: "/assets/media/community/members/hajer.png"
+linkedin: hajerbouafif
+---
+**Hajer Bouafif** is a Senior Solutions Architect at AWS. Hajer focuses on data, AI, and search, with a background in big data engineering, providing organizations with best practices and well-architected reviews to build large-scale AI and search solutions.
\ No newline at end of file
diff --git a/_community_members/pmprasad.md b/_community_members/pmprasad.md
index 54225dfd4..b7a3108de 100755
--- a/_community_members/pmprasad.md
+++ b/_community_members/pmprasad.md
@@ -5,6 +5,7 @@ photo: '/assets/media/community/members/pmprasad.jpg'
 job_title_and_company: 'Analytics Specialist @ AWS - OpenSearch'
 primary_title: 'Praveen Mohan Prasad'
 title: 'OpenSearch Community Member: Praveen Mohan Prasad'
+linkedin: praveen-mohanprasad
 breadcrumbs:
   icon: community
   items:
@@ -26,5 +27,4 @@ conference_id:
 redirect_from: '/authors/pmprasad/'
 ---
-A search specialist working closely with Amazon OpenSearch service users. I actively research and experiment on using Artificial Intelligence for Search and help OpenSearch users adopt machine learning in OpenSearch and improve search relevance.
-
+**Praveen Mohan Prasad** is a search specialist with data science expertise who actively researches and experiments with using machine learning to improve search relevance. Praveen helps clients implement and operationalize strategies to improve the search experience.
\ No newline at end of file
diff --git a/_posts/2025-09-23-Introducing-OpenSearch-ai-demo-application.md b/_posts/2025-09-23-Introducing-OpenSearch-ai-demo-application.md
new file mode 100644
index 000000000..30ff8a432
--- /dev/null
+++ b/_posts/2025-09-23-Introducing-OpenSearch-ai-demo-application.md
@@ -0,0 +1,92 @@
+---
+layout: post
+title: "Introducing OpenSearch AI demos on Hugging Face: Your all-in-one AI search playground"
+authors:
+  - pmprasad
+  - hajer
+  - jhandler
+date: 2025-09-23
+categories:
+  - technical-post
+meta_keywords: AI search, Agentic search, Semantic search, AI agents, RAG, Conversational search, Agentic search assistant, ColPali
+meta_description: "We're thrilled to announce the launch of OpenSearch AI demos on Hugging Face, a set of cutting-edge demo applications that put the power of advanced AI search capabilities right at your fingertips."
+---
+
+We're thrilled to announce the launch of [OpenSearch AI demos](https://huggingface.co/spaces/opensearch-project/OpenSearch-AI) on [Hugging Face](https://huggingface.co/opensearch-project), a set of cutting-edge demo applications that put the power of advanced AI search capabilities right at your fingertips. Whether you're a machine learning (ML) engineer, search expert, data wizard, or product owner diving into the world of generative AI, OpenSearch AI demos are your new go-to playground for exploring and understanding the latest in search technology.
+
+![OpenSearch AI demo application](/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/Home-screen.png){: .img-fluid}
+
+OpenSearch AI Search is a comprehensive demo platform that showcases the full spectrum of AI search capabilities using OpenSearch. It includes three main modules:
+
+
+* **AI Search**: Demonstrates AI search capabilities with OpenSearch (lexical search, neural sparse search, dense vector search, hybrid search, multimodal search). The **AI Search** module also demonstrates using large language models (LLMs) to improve search. The solution uses LLMs in two key ways: to rewrite queries with metadata filters for better retrieval and to judge relevance, helping you compare the impact of different search types on quality. Additionally, this section covers tuning hybrid search weights per subquery and demonstrates how reranking improves search relevance. In this demo, we use [a dataset](https://github.com/aws-samples/retail-demo-store) containing 2,465 retail product samples belonging to different categories, such as accessories, home decor, apparel, housewares, books, and instruments. Each product contains metadata, including the ID, current stock, name, category, style, description, price, image URL, and gender affinity of the product. The following image shows the AI Search UI.
+
+![OpenSearch AI demo application](/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-search-module.png){: .img-fluid}
+
+* **Multimodal RAG**: Using multimodal retrieval-augmented generation (RAG), you can unlock the insights hidden within unstructured PDF documents. Many PDFs contain multimodal elements like text, tables, and images, and relying solely on text-based processing risks overlooking critical information. This section addresses this challenge using two approaches:
+1. Grounding all modalities in text: Sample PDFs are parsed using multimodal parsers like the [Unstructured Python module](https://pypi.org/project/unstructured/), where images and tables are summarized into text and stored as both sparse and dense vectors. Metadata links each chunk to its source (text, image, or table), enabling retrieval for question answering.
+2. Using cutting-edge vision-language models for information retrieval: Multi-vector late interaction models like [ColPali](https://arxiv.org/abs/2407.01449) avoid the need for complex pipelines to process multimodal PDFs by generating multiple page-level vectors that capture all modalities. These localized representations enable fine-grained interactions between query tokens and image patches of PDF pages, allowing ColPali to compute high-resolution relevance scores across spatial layouts. A key advantage is interpretability: similarity scores can be visualized as heat maps over PDF pages, highlighting which segments (such as text blocks, tables, or figure captions) contributed most to retrieval, as depicted in the following image. This solution uses a [dataset](https://huggingface.co/datasets/vespa-engine/gpfg-QA) consisting of images of 6,692 PDF pages (with text, tables, and charts) from public reports of the [Norwegian Government Pension Fund Global](https://www.nbim.no/en/news-and-insights/reports/).
+
+![OpenSearch AI demo application](/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/RAG-module.png){: .img-fluid}
+
+* **AI shopping assistant**: While RAG is widely known for building QA systems over unstructured data, its application in e-commerce product search opens new possibilities for transforming the user experience. In this solution, we explore how combining RAG with AI agents on top of your OpenSearch product catalog creates an agentic shopping assistant capable of delivering a conversational shopping experience. By equipping the agent with capabilities such as user intent understanding, query rewriting, dynamic selection of search strategies, search result evaluation, feedback-based retries, and personalization, we enable adaptive, high-quality retrieval over product catalogs. This approach not only improves relevance but also allows users to interact naturally with the search system. In this module, we use a [dataset](https://github.com/shuttie/esci-s) derived from the original [Amazon ESCI](https://github.com/amazon-science/esci-data) e-commerce dataset. We use a subset of items under the "apparel" and "footwear" product categories, totaling 59.3K products. A sample AI shopping assistant interaction is depicted in the following images.
+
+![OpenSearch AI demo application](/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant.png){: .img-fluid}
+
+![OpenSearch AI demo application](/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-2.png){: .img-fluid}
+
+![OpenSearch AI demo application](/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-3.png){: .img-fluid}
+
+
+The AI agent in this demo is equipped with the following tools:
+
+* `get_relevant_items_for_image`: Retrieves relevant products based on the provided image and text query by running a multimodal search.
+* `generate_image`: Generates images based on the provided text description.
+* `get_product_details`: Retrieves details about a specific product by looking up the OpenSearch index.
+* `get_relevant_items_for_text`: Retrieves relevant products based on the provided text query by running a semantic search.
+* `get_any_general_recommendation`: Provides fashion recommendations related to a specific product or in general.
+* `retrieve_with_keyword_search`: Retrieves relevant products based on the provided text query by running a lexical search.
+
+
+
+## Under the hood: OpenSearch AI search demo architecture
+
+
+ML integration in search systems significantly enhances the user experience throughout the entire lifecycle, from ingesting documents to delivering highly relevant results. OpenSearch facilitates this integration through a [list of supported ML connectors](https://docs.opensearch.org/latest/ml-commons-plugin/remote-models/connectors/), enabling seamless interaction with ML models and LLMs deployed on platforms such as [Amazon SageMaker AI](https://docs.aws.amazon.com/sagemaker/latest/dg/whatis.html) and [Amazon Bedrock](https://aws.amazon.com/bedrock/). The following image presents the AI search demo architecture.
+
+
+![OpenSearch AI demo application](/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/ML-integrations-opensearch.png){: .img-fluid}
+
+
+OpenSearch supports a dual-pipeline architecture, comprising [ingest pipelines](https://docs.opensearch.org/latest/ingest-pipelines/) and [search pipelines](https://docs.opensearch.org/latest/search-plugins/search-pipelines/index/), that provides a robust framework for ML-driven search optimization.
+
+During ingestion, documents undergo ML-powered processing, using a combination of [ingest processors](https://docs.opensearch.org/latest/ingest-pipelines/processors/index-processors/#supported-processors) within the ingest pipeline to chunk long text documents and generate text embeddings.
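+
+To make this concrete, the following sketch shows one way such a dual-pipeline setup could be defined with the `opensearch-py` client: an ingest pipeline that chains a `text_chunking` processor with a `text_embedding` processor, and a search pipeline that uses a `normalization-processor` to combine hybrid search scores. The host, pipeline names, field names, model ID, and weights below are illustrative placeholders rather than the demo's exact configuration.
+
+```python
+from opensearchpy import OpenSearch
+
+# Placeholder connection details and model ID -- adjust these for your own cluster.
+client = OpenSearch(hosts=[{"host": "localhost", "port": 9200}])
+embedding_model_id = "<your-deployed-embedding-model-id>"
+
+# Ingest pipeline: chunk long text fields, then embed each chunk.
+client.ingest.put_pipeline(
+    id="demo-ingest-pipeline",
+    body={
+        "description": "Chunk product descriptions and generate text embeddings",
+        "processors": [
+            {
+                "text_chunking": {
+                    "algorithm": {"fixed_token_length": {"token_limit": 384, "overlap_rate": 0.2}},
+                    "field_map": {"description": "description_chunks"},
+                }
+            },
+            {
+                "text_embedding": {
+                    "model_id": embedding_model_id,
+                    "field_map": {"description_chunks": "description_embedding"},
+                }
+            },
+        ],
+    },
+)
+
+# Search pipeline: normalize and combine lexical and vector subquery scores for hybrid search.
+client.transport.perform_request(
+    "PUT",
+    "/_search/pipeline/demo-hybrid-pipeline",
+    body={
+        "phase_results_processors": [
+            {
+                "normalization-processor": {
+                    "normalization": {"technique": "min_max"},
+                    "combination": {
+                        "technique": "arithmetic_mean",
+                        "parameters": {"weights": [0.3, 0.7]},
+                    },
+                }
+            }
+        ]
+    },
+)
+```
+
+A hybrid query can then reference this search pipeline at query time (for example, `POST /demo-index/_search?search_pipeline=demo-hybrid-pipeline`) so that lexical and vector scores are normalized and combined using the weights defined above.
+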
+During search, search pipelines use the power of [search processors](https://docs.opensearch.org/latest/search-plugins/search-pipelines/search-processors/) to transform queries, normalize hybrid search scores, and apply LLMs for text generation.
+
+OpenSearch ML integrations enable advanced search functionalities such as hybrid search, query rewriting, and RAG, effectively bridging the gap between traditional information retrieval and state-of-the-art ML techniques.
+
+
+## Key benefits
+
+In the rapidly evolving landscape of AI and search technologies, it's crucial to understand which approaches work best for different use cases. OpenSearch AI demos allow you to do the following:
+
+
+1. Compare and contrast search types: Test different AI search types side by side to see how they perform for your specific needs.
+2. Explore cutting-edge techniques: Get hands-on experience with state-of-the-art methods like query rewriting using LLMs and multi-vector search with ColPali in RAG.
+3. Witness agents in action: See how intelligent agents can enhance the search experience in real time.
+4. Explore everything in one place: Access all of these advanced search techniques, integrated with ML models and LLMs, in a single user-friendly platform.
+
+## Start exploring
+
+OpenSearch AI demos are built for practical experimentation and learning. Here's how to get started:
+
+
+1. Go to the [OpenSearch AI demos application](https://huggingface.co/spaces/opensearch-project/OpenSearch-AI) and start experimenting with different AI search use cases.
+2. Try various queries for your use case, compare results, and see which techniques work best.
+3. Got ideas for improvements? Check out our [Hugging Face repo](https://huggingface.co/spaces/opensearch-project/OpenSearch-AI/tree/main) to request features, report bugs, or contribute to OpenSearch AI demos.
+4. Are you an AWS builder? Check out [this hands-on workshop](https://catalog.workshops.aws/opensearch-ml-search/en-US) to build a next-generation search demo in your own AWS account.
+
+
+
+Ready to revolutionize your approach to AI search? Dive into OpenSearch AI demos today and discover the future of search technology!
+
diff --git a/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-search-module.png b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-search-module.png
new file mode 100644
index 000000000..f33099912
Binary files /dev/null and b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-search-module.png differ
diff --git a/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-2.png b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-2.png
new file mode 100644
index 000000000..71bcd07a5
Binary files /dev/null and b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-2.png differ
diff --git a/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-3.png b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-3.png
new file mode 100644
index 000000000..d21a4dcef
Binary files /dev/null and b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant-3.png differ
diff --git a/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant.png b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant.png
new file mode 100644
index 000000000..15afbef4f
Binary files /dev/null and b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/AI-shopping-agent-assistant.png differ
diff --git a/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/Home-screen.png b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/Home-screen.png
new file mode 100644
index 000000000..d48781e1e
Binary files /dev/null and b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/Home-screen.png differ
diff --git a/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/ML-integrations-opensearch.png b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/ML-integrations-opensearch.png
new file mode 100644
index 000000000..8bf3a663a
Binary files /dev/null and b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/ML-integrations-opensearch.png differ
diff --git a/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/RAG-module.png b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/RAG-module.png
new file mode 100644
index 000000000..b08e09334
Binary files /dev/null and b/assets/media/blog-images/2025-09-23-introducing-opensearch-ai-demo-application/RAG-module.png differ
diff --git a/assets/media/community/members/hajer.png b/assets/media/community/members/hajer.png
new file mode 100644
index 000000000..b804cc699
Binary files /dev/null and b/assets/media/community/members/hajer.png differ
diff --git a/assets/media/community/members/pmprasad.jpg b/assets/media/community/members/pmprasad.jpg
old mode 100755
new mode 100644
index 23f173968..17dd39413
Binary files a/assets/media/community/members/pmprasad.jpg and b/assets/media/community/members/pmprasad.jpg differ