RAG’s Long-standing Challenge: Synonyms

Handling synonyms is a critical challenge in Retrieval-Augmented Generation (RAG) systems, impacting both retrieval accuracy and generated responses. Allganize addresses this by integrating synonym dictionaries and prompt engineering, significantly improving search recall and response quality. Applying synonym handling across retrieval and generation ensures accurate, domain-specific, and user-friendly AI outputs.

1. Problem

In the fields of Information Retrieval (IR) and Natural Language Processing (NLP), handling synonyms has long been a critical challenge. This issue is particularly significant in specialized domains like healthcare, law, technology, and finance, where the same concept is often expressed in various ways. Failing to accurately match these expressions can lead to missing crucial information.

For example, in healthcare, "Electronic Medical Records" and "EMR" refer to the same concept but are expressed differently. If such synonyms are not managed, users may fail to retrieve desired information.

In Retrieval-Augmented Generation (RAG) systems, this synonym issue significantly impacts both retrieval accuracy and the quality of generated answers. RAG operates in two main stages:

Retriever: Locates documents relevant to the user’s query.
Generator: Generates answers based on the retrieved documents.

If the Retriever cannot handle synonyms, it may fail to find the correct documents, resulting in the Generator producing inaccurate or incomplete answers. This means the quality of answers can depend on how the question is phrased.

Example Question:
What are the potential issues when a patient directly views their electronic medical records?

Relevant Document (Correct Source):
Allowing patients to view their EMRs is a crucial step toward patient-centered care, but proper education and security measures are essential.

In this case, the user uses the term “electronic medical records,” but the document uses “EMR.” A keyword-based search method like BM25 may fail to recognize “EMR” as equivalent to “electronic medical records,” potentially missing the document entirely. As a result, the user cannot find the relevant document, and the system generates an incorrect answer.

Without proper synonym handling in RAG systems, important information may be overlooked.

1-1. Retriever

The implementation of the Retriever can be categorized into three methods: Keyword Search, Semantic Search, and Hybrid Search.

Keyword Search

In keyword-based search, handling synonyms like “electronic medical records” and “EMR” is often problematic. Traditional keyword matching algorithms, such as BM25, only retrieve exact matches. For instance, searching for “electronic medical records” would not return results containing “EMR,” as synonym handling is not supported.

NLP - Text Encoding: Word2Vec - AI PROJECTS

‍

Hybrid Search

Hybrid Search combines Keyword Search and Semantic Search, leveraging the strengths of both. As a result, it handles synonyms more effectively than keyword-based search but slightly less so than purely semantic search:
Keyword Search < Hybrid Search < Semantic Search in terms of synonym handling.

1-2. Generation

Even if the Retriever finds the correct document, the Generator may still fail to provide an accurate answer if it does not properly handle synonyms. For instance:

If a document contains the term “EMR,” but the Generator does not correctly interpret or convert it to “electronic medical records,” the user may receive an unclear or incomplete response.

To address this, Prompt Engineering can be used to explicitly guide the Generator in handling synonyms. For example, prompts can include instructions to recognize and harmonize various synonymous terms.

‍2. Solution

2-1. Retriever

A key solution to the synonym issue in the Retriever is to build and integrate a synonym dictionary into the search system. In search engines like Elasticsearch (ES), queries can be transformed to include synonyms, expanding the scope of the search.

When generating a search request (query), the system can map the user’s input keyword to a list of synonymous terms for retrieval. For example:

If a user searches for “electronic medical records,” the system can automatically translate it to include terms like “electronic medical records,” “EMR,” and “Electronic Medical Record.”

This approach ensures that the search encompasses all relevant expressions present in the document index.

Example of Synonym-based Query Construction:

User input: electronic medical records
Expanded query: ["electronic medical records", "EMR", "Electronic Medical Record"]

This configuration allows the system to retrieve documents containing any of these terms, significantly improving recall and accuracy.

Enhancing Search Performance with Synonym Dictionaries

The effectiveness of this approach depends on the quality of the synonym dictionary. A well-constructed dictionary tailored to specific domains can significantly improve search performance. Synonym dictionaries can be built either manually or automatically, ensuring that search systems capture domain-specific terminology.

Allganize’s Alli, for instance, combines domain-specific synonym dictionaries with user-managed dictionaries. This dual approach allows the system to continuously update, incorporating new terms and expressions as they emerge.‍

2-2. Generation

In the Generation stage, accurately reflecting synonyms is equally crucial. Even if the correct document is retrieved during the search, the generation model may fail to produce clear and precise answers if it does not recognize or utilize synonyms effectively.

To address this, synonyms relevant to the user's query can be explicitly added to the prompt. By doing so, the generation model can better interpret and incorporate synonymous terms into its responses.

Example of a Synonym-Enhanced Prompt:

User Input:
What are the risks of patients directly accessing their electronic medical records?
Augmented Prompt for the Generator:
"Electronic medical records (EMR) and related terms: Explain the risks of patients directly accessing their EMRs."

By incorporating synonymous terms into the prompt, the model is guided to understand and reflect relevant expressions, ensuring more accurate and user-friendly answers.

Example of Synonym Application in Alli's Prompt

Here is an example of how Alli integrates synonyms into prompts to enhance the accuracy and clarity of generated answers:

User Input:
What are the risks of patients accessing their electronic medical records?

Synonym-Enhanced Prompt Applied in Alli:
"The term 'electronic medical records' (also known as EMR or Electronic Medical Record) refers to patient medical data stored electronically. Based on this, explain the risks of patients accessing their EMRs."

By explicitly incorporating synonyms such as “EMR” and “Electronic Medical Record” into the prompt, Alli ensures that the generation model recognizes equivalent terms. This approach helps the model provide comprehensive and contextually accurate answers, improving user satisfaction.

3. Experiment

The experiment was conducted using 21 questions and corresponding documents. For each test:

Retriever extracted the top 3 results.
Generation produced answers based on the retrieved results, and human evaluators performed qualitative assessments.

The experiment consisted of three versions:

v1: No synonym handling applied (baseline performance).
v2: Synonym handling applied only at the Retriever stage.
v3: Synonym handling applied at both the Retriever and Generation stages.

Below are the specific configurations and results for each version.

v1: No Synonym Handling (Baseline)

Retriever: Standard keyword search without synonym mapping.
Generator: Generated answers based only on retrieved documents, without consideration of synonyms.

Outcome:

Low recall and answer accuracy.
Retrieval and generation were heavily impacted by differences in terminology.

v2: Synonym Handling in Retriever Only

Retriever: Included synonym mapping using a synonym dictionary to expand queries.
Generator: No explicit synonym handling; responses based on retrieved documents.

Outcome:

Improved recall due to broader query coverage.
Some issues remained in generation accuracy, as the generator did not account for synonym equivalence.

v3: Synonym Handling in Both Retriever and Generation

Retriever: Synonym mapping applied as in v2.
Generator: Synonym information integrated into the prompt to improve understanding and answer accuracy.

Outcome:

Significant improvement in both recall and answer quality.
Answers were more accurate and aligned with user intent, as both stages effectively accounted for synonym relationships.

Summary of Results

Applying synonym handling in both the Retriever and Generation stages (v3) yielded the best performance. It demonstrated the importance of addressing synonyms throughout the entire RAG pipeline to ensure high-quality search and response generation.

‍

4. Conclusion

Allganize effectively addresses synonym challenges in RAG systems by adopting strategies such as synonym dictionary construction and prompt engineering. These efforts have improved search accuracy and the quality of responses to user queries.

By combining domain-specific synonym dictionaries with user customization capabilities, Allganize provides a robust solution to complex search and response challenges in real-world business environments.

For enterprises seeking a reliable RAG system, Allganize is a trustworthy partner offering cutting-edge solutions to enhance search and response effectiveness. To know more contact us today.

What Is Enterprise Search? The True Backbone of Intelligent Knowledge Workflows

Allganize at InfoTech LIVE 2025: Navigating Uncertainty with Agentic AI for a Data-Driven Future

What Is MCP (Model Context Protocol)? Understanding Its Role in AI Infrastructure

RAG’s Long-standing Challenge: Synonyms

1. Problem

1-1. Retriever

Hybrid Search

1-2. Generation

‍2. Solution

2-1. Retriever

Enhancing Search Performance with Synonym Dictionaries

2-2. Generation

Example of Synonym Application in Alli's Prompt

3. Experiment

4. Conclusion