Long term memory for LLMs

This project implements a Retrieval-Augmented Generation (RAG) system using LangChain and Chroma, with OpenAI APIs for language understanding. It maintains long-term, user-specific memory, enabling the system to store, retrieve, and selectively delete past memory based on relevance or demand.

The LLM identifies the pieces of information that are worth keeping or removing, while memory metadata and IDs are persistently stored using SQLite, CSV, or JSON for tracking.

The architecture supports chunked documents, semantic search, and conversational querying, making it adaptable for chatbots, personal assistants, where specific information needs to be retained over time.

Check test_ouput.md for tested prompts and simulataneous memory updates that happen.

Check system_design.md for high level architecture and other design details.

Instructions

Download the repository

Go to a terminal and paste git clone https://github.com/santosh-gs/llm-memory-recovery.git

OpenAI API Key

Before you begin, set up an OpenAI account and generate a new key. You will need to put your credentials in a .env file.

Go to https://platform.openai.com/api-keys and create an OpenAI key.
Create a .env file in your project directory and add OPENAI_API_KEY="your_generated_secret_key".
The .gitignore file will ensure that your API key remain in your local system.

Running the application

Inside a terminal, run the following:

Navigate to the cloned folder using cd /llm-memory-recovery
Create a virtual environment using python -m venv myenv or pyhon3 -m venv myenv for Mac or Linux systems (Optional if you already have the required libraries)
Activate the virtual enviroment (if any) using
myenv\Scripts\activate (if using Windows Command Prompt)
.\myenv\Scripts\Activate.ps1 (if using Windows PowerShell)
source myenv/Scripts/activate (if using Git Bash in Windows or Bash in UNIX systems)
Install dependencies pip install -r requirements.txt
Run python main.py in the terminal or open the folder in a code editor like VS Code

Note: Do not open the persistent_memory_user.csv in excel while running the llm as it locks the file to read-only mode and prevents modifications.
Rather open it in VS Code to track live memory updates.

LLM Instructions

You will asked to prompt your query as follows
Ask your question (q to quit): give your prompt
If the model thinks any info within the prompt is worth remembering. Check system_prompt.txt for detailed analysis that takes place.
Prompt q to quit.

Example Prompts

Hi, I am Santosh. I like PyTorch and TensorFlow. In which years were these frameworks released?
I like apples, bananas, and chess. Which of apple and banana contain more protein per fruit?
Who am I?
What do you know about me? List everything
Which frameworks do I like?
I no longer like TensorFlow, it's so overwhelming.
Which frameworks do I like now?

The memory is stored in persistent_memory_user0.csv in the data folder. As long as the user remains same, the memory persists and can be recalled.

If you want to start over, prompt delete all memory or change USER_ID variable to something like user1.

Prompt Engineering (Response + Action)

Here, I have used rarely used ASCII characters þ and ÿ as separators/delimiters between response and action to be taken regarding memory.
The actions include add delete ignore

`ignore` action

Just respond to query and need not add any info to memory.

`add` action

Add concise summarized key statements to memory.

`delete` action

Delete memory entry that's no longer needed.

Updating Existing Memory Entry

To update an existing memory entry, we use add and delete one after the other
add final desired entry
delete existing entry
e.g. suppose a memory entry can be "User like Cricket and Football" Now if the user no longer likes Football, we run:
add "User likes Cricket" delete "User like Cricket and Football"
Check system_prompt.txt for detailed instructions and prompt template.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
images		images
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
system_design.md		system_design.md
system_prompt.txt		system_prompt.txt
test_output.md		test_output.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Long term memory for LLMs

Instructions

Download the repository

OpenAI API Key

Running the application

LLM Instructions

Example Prompts

Prompt Engineering (Response + Action)

`ignore` action

`add` action

`delete` action

Updating Existing Memory Entry

About

Uh oh!

Languages

License

santosh-gs/llm-memory-recovery

Folders and files

Latest commit

History

Repository files navigation

Long term memory for LLMs

Instructions

Download the repository

OpenAI API Key

Running the application

LLM Instructions

Example Prompts

Prompt Engineering (Response + Action)

ignore action

add action

delete action

Updating Existing Memory Entry

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages

`ignore` action

`add` action

`delete` action