Show HN: GibRAM, an in-memory ephemeral GraphRAG runtime for retrieval (github.com/gibram-io)
60 points by ktyptorio 1 day ago | 10 comments
Hi HN,

I have been working with regulation-heavy documents lately, and one thing kept bothering me. Flat RAG pipelines often fail to retrieve related articles together, even when they are clearly connected through references, definitions, or clauses.

After trying several RAG setups, I subjectively felt that GraphRAG was a better mental model for this kind of data. The Microsoft GraphRAG paper and reference implementation were helpful starting points. However, in practice, I found one recurring friction point: graph storage and vector indexing are usually handled by separate systems, which felt unnecessarily heavy for short-lived analysis tasks.

To explore this tradeoff, I built GibRAM (Graph in-buffer Retrieval and Associative Memory). It is an experimental, in-memory GraphRAG runtime where entities, relationships, text units, and embeddings live side by side in a single process.
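To make "entities, relationships, text units, and embeddings side by side in a single process" concrete, here is a minimal sketch of that kind of unified in-memory layout. All names here are illustrative, not GibRAM's actual types:

```python
from dataclasses import dataclass, field

# Illustrative sketch only; GibRAM's real data structures will differ.
@dataclass
class Entity:
    name: str
    embedding: list[float]                       # vector lives next to the graph node
    text_unit_ids: list[str] = field(default_factory=list)

@dataclass
class Session:
    """Everything for one analysis session, held in one process's memory."""
    entities: dict[str, Entity] = field(default_factory=dict)
    relationships: dict[str, set[str]] = field(default_factory=dict)  # adjacency list
    text_units: dict[str, str] = field(default_factory=dict)

    def add_relationship(self, a: str, b: str) -> None:
        # Undirected edge: store both directions in the adjacency list.
        self.relationships.setdefault(a, set()).add(b)
        self.relationships.setdefault(b, set()).add(a)
```

The point is that there is no cross-system boundary: a graph hop and a vector lookup both touch the same process-local structures.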

GibRAM is intentionally ephemeral. It is designed for exploratory tasks like summarization or conversational querying over a bounded document set. Data lives in memory, scoped by session, and is automatically cleaned up via TTL. There are no durability guarantees, and recomputation is considered cheaper than persistence for the intended use cases.
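The TTL-scoped session model described above might look roughly like this (a hypothetical sketch, not GibRAM's actual session API):

```python
import time

# Hypothetical sketch of TTL-scoped, non-durable sessions.
class SessionStore:
    def __init__(self, ttl_seconds: float):
        self.ttl = ttl_seconds
        self._sessions = {}  # session_id -> (created_at, data)

    def put(self, session_id: str, data) -> None:
        self._sessions[session_id] = (time.monotonic(), data)

    def get(self, session_id: str):
        entry = self._sessions.get(session_id)
        if entry is None:
            return None
        created, data = entry
        if time.monotonic() - created > self.ttl:
            # Expired: drop it; the caller recomputes rather than restoring
            # from disk, since recomputation is cheaper than persistence here.
            del self._sessions[session_id]
            return None
        return data
```

Nothing survives an expiry or a process restart, which matches the "no durability guarantees" framing.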

This is not a database and not a production-ready system. It is a casual project, largely vibe-coded, meant to explore what GraphRAG looks like when memory is the primary constraint instead of storage. Technical debt exists, and many tradeoffs are explicit.

The project is open source, and I would really appreciate feedback, especially from people working on RAG, search infrastructure, or graph-based retrieval.

GitHub: https://github.com/gibram-io/gibram

Happy to answer questions or hear why this approach might be flawed.





The separate graph and vector storage can indeed add overhead for short-lived tasks. I've found that using a dual-memory architecture, where episodic and semantic memories coexist, can streamline this process and reduce complexity. If you're interested in seeing how this could work, I put together some tutorials on similar setups: https://github.com/NirDiamant/agents-towards-production

Out of curiosity, did you settle on that name before or after the RAM availability/price issues?

Actually, the name definitely came after noticing RAM prices. The idea of keeping the graph in memory only for ephemeral RAG sessions came first, though; we won't pretend the naming wasn't influenced by RAM being in the spotlight.

GrrHDD

Very cool, kudos

Where might one see more about what type of indexing you do to get the graph?



Exactly, thank you. It's still LLM-based extraction.

how do you search the graph network?

There are two steps:

Vector search (HNSW): Find top-k similar entities/text units from the query embedding

Graph traversal (BFS): From those seed entities, traverse relationships (up to 2 hops by default) to find connected entities

This catches both semantically similar entities AND structurally related ones that might not match the query text.

Implementation: https://github.com/gibram-io/gibram/blob/main/pkg/engine/eng...
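The two steps above can be sketched roughly like this. Brute-force cosine similarity stands in for the HNSW index, and the function and parameter names (`retrieve`, `adjacency`) are illustrative, not GibRAM's API:

```python
from collections import deque
from math import sqrt

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query_vec, embeddings, adjacency, k=3, hops=2):
    # Step 1: vector search. Brute-force top-k here; an HNSW index
    # gives approximately the same seeds without the linear scan.
    seeds = sorted(embeddings,
                   key=lambda e: cosine(query_vec, embeddings[e]),
                   reverse=True)[:k]

    # Step 2: BFS from the seed entities, up to `hops` hops out,
    # pulling in structurally connected nodes.
    found = set(seeds)
    frontier = deque((s, 0) for s in seeds)
    while frontier:
        node, depth = frontier.popleft()
        if depth == hops:
            continue
        for nbr in adjacency.get(node, ()):
            if nbr not in found:
                found.add(nbr)
                frontier.append((nbr, depth + 1))
    return found
```

A query that embeds near one entity will also surface its 2-hop neighbors, which is how structurally related nodes that never match the query text get retrieved.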


This is how I did it a few years back while working for a set store company. It works well.


