👋 Welcome!

I am Lorenzo — AI Research Engineer — I produce novel research and code leveraging Large Language Models (GPTs and LLMs). I focus on workflows automation with AI Agents and code generation.
Also check out my research on a new generation of vector databases. **Make database think as LLMs think**..

🔬 Explore my research, protocols and Open Source implementations

📝 Blog Posts Collection

Why arrowspace is game-changing for data operations at scale

Test‑bed milestone for a unified vector, graph, and key‑value engine built on spectral indexing and energy‑informed search.

  • Turns any dataset into a features graph, enabling manifold‑aware search, matching, ranking, and dataset characterization at any lifecycle stage.
  • Designed for high dimensions by default: robust on biotech‑scale sequences, large vocabularies, and model‑sized embedding spaces.

Read more →

Efficient GPT training: a dive into the architecture of a Rust-powered GPT-2

Deep Dive into a Rust implementation of a decoder-only transformer inspired by Karpathy's nanochat.

  • Breaks down the architecture of a modern LLM, explaining the role of key components for an experienced audience.
  • Covers modern techniques such as Rotary Position Embeddings (RoPE), Multi-Query Attention (MQA), RMSNorm, and the use of a Squared ReLU in the MLP.

Read more →

DeepSeek-OCR Optical Compression Meets Energy Search: Rust Implementation in ArrowSpace v0.18.0

Rust implementation of DeepSeek-OCR compression achieves 10× token reduction, while ArrowSpace v0.18.0 introduces energy-informed retrieval that replaces cosine similarity with spectral graph properties.

  • DeepEncoder architecture (SAM + CLIP + projector) replicated in Rust using burn.dev with cross-platform GPU support and five resolution modes from 64 to 400 tokens.
  • Energy search with diffusion parameter sweep on CVE corpus achieves NDCG@10 ≈ 0.99 (η=0.05, steps=6) and MRR=1.0 (η=0.05, steps=4) without any cosine similarity.

Read more →

The Next Evolution in AI Memory: Energy-Informed Vector Search

Vector databases have become the backbone of modern AI workflows, particularly in RAG systems. But traditional approaches are fundamentally limited—they miss the deeper structural patterns that define how information relates within domains. Discover how ArrowSpace introduces energy-informed indexing through taumode, enabling AI systems with memory that truly understands domain contexts through spectral signatures and graph Laplacian energy.

Read more →
Dig my previous research at pramantha.net
Curious? Start exploring the pages above or reach out via the Contact page for questions, collaborations, or sponsorship info!