Write a PREreview

High-Performance Vector Database

by Abiodun Oketunji and Kyriakos Gkikas

Posted: July 30, 2025
Server: Preprints.org
DOI: 10.20944/preprints202507.2499.v1

This paper presents a study of a high-performance vector database implementation in Go, addressing the growing need for efficient similarity search systems in machine learning and artificial intelligence applications. The research contributes a novel architecture that combines multiple indexing strategies including linear search, Locality-Sensitive Hashing (LSH), and Inverted File (IVF) indexing within a unified framework. Our implementation demonstrates superior performance characteristics compared to existing solutions, achieving sub-millisecond query times for datasets containing up to 100,000 high-dimensional vectors. The system architecture incorporates advanced concurrency patterns, memory management optimisations, and a RESTful API design that ensures scalability and maintainability. Extensive empirical evaluation across different workloads and vector dimensions validates the effectiveness of our approach, with particular emphasis on real-world machine learning scenarios involving embedding similarity search. The research provides both theoretical analysis of the implemented algorithms and practical guidelines for deployment in production environments.

You can write a PREreview of High-Performance Vector Database. A PREreview is a review of a preprint and can vary from a few sentences to a lengthy report, similar to a journal-organized peer-review report.

Before you start

We will ask you to log in with your ORCID iD. If you don’t have an iD, you can create one.