Hi, my name is Ankit Josh.

I build efficient systems.

Working as a Software Engineer at DataCore, contributing to the AI+ product.

Passionate about algorithms, AI, and building scalable systems.

Projects I'm proud of

AI Project

VisionText

Multi-modal image search system using CLIP embeddings and Qdrant vector database. Supports text-to-image and image-to-image search with hybrid ranking pipeline using RRF. Includes batch processing, async API, and a modern web UI. Dockerized for easy deployment. View code.

Technologies:

  • Python
  • FastAPI
  • CLIP (SigLIP)
  • Qdrant
  • Docker
VisionText Image Search.

Systems Project

PyKeyDB

Thread-safe in-memory key-value database with write-ahead logging and async networking. Supports strings, lists, hashes, and sets with ACID transactions. Implements WAL (Write-Ahead Logging) for durability and achieves 84K+ writes/sec and 2.6M+ reads/sec. View code.

Technologies:

  • Python
  • asyncio
  • Threading
  • Write-Ahead Log
PyKeyDB

AI Project

AudioFingerprinting

A Shazam-like audio recognition system. Ingest songs from YouTube, fingerprint them using spectral analysis, and identify them by recording from a microphone. Uses constellation-based hashing with STFT peak detection and time-offset histogram matching for song identification. View code.

Technologies:

  • Python
  • FastAPI
  • librosa
  • SQLite
  • yt-dlp
AudioFingerprinting

Let's connect

Whether you have a project idea or just want to discuss tech, I'd love to hear from you.

Email me