Chunkr
Winter 2024 · Active
Open source API service to parse complex documents
Founded: 2023
Team size: 3
About
Battle-tested + highly modular vision infrastructure to convert PDFs, PPTs, Word, Excel, PNG, and JPEGs into LLM-ready data.
We started by building lumina.sh - where we needed to parse ~600M pages of scientific literature. The researchers didn't care - but devs wanted our ingestion pipeline. So we built chunkr instead.
We offer high quality layout analysis, OCR, bounding boxes, granular VLM controls, semantic chunking, and all the last mile engineering that goes into building standout AI applications. Common use-cases include RAG, and automating document workflows like invoices/medical reports -> database.
Founders
Mehul ChaddaCo-founder & CEO
Co-founder & CEO at Chunkr. I have a background in metrology and a bsc in materials engineering. I work on helping computers read documents now.
Ishaan KapoorFounder
co-founder @ chunkr
Akhilesh SharmaFounder
Co-Founder @ Lumina I am a mechanical engineer from the University of Illinois Urbana Champaign. I have experience in robotics and as a cloud solutions architect.