Guide Startups

Chunkr

Winter 2024 · Active

Open source API service to parse complex documents

Founded: 2023

Team size: 3

About

Battle-tested + highly modular vision infrastructure to convert PDFs, PPTs, Word, Excel, PNG, and JPEGs into LLM-ready data. We started by building lumina.sh - where we needed to parse ~600M pages of scientific literature. The researchers didn't care - but devs wanted our ingestion pipeline. So we built chunkr instead. We offer high quality layout analysis, OCR, bounding boxes, granular VLM controls, semantic chunking, and all the last mile engineering that goes into building standout AI applications. Common use-cases include RAG, and automating document workflows like invoices/medical reports -> database.

Founders

  • Mehul ChaddaCo-founder & CEO

    Co-founder & CEO at Chunkr. I have a background in metrology and a bsc in materials engineering. I work on helping computers read documents now.

  • Ishaan KapoorFounder

    co-founder @ chunkr

  • Akhilesh SharmaFounder

    Co-Founder @ Lumina I am a mechanical engineer from the University of Illinois Urbana Champaign. I have experience in robotics and as a cloud solutions architect.