Morphik recently launched!

Launch YC: Morphik: Open‑source multimodal RAG that works on technical docs

"Your Data. Your Intelligence. No Hallucinations"

TL;DR
  • Problem: Knowledge workers spend a lot of their time searching complex documentation with diagrams, tables, or spreadsheets, just to find relevant information. Text‑only RAG misses details, hallucinates, and provides half-baked responses.
  • Solution: Morphik embeds whole pages (image + text, no OCR), auto‑builds a knowledge graph, and serves it through a UI, SDK, or REST API. They have 90 % accuracy on arXiv QA (and they can fine-tune for your use case 🙂).

Founded by Adityavardhan Agrawal & Arnav Agrawal

Watch their demo video: https://www.youtube.com/watch?v=3adZQhm5JZY

Problem

RAG breaks the moment documents stop being blobs of text.

  • Knowledge workers spend a lot of their time (people they spoke with said 50-70% time just in the research phase). eg. electrical engineers finding perfect PCB spec, pharma companies trying to search ELNs for similar experiments.
  • 80% of enterprise knowledge lives in diagrams, tables, images, CAD, scanned pdfs, not .txt.
  • “Multimodal” models still hallucinate or miss key context because the retrieval step sees only text, or they rely on tools to retrieve diagrams, or context. (https://docs.morphik.ai/blogs/gpt-vs-morphik-multimodal)
  • Teams glue together OCR, vector DBs, and regex‑heavy ETL that becomes a brittle monstrosity.

Solution

Directly embedding and search over page images

Image Credits: Morphik

When you ingest in Morphik, in addition to OCR, they create direct embeddings for the image of a page (or frames in a video). When you ask “Colpali comparison to standard RAG”, the LLM sees the actual page image, so multimodal models can reason over arrows, labels, scales, and colours just like a human.

Rules Engine

Image Credits: Morphik

When ingesting, you can define rules for data extraction (which would again consider tables, diagrams), or define natural language rules for PII redaction, or content transformation.

Deployment ready, scalable

Morphik comes built in with folder scoping, end user scoping, GPU accelerations, and fine grained permissions, so you can prototype and deploy with enterprise grade features. It’s end‑to‑end, so you don’t scramble to make duct tape fixes.

One more thing…Morphik Research Agent

morphik gif

Their research agent uses Morphik’s retrieval, graph, and extraction APIs as tools. It chains them automatically, jumping from doc to doc, node to node, seeing figures, till it finds the needle in the haystack, or has enough context to answer your questions. “Describe the third pin on the USBC21 diagram” will get the diagram, find the pin, jump to the table with pin descriptions, jump to page 13 to “find more details on page 13”.

Why we might be interesting to you

  • Building an internal ChatGPT over messy manuals? Morphik is the fastest way to ship something users trust.
  • Developing an AI product? Swap your vector‑DB‑plus‑duct‑tape stack for a single SDK call.
  • Researching graph‑RAG? CAG? We have it all.


Learn More

🌐 Visit www.morphik.ai to learn more.
🙏  Introductions to teams drowning in technical docs (healthcare, manufacturing, legal, electrical engineers).
🤝 Tell us what still hurts in your RAG stack, comment, DM, email the founders here, or book a call.
⭐ Star / watch https://github.com/morphik-org/morphik-core – open source momentum matters.
👣 Follow Morphik on LinkedIn & X.

Posted 
May 9, 2025
 in 
Launch
 category
← Back to all posts  

Join Our Newsletter and Get the Latest
Posts to Your Inbox

No spam ever. Read our Privacy Policy
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.