Zep recently launched!

Launch YC: Zep: Fast, accurate structured data extraction for AI assistant apps

"Memory for AI Apps"

10x faster than GPT-4o, with field format and validity guarantees.


Founded by Daniel Chalef

Many business and consumer apps must extract structured data from conversations between an LLM-powered Assistant and a human user. Often, the extracted data is the objective of the conversation. Consider completing a sales order, making a reservation, or requesting leave. All of these tasks require progressively collecting data from the conversation.

Latency and correctness are important. On each turn, you will often want to identify which data values you have already collected and which are still missing, then prompt the LLM to request the missing ones.
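As a concrete illustration of this pattern (not Zep's API; the schema and helper below are hypothetical), the goal of such a conversation can be modeled as a schema whose fields fill in turn by turn:

```python
from dataclasses import dataclass, fields
from typing import Optional

@dataclass
class ReservationDetails:
    # Every field starts empty and is filled in as the conversation progresses.
    guest_name: Optional[str] = None
    party_size: Optional[int] = None
    reservation_date: Optional[str] = None  # e.g. "2024-08-30"
    reservation_time: Optional[str] = None  # e.g. "19:00"
    phone: Optional[str] = None

def missing_fields(details: ReservationDetails) -> list:
    """Names of the fields still needed to complete the task."""
    return [f.name for f in fields(details) if getattr(details, f.name) is None]

partial = ReservationDetails(guest_name="Ada", party_size=4)
print(missing_fields(partial))  # → ['reservation_date', 'reservation_time', 'phone']
```

The list of still-empty fields is exactly what you would feed back to the LLM as its next objective.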

https://youtu.be/k8e8NsoVzFo


If you’re making multiple calls to an LLM to extract and validate data on every chat turn, you’re likely adding significant latency to your response. This can be a slow and inaccurate exercise, frustrating your users.


The Solution

Zep’s new Structured Data Extraction feature is a low-latency, high-accuracy tool for extracting the data you need from Chat History stored in Zep's Long-term Memory service.

It is up to 10x faster than gpt-4o. For many multi-field extraction tasks, you can expect latency of under 400ms, with additional fields increasing latency sub-linearly.

Comparing Zep with LLM JSON Mode

Many model providers offer a JSON inference mode that guarantees the output will be well-formed JSON.

However:

  • There are no guarantees that the field values will conform to the JSON Schema you define or that they are correct (vs. being hallucinated).
  • All fields are extracted in a single inference call, with each additional field increasing extraction latency linearly or worse.

Preprocessing, Guided LLM Output, and Validation

To ensure that the extracted data is in the format you expect and is valid given the current dialog, Zep uses a combination of:

  • dialog preprocessing, which, among other things, improves accuracy for machine-transcribed dialogs;
  • guided output inference techniques on LLMs running on Zep's own infrastructure;
  • post-inference validation.

When using a Zep field type such as email, zip code, or date time, you will never receive data back in an invalid format.
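As a rough sketch of what post-inference format checks can look like (illustrative stdlib Python, not Zep's actual validators), each typed field gets a check that either passes or rejects the extracted value:

```python
import re
from datetime import datetime

def valid_email(value: str) -> bool:
    # Simplified pattern; real-world email validation is more involved.
    return re.fullmatch(r"[^@\s]+@[^@\s]+\.[^@\s]+", value) is not None

def valid_us_zip(value: str) -> bool:
    # Five digits, optionally followed by a ZIP+4 extension.
    return re.fullmatch(r"\d{5}(-\d{4})?", value) is not None

def valid_iso_datetime(value: str) -> bool:
    # Accepts ISO-8601 strings such as "2024-08-28T15:00".
    try:
        datetime.fromisoformat(value)
        return True
    except ValueError:
        return False
```

A value that fails its check is discarded rather than returned, which is what makes the format guarantee possible.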

While there are limits to the extraction accuracy when the conversation is very nuanced or ambiguous, carefully crafting field descriptions can achieve high accuracy in most cases.

Up to 10x Faster than OpenAI gpt-4o

When comparing like-for-like JSON Schema model extraction against gpt-4o, Zep is up to 10x faster.

Image Credits: Zep

Zep's extraction latency scales sub-linearly with the number of fields in your model. That is, you may add additional fields with a low marginal increase in latency.

Support for Partial and Relative Dates

Zep understands various date and time formats, including relative times such as “yesterday” or “last week.” It can also parse partial dates and times, such as “at 3pm” or “on the 15th.”
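To make the idea concrete, here is a toy resolver for a few relative and partial date phrases, anchored to a reference date (illustrative only; Zep's parser covers far more variation than this):

```python
from datetime import date, timedelta

def resolve_relative_date(phrase: str, today: date) -> date:
    """Map a small set of relative/partial date phrases to a concrete date."""
    phrase = phrase.lower().strip()
    if phrase == "today":
        return today
    if phrase == "yesterday":
        return today - timedelta(days=1)
    if phrase == "last week":
        return today - timedelta(weeks=1)
    if phrase.startswith("on the "):
        # Partial date: "on the 15th" resolves within the current month.
        day = int("".join(ch for ch in phrase if ch.isdigit()))
        return today.replace(day=day)
    raise ValueError(f"unrecognized phrase: {phrase!r}")
```

Note that resolving "yesterday" or "on the 15th" is only well-defined relative to when the message was sent, which is why the reference date is an explicit argument.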

Extracting from Speech Transcripts

Zep can understand and extract data from machine-generated speech transcripts. Spelled-out numbers and dates are parsed as if they had been written as numerals. Filler utterances such as “uh” or “um” are ignored.
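A minimal sketch of this kind of transcript preprocessing (illustrative, not Zep's implementation): drop filler words and map spelled-out digits to numerals:

```python
WORD_TO_DIGIT = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
}
FILLERS = {"uh", "um", "er", "hmm"}

def normalize_transcript(text: str) -> str:
    """Strip fillers and rewrite spelled-out digits as numerals."""
    out = []
    for token in text.lower().split():
        word = token.strip(",.?!")
        if word in FILLERS:
            continue  # discard "uh", "um", etc.
        out.append(WORD_TO_DIGIT.get(word, word))
    return " ".join(out)

print(normalize_transcript("My zip is, uh, nine four one zero three"))
# → my zip is 9 4 1 0 3
```

After a pass like this, a downstream extractor sees "9 4 1 0 3" where the raw transcript said "nine four one zero three".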


Using Progressive Data Extraction To Guide LLMs

Your application may need to collect several fields to accomplish a task. Since Zep's SDE is so fast, you can guide the LLM through this process by calling the extractor on every chat turn. This enables you to identify which fields are still needed and then direct the LLM to collect the remaining data.
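The per-turn loop described above might look like this sketch, where `extract_fields` and `ask_llm` are hypothetical placeholders for your extractor and chat-model calls (neither is a real API from this post):

```python
REQUIRED_FIELDS = ["name", "date", "party_size"]

def next_assistant_turn(chat_history, extract_fields, ask_llm):
    """Extract on every turn, then steer the LLM toward the missing fields."""
    extracted = extract_fields(chat_history)  # fast enough to run each turn
    missing = [f for f in REQUIRED_FIELDS if not extracted.get(f)]
    if not missing:
        return None, extracted  # all data collected; no follow-up needed
    instruction = "Politely ask the user for: " + ", ".join(missing) + "."
    return ask_llm(chat_history, instruction), extracted
```

The key point is that extraction runs on every turn, so the instruction to the LLM always reflects exactly which fields remain.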


Learn More

🌐 Visit www.getzep.com to learn more
🧠 Learn more in the Zep Structured Data Guide

📝 Sign up for Zep's Long-term Memory Service for AI Assistants

👣 Follow Zep on LinkedIn

Posted August 28, 2024 in the Launch category.