Enable the API cost tracking in Synthesizer #1406

chuqingG · 2025-03-02T05:04:58Z

Add a new parameter, cost_tracking (False by default), to configure and enable complete cost tracking. A use case is shown below:

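from deepeval.dataset import EvaluationDataset
from deepeval.synthesizer import Synthesizer

synthesizer = Synthesizer(model="gpt-4o-mini", cost_tracking=True)  # cost_tracking defaults to False
goldens = synthesizer.generate_goldens_from_docs(document_paths=doc_paths)  # doc_paths: list of document file paths
dataset = EvaluationDataset(goldens=goldens)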

The output looks like:

penguine-ip · 2025-03-03T06:05:39Z

@chuqingG this looks great, and you reformatted too! will go over it in the next day and merge it asap, thanks!

penguine-ip · 2025-03-03T06:06:55Z

@chuqingG one question: what is the reset cost for? I think the cumulative cost will scare a lot of people

chuqingG · 2025-03-03T14:58:05Z

> @chuqingG one question: what is the reset cost for? I think the cumulative cost will scare a lot of people

@penguine-ip I added reset_cost because generate_goldens_from_context() can be called either directly by the user or internally by generate_goldens_from_docs(). My intention is for the reported API cost to cover exactly one end-to-end synthesis call.

In other words, when generate_goldens_from_context is called from generate_goldens_from_docs, the API cost is accumulated but not printed inside generate_goldens_from_context. At the end of generate_goldens_from_docs, the accumulated cost is printed, covering generate_contexts, generate_goldens_from_context, and the quality check.
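The accumulate-then-report behaviour described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not deepeval's implementation: the class name SketchSynthesizer, the attributes synthesis_cost and reports, the _reset_cost keyword, and the per-call cost figures are all hypothetical, and the quality check step is left out for brevity.

```python
class SketchSynthesizer:
    """Hypothetical stand-in for illustration; not deepeval's Synthesizer."""

    def __init__(self, cost_tracking: bool = False):
        self.cost_tracking = cost_tracking
        self.synthesis_cost = 0.0
        self.reports = []  # every printed cost line is also recorded here

    def _add_cost(self, amount: float) -> None:
        # Accumulate only when tracking is enabled.
        if self.cost_tracking:
            self.synthesis_cost += amount

    def _report_cost(self) -> None:
        if self.cost_tracking:
            line = f"Total API cost: {self.synthesis_cost:.4f} USD"
            self.reports.append(line)
            print(line)

    def generate_goldens_from_context(self, contexts, _reset_cost=True):
        # Called directly: start from zero and report at the end.
        # Called from generate_goldens_from_docs: keep accumulating, stay silent.
        if _reset_cost:
            self.synthesis_cost = 0.0
        goldens = []
        for context in contexts:
            self._add_cost(0.002)  # made-up cost of one generation call
            goldens.append(f"golden for {context}")
        if _reset_cost:
            self._report_cost()
        return goldens

    def generate_goldens_from_docs(self, document_paths):
        self.synthesis_cost = 0.0
        contexts = []
        for path in document_paths:
            self._add_cost(0.001)  # made-up cost of context generation
            contexts.append(f"context from {path}")
        goldens = self.generate_goldens_from_context(contexts, _reset_cost=False)
        self._report_cost()  # single report covering the whole call
        return goldens


synthesizer = SketchSynthesizer(cost_tracking=True)
synthesizer.generate_goldens_from_docs(["a.pdf", "b.pdf"])
synthesizer.generate_goldens_from_context(["standalone context"])
```

In this sketch each public entry point produces exactly one cost line, so a direct call never inherits cost left over from an earlier run.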

P.S. On scaring people: in my test, processing 40 PDF files (1 to 12 pages each) with gpt-4o-mini cost ~0.2 USD, so it would be ~2 USD for o1-mini and ~30 USD for gpt-4o. For price-sensitive scenarios, I also modified the quality check part to let small cheap models work smoothly in my personal branch. Feel free to let me know if you think it's okay to also merge this feature here!
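For a rough sense of scale, the figures in this comment can be turned into a per-file estimate. The sketch below is illustrative arithmetic only: the function name projected_cost is hypothetical, and the multipliers are simply the ratios implied by the estimates above (~2 USD and ~30 USD against ~0.2 USD), not official price ratios.

```python
# Figures reported in the thread for one test run with gpt-4o-mini.
FILES_PROCESSED = 40
TOTAL_COST_USD = 0.2

# Multipliers implied by the thread's own estimates for the same workload;
# they are assumptions taken from the comment, not published pricing.
IMPLIED_MULTIPLIER = {"gpt-4o-mini": 1, "o1-mini": 10, "gpt-4o": 150}


def projected_cost(num_files: int, model: str) -> float:
    """Scale the observed per-file cost linearly to another workload and model."""
    per_file = TOTAL_COST_USD / FILES_PROCESSED
    return per_file * num_files * IMPLIED_MULTIPLIER[model]


for model in IMPLIED_MULTIPLIER:
    print(f"{model}: ~{projected_cost(FILES_PROCESSED, model):.2f} USD for {FILES_PROCESSED} files")
```

Linear scaling by file count is itself an assumption; real cost depends on document length and on how many goldens are generated per context.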

Thanks!

penguine-ip · 2025-03-03T19:39:23Z

@chuqingG thanks for the detailed explanation, definitely ok and love the money bag emoji! Thanks :)

penguine-ip · 2025-03-03T19:40:08Z

@chuqingG for this "I also modified the quality check part to let small cheap models work smoothly in my personal branch."

What was it you had to modify? Any chance it can be used for deepeval?

chuqingG · 2025-03-03T19:49:51Z

> @chuqingG for this "I also modified the quality check part to let small cheap models work smoothly in my personal branch."
>
> What was it you had to modify? Any chance it can be used for deepeval?

@penguine-ip Thanks for the response, I will organize it and make another PR in the next day!

chuqingG added 3 commits March 1, 2025 17:00

enable cost tracking for synthesizer (874bab0)
Merge branch 'main' of github.com:chuqingG/deepeval (f6bf569)
reformat (499c64c)

penguine-ip merged commit 96bb890 into confident-ai:main Mar 3, 2025
8 of 11 checks passed
