LLM subscriptions vs. APIs value for money

Asad Dhamani

03 Mar 2025 • 6 min read

This is my second post on this topic. The first post is here: ChatGPT Plus, Claude Pro, vs API usage.

Per my calculations, the $20 subscriptions of ChatGPT and Claude provide incredible value for money. Let's quantify it.

Claude Pro ($20 per month)

The Claude Usage Extension, which has been quite accurate in my experience, tells us that the usage limit for Claude Pro resets every 5 hours.

Every 5 hours, you get:

1.6 million tokens for Sonnet
500K tokens for Opus
3 million tokens for Haiku

This is a combined total across input and output, where output is weighted at 10x compared to input. That means 1K output tokens count as 10K tokens used, while 1K input tokens count as 1K tokens used.

Since a 3:1 input to output blend is typical in calculating blended costs for LLM APIs, we'll stick with that ratio.

Based on the numbers, this is how much we can use for each model per 5 hours.

Model	Total Cap (tokens)	Approx. Input Tokens	Approx. Output Tokens	Input Cost (USD)	Output Cost (USD)	Total Cost (USD)
Claude 3 Opus	500,000	~115,384	~38,461	~$1.73	~$2.88	~$4.61
Claude 3.7 Sonnet	1,600,000	~369,231	~123,077	~$1.11	~$1.85	~$2.96
Claude 3.5 Haiku	3,000,000	~692,308	~230,769	~$0.55	~$0.92	~$1.47
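
The split in the table follows from two constraints: input is three times output, and each output token counts ten times against the cap. Here is a minimal Python sketch of that arithmetic. The function names are illustrative, the Sonnet rate is the one quoted later in this post, and the Opus and Haiku per-million rates are assumptions back-derived from the table's cost columns.

```python
# Assumed API prices in USD per million tokens: (input, output).
PRICES = {
    "Claude 3 Opus": (15.00, 75.00),
    "Claude 3.7 Sonnet": (3.00, 15.00),
    "Claude 3.5 Haiku": (0.80, 4.00),
}

# Per-window caps as reported by the Claude Usage Extension.
CAPS = {
    "Claude 3 Opus": 500_000,
    "Claude 3.7 Sonnet": 1_600_000,
    "Claude 3.5 Haiku": 3_000_000,
}


def split_cap(cap, blend=3, output_weight=10):
    """Return (input_tokens, output_tokens) that exactly use up `cap`.

    Solves blend * out + output_weight * out = cap, with input = blend * out.
    """
    output_tokens = cap / (blend + output_weight)
    return blend * output_tokens, output_tokens


def api_cost(input_tokens, output_tokens, price_in, price_out):
    """USD cost of the same tokens at per-million API prices."""
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


def window_value(model):
    """API-equivalent USD value of one maxed-out 5-hour window."""
    input_tokens, output_tokens = split_cap(CAPS[model])
    return api_cost(input_tokens, output_tokens, *PRICES[model])


for model in CAPS:
    print(f"{model}: ${window_value(model):.2f}")
print(f"Total: ${sum(window_value(m) for m in CAPS):.2f}")
```

The printed figures can differ from the table by a cent or so, because the table adds up columns that were already rounded.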

So every 5 hours, if you were maxing out your account, you would be able to use $4.61 + $2.96 + $1.47 = $9.04 worth of AI usage (what the same usage would cost via the API). I am ignoring prompt caching for simplicity. There are 24 / 5 = 4.8 five-hour periods in a day, so if you maxed out every one of them, you'd use ~$43.40 worth per day. For a typical 30-day month, that's ~$1300.

$1300. That's the theoretical MAX value that you could extract out of that $20 plan vs. using the API directly.

Even if you aren't maxing it out, just 10% of that is still $130, which is 6.5x what you paid.
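
To make the extrapolation easy to re-run with your own utilization, here is a small sketch. It takes the $9.04 per-window figure from this section as given; the function names and the break-even helper are my own illustrative additions.

```python
WINDOW_HOURS = 5
DAYS_PER_MONTH = 30
PLAN_PRICE_USD = 20.0
VALUE_PER_WINDOW_USD = 9.04  # sum of the three per-window totals in this section


def monthly_value(value_per_window, window_hours=WINDOW_HOURS, days=DAYS_PER_MONTH):
    """API-equivalent USD value of maxing out every window for a month."""
    windows_per_day = 24 / window_hours  # 4.8 for 5-hour windows
    return value_per_window * windows_per_day * days


def value_multiple(utilization, plan_price=PLAN_PRICE_USD):
    """How many times the plan price you get back at a given utilization (0..1)."""
    return monthly_value(VALUE_PER_WINDOW_USD) * utilization / plan_price


def break_even_utilization(plan_price=PLAN_PRICE_USD):
    """Fraction of the theoretical maximum at which the plan pays for itself."""
    return plan_price / monthly_value(VALUE_PER_WINDOW_USD)


print(f"Max monthly value: ${monthly_value(VALUE_PER_WINDOW_USD):,.2f}")
print(f"Multiple at 10% utilization: {value_multiple(0.10):.1f}x")
print(f"Break-even utilization: {break_even_utilization():.2%}")
```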

Claude Max ($100 per month)

Note: added on 23rd April 2025

Anthropic added a new set of Max plans: $100 for 5x the usage of the Pro plan, and $200 for 20x the usage of the Pro plan. We'll look at the $100 plan.

The Max plan has a session limit, which, as far as I can tell, the Pro plan doesn't.

You are limited to 50 "sessions" per month, and each session is a 5-hour window.

Each 5-hour window gives you a token budget of:

1.6 million tokens for Sonnet x 5 = 8 million tokens for Sonnet
500K tokens for Opus x 5 = 2.5 million tokens for Opus
3 million tokens for Haiku x 5 = 15 million tokens for Haiku

With 50 sessions per month you get:

8 million x 50 = 400 million tokens for Sonnet
2.5 million x 50 = 125 million tokens for Opus
15 million x 50 = 750 million tokens for Haiku

Model	Total Cap (tokens)	Approx. Input Tokens	Approx. Output Tokens	Input Cost (USD)	Output Cost (USD)	Total Cost (USD)
Claude 3 Opus	125Mn	~93Mn	~32Mn	~$1395	~$2400	~$3795
Claude 3.7 Sonnet	400Mn	~300Mn	~100Mn	~$900	~$1500	~$2400
Claude 3.5 Haiku	750Mn	~560Mn	~190Mn	~$448	~$760	~$1208

If you Max out (heh) the $100 Max plan, that's $3795 + $2400 + $1208 = ~$7400 worth of API usage equivalent, 74x the amount you pay.
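
One thing worth checking is how sensitive this figure is to the token split. The Max table splits the raw cap 3:1 between input and output, whereas the Pro section also weighted output at 10x. The sketch below computes the monthly total both ways. The Opus and Haiku rates are assumptions back-derived from the tables, and the function names are illustrative.

```python
# Assumed API prices in USD per million tokens: (input, output).
PRICES = {
    "Claude 3 Opus": (15.00, 75.00),
    "Claude 3.7 Sonnet": (3.00, 15.00),
    "Claude 3.5 Haiku": (0.80, 4.00),
}
PRO_CAPS = {
    "Claude 3 Opus": 500_000,
    "Claude 3.7 Sonnet": 1_600_000,
    "Claude 3.5 Haiku": 3_000_000,
}
MAX_MULTIPLIER = 5       # Max ($100) gives 5x the Pro usage
SESSIONS_PER_MONTH = 50  # session limit on the Max plan


def monthly_cost(model, output_weight):
    """API-equivalent USD for a month of maxed-out sessions on one model.

    output_weight=1 splits the raw cap 3:1 (as in the Max table);
    output_weight=10 counts each output token 10x (as in the Pro section).
    """
    cap = PRO_CAPS[model] * MAX_MULTIPLIER * SESSIONS_PER_MONTH
    output_tokens = cap / (3 + output_weight)
    input_tokens = 3 * output_tokens
    price_in, price_out = PRICES[model]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


def monthly_total(output_weight):
    return sum(monthly_cost(model, output_weight) for model in PRO_CAPS)


print(f"3:1 split of raw cap:      ${monthly_total(1):,.0f}")
print(f"3:1 split with 10x output: ${monthly_total(10):,.0f}")
```

With unrounded token counts the unweighted total comes to about $7,350 (the table rounds to the nearest million tokens, hence ~$7,400), and the 10x-weighted total comes to about $2,260. Either way it is a large multiple of $100.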

If you max out a single 5-hour session using only Sonnet, which is the only model I use, you can use:

6 Mn input tokens x $3 per Mn input tokens = $18
2 Mn output tokens x $15 per Mn output tokens = $30

That's a total of $48 per maxed-out 5-hour session. This is going to make me way more carefree about using Claude vs. the API, and per What's in my AI toolkit, using this with Repo Prompt will save me money on the API too. Using the API can have an inhibitory effect that a fixed-cost subscription can bypass.

I think this is an incredible deal.

ChatGPT Plus ($20 per month)

ChatGPT is more complicated due to the additional services they provide like Advanced Voice Mode, Deep Research, etc.

Looking at just the models, though, per OpenAI:

As of Feb 1, 2025, Plus users will be able to send up to 80 messages every 3 hours on GPT-4o and up to 40 messages every 3 hours on GPT-4. We may reduce the limit during peak hours to keep GPT-4 and GPT-4o accessible to the widest number of people.

Source: https://help.openai.com/en/articles/7864572-what-is-the-chatgpt-model-selector

So 80 messages to GPT-4o and 40 messages to GPT-4 every 3 hours. Both of these models have a context window of up to 32K tokens on ChatGPT Plus.

You also get access to their "o" series reasoning models.

For o3-mini, you get 150 messages per day.

o3‑mini will replace OpenAI o1‑mini in the model picker, offering higher rate limits and lower latency, making it a compelling choice for coding, STEM, and logical problem-solving tasks. As part of this upgrade, we’re tripling the rate limit for Plus and Team users from 50 messages per day with o1‑mini to 150 messages per day with o3‑mini.

Source: https://openai.com/index/openai-o3-mini/

And for o1, you get 50 messages per week.

With a ChatGPT Plus, Team or Enterprise account, you have access to 50 messages a week with OpenAI o1 and 50 messages a day with OpenAI o1-mini.

Source: https://help.openai.com/en/articles/9824962-openai-o1-o1-mini-and-o3-mini-usage-limits-on-chatgpt-and-the-api

Let's look at GPT-4o and GPT-4; we'll skip GPT-4o mini for now.

Assuming the same 3:1 input:output token blend, and assuming we absolutely max out the context window on each message, that'll be 24K input tokens and 8K output tokens per message.

GPT-4o in the API costs $2.5/$10 per mn input / output tokens, whereas GPT-4 costs $30/$60 for the same.

Model	Total Cap (tokens)	Approx. Input Tokens	Approx. Output Tokens	Input Cost (USD)	Output Cost (USD)	Total Cost (USD)
GPT‑4o	2,560,000	1,920,000	640,000	$4.80	$6.40	$11.20
GPT‑4	1,280,000	960,000	320,000	$28.80	$19.20	$48.00

So every 3 hours, if you were maxing out both GPT models, you'd be using $59.20 worth of AI compared with what the same usage would cost via the API.

If you absolutely maxed these out, you'd use 240x the above, as there are 240 three-hour periods in a 30-day month (8 per day x 30 days).

The math for that is as follows:

Model	Total Cap (tokens)	Approx. Input Tokens	Approx. Output Tokens	Input Cost (USD)	Output Cost (USD)	Total Cost (USD)
GPT‑4o	614,400,000	460,800,000	153,600,000	$1,152.00	$1,536.00	$2,688.00
GPT‑4	307,200,000	230,400,000	76,800,000	$6,912.00	$4,608.00	$11,520.00
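
Both GPT tables can be reproduced with a few lines of Python. This is a sketch under the same assumptions as the text: every message fills the 32K context, tokens split 3:1 between input and output, and the API prices are the ones quoted above. The function and constant names are illustrative.

```python
CONTEXT_TOKENS = 32_000   # context window per message on ChatGPT Plus
INPUT_SHARE = 0.75        # 3:1 input:output blend
WINDOWS_PER_MONTH = 30 * 24 // 3  # 240 three-hour windows in a 30-day month

# model: (messages per 3-hour window, USD per Mn input, USD per Mn output)
MODELS = {
    "GPT-4o": (80, 2.50, 10.00),
    "GPT-4": (40, 30.00, 60.00),
}


def window_cost(messages, price_in, price_out, context=CONTEXT_TOKENS):
    """API-equivalent USD for one maxed-out 3-hour window."""
    total_tokens = messages * context
    input_tokens = total_tokens * INPUT_SHARE
    output_tokens = total_tokens - input_tokens
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


per_window = {name: window_cost(*spec) for name, spec in MODELS.items()}
per_month = {name: cost * WINDOWS_PER_MONTH for name, cost in per_window.items()}

for name in MODELS:
    print(f"{name}: ${per_window[name]:.2f} per window, ${per_month[name]:,.2f} per month")
```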

Now let's look at the cost of a single message for o1 and o3-mini.

To be clear, OpenAI says the context window for o1 and o3-mini on the Plus plan is 32K tokens. In my experience there's some ambiguity there, but let's stick with 32K tokens for now.

Model	Total Cap (tokens)	Approx. Input Tokens	Approx. Output Tokens	Input Cost (USD)	Output Cost (USD)	Total Cost (USD)
OpenAI o1	32,000	24,000	8,000	$0.36	$0.48	$0.84
OpenAI o3‑mini	32,000	24,000	8,000	$0.03	$0.04	$0.06

If you max out your quota for the reasoning models, you'd use 4500 messages a month on o3-mini and 210 messages a month on o1.

Model	Total Cap (tokens)	Approx. Input Tokens	Approx. Output Tokens	Input Cost (USD)	Output Cost (USD)	Total Cost (USD)
OpenAI o1	6,720,000	5,040,000	1,680,000	$75.60	$100.80	$176.40
OpenAI o3‑mini	144,000,000	108,000,000	36,000,000	$118.80	$158.40	$277.20
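
The reasoning-model numbers follow the same pattern, just with monthly message quotas instead of 3-hour windows. In this sketch the monthly message counts are the ones used in the text (210 for o1, 150 x 30 for o3-mini), and the per-million prices are assumptions back-derived from the per-message table. The names are illustrative.

```python
CONTEXT_TOKENS = 32_000  # assumed context window per message
INPUT_SHARE = 0.75       # 3:1 input:output blend

# Messages per month at full quota, as used in the text.
MONTHLY_MESSAGES = {"OpenAI o1": 210, "OpenAI o3-mini": 150 * 30}

# Assumed API prices in USD per million tokens: (input, output).
PRICES = {"OpenAI o1": (15.00, 60.00), "OpenAI o3-mini": (1.10, 4.40)}


def message_cost(model, context=CONTEXT_TOKENS):
    """API-equivalent USD for one message that fills the context window."""
    input_tokens = context * INPUT_SHARE
    output_tokens = context - input_tokens
    price_in, price_out = PRICES[model]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


def monthly_cost(model):
    """API-equivalent USD for a month at full message quota."""
    return message_cost(model) * MONTHLY_MESSAGES[model]


for model in PRICES:
    print(f"{model}: ${message_cost(model):.4f} per message, ${monthly_cost(model):,.2f} per month")
```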

Advanced Voice Mode is super expensive on the API too, more than GPT-4o on its own, and so is Deep Research, but neither will change the bottom line here.

You are getting $2688 + $11520 + $176 + $277 = $14,661 worth of usage for your $20 every month.

If you only used 1% of that, that's still $146 worth of usage, or 7.3x of what you paid!
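
As a sanity check on the total, here is a short sketch that adds up the monthly figures from the tables and shows how much each model contributes. The dictionary simply restates the table totals; the helper names are illustrative.

```python
PLAN_PRICE_USD = 20.0

# Monthly API-equivalent totals taken from the tables in this section.
MONTHLY_API_VALUE_USD = {
    "GPT-4o": 2688.00,
    "GPT-4": 11520.00,
    "OpenAI o1": 176.40,
    "OpenAI o3-mini": 277.20,
}

total_value = sum(MONTHLY_API_VALUE_USD.values())


def value_multiple(utilization, plan_price=PLAN_PRICE_USD):
    """How many times the plan price you get back at a given utilization (0..1)."""
    return total_value * utilization / plan_price


def share(model):
    """Fraction of the total theoretical value contributed by one model."""
    return MONTHLY_API_VALUE_USD[model] / total_value


print(f"Total: ${total_value:,.2f}")
print(f"Multiple at 1% utilization: {value_multiple(0.01):.1f}x")
for model in MONTHLY_API_VALUE_USD:
    print(f"{model}: {share(model):.1%} of total")
```

Most of the headline total comes from GPT-4's high per-token price (roughly 79% of it), so the multiple you actually see depends heavily on which models you use.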

Conclusion

The $20 per month AI plans are bonkers value for money. People are quick to suggest using the API to "save money", but would you, really? If you were using it heavily, absolutely not. Not to mention the hidden cost of seeing your API usage bill go up and doing mental arithmetic with every single message you send. Will that hinder your usage? Would you use it more, and hence save more time and get more value out of it, if you knew it was a fixed cost? In my experience that's absolutely true.
