This guide covers how to use Claude.ai without wasting your usage budget. A few bad habits can quietly eat through your limits and slow you down.
Tokens are the units Claude uses to read and write. Roughly 1 token = ¾ of a word. Every message you send and every response you receive burns tokens from your usage limit.
This sentence, broken into tokens:
One turn = one message you send + one response Claude gives. The problem: Claude re-reads the entire conversation on every single turn. Costs compound fast.
You
Claude
You
Claude
You
MCP (Model Context Protocol) is the technology that lets Claude connect to external tools — Google Drive, Slack, Gmail, Canva, and more. In Claude.ai these are called Connectors.
When a connector is active, Claude processes it on every message — even when you're just drafting an email and don't need it. Burns your session and weekly limits faster.
Enable a connector only for the conversation where you need it. Toggle it off when done. Don't leave connectors running in every chat.
Connectors require a one-time setup. After that, you enable them per conversation — they are not on by default in every chat.
Go to Settings → Connectors, find the service you want, and connect it. Claude will prompt you to log in to that service — this links your personal account (e.g., your Google login for Drive).
- Once authenticated, it persists across sessions (web, desktop, iOS, Android).
- If a connector stops working, try disconnecting and reconnecting:
Settings → Connectors.
- 1Click + (lower left of chat input) or type
/to open the menu. - 2Hover over Connectors and toggle on only what you need for that conversation.
- 3Claude can now use those services when relevant to your requests.
- 4When done, toggle off via the same menu — or go to
Settings → Connectors →Disconnect to remove it entirely.
From + → Connectors → Tool access: switch to On demand if you have 10+ connectors active. This tells Claude to only use a connector when you explicitly ask, saving tokens.
Vague prompts cost more tokens. Claude guesses, asks clarifying questions, or produces output you'll revise across multiple turns. Be specific from the start.
"Can you help me write something?"
Claude has to ask what type of content, what tone, what audience — 3–4 extra turns you didn't need.
"Write a 3-sentence follow-up email to a client who received a proposal last week. Tone: professional, warm. CTA: schedule a 30-minute call."
One prompt. One response. Done.
"Hi Claude! Hope you're doing well. I was wondering if maybe you could possibly help me..."
Every extra word is a token. Claude doesn't need a greeting.
"Summarize this article in 5 bullet points. [paste text]"
Gets the job done.
Claude 4.7 is significantly more literal than previous versions. If you want an instruction to apply across the whole task, say so explicitly. Don't assume it will infer your intent.
Anthropic recommends placing your document(s) at the top of the prompt and your actual question or instruction at the end. This ordering can significantly improve output quality.
Given the document above, summarize the three most critical risks.
Wrapping sections in XML tags tells Claude exactly what belongs where. Especially useful for complex prompts with multiple inputs.
[Background info, relevant docs]
</context>
<task>
[What you want Claude to do]
</task>
<output_format>
[Bullet points / email / table / etc.]
</output_format>
Ask Claude to pull the relevant quotes or sections first, then do the analysis. This grounds the answer in the actual document before Claude starts reasoning — and produces more accurate results.
For coding and agentic tasks, Anthropic recommends setting effort to xhigh or at least high. For simple, quick tasks, medium or low is more token-efficient. Set it explicitly in your prompt:
— vs —
Quick task, low effort needed. Reformat this list alphabetically.
When generating visual output or design-adjacent work, Claude 4.7 defaults to warm cream backgrounds, serif fonts, and terracotta accents. If that's not what you want, specify your palette and visual direction upfront.
"Create a one-pager for this service."
You'll get the boutique coffee shop aesthetic whether you want it or not.
"Create a one-pager. Use a clean white background, sans-serif fonts (Inter or similar), blue and charcoal as primary colors. Corporate, not decorative."
The best prompts aren't about magic words. They give Claude what any good employee needs:
Instead of pasting the same background context into every chat, store it once in a Claude Project. That context loads efficiently without bloating individual conversations.
- 1Create a Project in Claude.ai — e.g., per client, topic, or workflow.
- 2Upload reference files once: templates, background docs, style guides. Don't paste these into every chat.
- 3Write Project Instructions — tell Claude its role, your preferred format, any constraints.
- 4Start short, focused conversations inside the project for each task.
- 5Start a new conversation for each distinct task. Never keep one thread running all week.
Claude.ai offers three models. Using a heavier model than the task needs burns more tokens for no gain. Match the model to the work.
Click the model name shown in the chat header or the dropdown near the message input. Select Haiku, Sonnet, or Opus from the list.
Haiku is lightest. Sonnet is moderate. Opus uses the most — it reasons more deeply per token. Using Opus on a simple task burns your limit for no gain.
Navigate here: Settings → Usage (left sidebar, both web and desktop app). Check it before you hit a wall — not after.
- ✓Write complete, specific prompts — no vague openers, no pleasantries
- ✓For long docs: paste the doc first, question last — use XML tags for complex prompts
- ✓Set effort level explicitly — high/xhigh for complex work, low/medium for simple tasks
- ✓Keep conversations under 6–7 turns — start fresh for each new task
- ✓Use Sonnet by default — drop to Haiku for simple tasks, Opus only when Sonnet isn't enough
- ✓Enable connectors only for the conversation that needs them — toggle off when done
- ✓Store recurring context in Projects — don't paste it into every chat
- ✓Check Settings → Usage regularly — watch session % and weekly "All models" %
- ✓Above 70% weekly — go lean: shorter prompts, fewer turns, Haiku where possible