

LLM as a Trickster

Work with the LLM as a confident trickster: creative but fallible. Generate, then verify.

September 19, 2025
8 min read
Promptise Team
Beginner
Mental Model · Prompt Engineering · AI Mindset · Verification

The model will sound confident even when it’s wrong. Mindset: “Trust, but verify.”


Hook & Promise

Some colleagues need guardrails. Large language models are that charming teammate who answers fast, never hedges, and occasionally invents a meeting that didn’t happen. The trick isn’t to muzzle the model; it’s to learn the cadence of working with a confident storyteller. By the end of this guide, you’ll know how to hear the music behind the words: when to lean in, when to slow down, and how to make the model prove its claims without killing momentum.

Lay of the Land

A modern LLM produces the most plausible next token given the context. Plausibility and truth are cousins, not twins. When the model “hallucinates,” it isn’t lying with intent; it’s completing a pattern with confidence. The voice sounds certain because the probability distribution is sharp, not because the facts are checked.

A trickster isn’t only a liar; in myth it’s also a boundary-tester. It discovers new paths by breaking expected rules. That impulse is useful for brainstorming and style, risky for facts and calculations. Treat the model like that: a brave explorer who needs a navigator.

The Move

The core move is simple: separate generation from verification. Ask the model to produce ideas, drafts, or answers, then switch its role from “storyteller” to “auditor.” Good work with LLMs alternates between those modes—create → check → revise—instead of asking for final truth in a single shot.

Think of this rhythm:

  • Story mode: “Give me the best coherent answer.”

  • Skeptic mode: “Now show me why I shouldn’t trust it.”

  • Referee mode: “Resolve contradictions and state what’s actually supported.”

If you internalize that cadence, the trickster becomes a sparring partner who sharpens your reasoning instead of a source of risk.
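The three-mode cadence can be sketched as a small loop. Everything below is illustrative: `call_llm` is a stand-in for whatever chat client you actually use, and the role prompts are placeholders, not a fixed API.

```python
# Sketch of the story -> skeptic -> referee cadence.
# call_llm is a hypothetical stub; swap in your real model client.

def call_llm(role: str, prompt: str) -> str:
    # Placeholder: a real implementation would call your provider here.
    return f"[{role} response to: {prompt[:40]}...]"

def create_check_revise(question: str) -> dict:
    # Story mode: best coherent answer.
    draft = call_llm(
        "storyteller",
        f"Give me the best coherent answer to: {question}",
    )
    # Skeptic mode: reasons not to trust the draft.
    critique = call_llm(
        "skeptic",
        f"List concrete reasons not to trust this draft:\n{draft}",
    )
    # Referee mode: keep only what is actually supported.
    final = call_llm(
        "referee",
        "Resolve the contradictions below and state only what is "
        f"actually supported.\nDRAFT:\n{draft}\nCRITIQUE:\n{critique}",
    )
    return {"draft": draft, "critique": critique, "final": final}

result = create_check_revise("How do I renew a library card in Zurich?")
```

The separation matters more than the wording: each call gets a single job, so the model never has to be creative and skeptical in the same breath.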

Show, Don’t Tell

Here’s one compact moment you can try with any non-sensitive claim.

What it does: drafts an answer first, then critiques itself, then proposes concrete checks you can run yourself.

You are my fast-first-draft.
1) Draft the answer to {{QUESTION}}.
2) Switch roles: list 3 ways this draft could be wrong (assumptions, edge cases, missing data).
3) Propose 2 verification steps with sources I could check (name the type of source, not a URL).
4) Rewrite the final answer with those caveats explicitly noted.
Return in sections: DRAFT / WAYS IT COULD BE WRONG / VERIFY / REWRITE.

You’ll notice the tone changes. The point isn’t to eliminate confidence; it’s to bind confidence to scrutiny.
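If you reuse the prompt often, a tiny formatter avoids copy-paste slips. The template text mirrors the block above; the function and variable names are my own, nothing here is prescribed.

```python
# Reusable wrapper around the draft/critique/verify/rewrite prompt.
TEMPLATE = """You are my fast-first-draft.
1) Draft the answer to {question}.
2) Switch roles: list 3 ways this draft could be wrong (assumptions, edge cases, missing data).
3) Propose 2 verification steps with sources I could check (name the type of source, not a URL).
4) Rewrite the final answer with those caveats explicitly noted.
Return in sections: DRAFT / WAYS IT COULD BE WRONG / VERIFY / REWRITE."""

def build_prompt(question: str) -> str:
    # str.format keeps the question slot explicit and hard to forget.
    return TEMPLATE.format(question=question)

prompt = build_prompt("What is the warranty period for Product X?")
```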


Deepen: When the Trickster Helps—and Hurts

  • Great fit: ideation, tone exploration, analogies, outlines, reframing a problem, generating test cases, summarizing a known source you provide. In these, plausibility is the goal or you control the source.

  • Handle with care: claims about real-world facts, code that touches money or safety, legal or medical summaries, names and dates, anything reputational. Here, plausibility must bow to provenance.

💡 Insight: The model is least reliable when you ask for specifics it didn’t see directly (e.g., a niche figure’s middle name) and most reliable when you ask for structure-preserving transforms of text you supply (e.g., “rewrite this memo at 200 words in a neutral tone”).

⚠️ Pitfall: “Cite your sources” is not a magic phrase. Many models will fabricate citations that fit the vibe of an academic reference. Better: ask for source types (“government registry,” “upstream library docs”) and validation procedures (“cross-check figure A against table B from the annual report”).

Working Intuition

Treat answers as claims with a confidence aesthetic. The aesthetic (voice, certainty, formatting) can mislead you into skipping checks. Build your own reflex:

  1. Spot the fragile hinge. What single assumption would make this answer fall apart? Ask for it explicitly.

  2. Demand falsifiability. “What observation would disprove this?” If there’s no clear disproof, you’re holding vibes, not knowledge.

  3. Prefer verifiable scaffolds. Ask for tables with units, named data fields, and unambiguous definitions—things you or a tool can check.
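The three reflexes above can be captured as a small record you fill in before accepting an answer. The field names are my own shorthand, not a standard.

```python
from dataclasses import dataclass

@dataclass
class Claim:
    """An LLM answer treated as a claim, not a fact."""
    statement: str
    fragile_hinge: str   # the single assumption that would break the claim
    disproof: str        # an observation that would falsify it
    confidence: str      # "low" | "med" | "high"

    def is_checkable(self) -> bool:
        # A claim with no stated disproof is vibes, not knowledge.
        return bool(self.disproof.strip())

c = Claim(
    statement="The gadget carries a 2-year warranty.",
    fragile_hinge="Assumes the EU storefront, not the US one.",
    disproof="The US store's warranty page lists 1 year.",
    confidence="med",
)
```

Filling in `fragile_hinge` and `disproof` by hand is the point: if you can’t, you haven’t yet earned the right to trust the statement.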

In Practice

Use these micro-prompts to keep the model honest without throttling creativity.

  • Expose the assumption: “Underline any sentence in your answer that depends on an unverified fact. After the answer, list those facts as checkboxes.”

  • Force a counter-argument: “Argue against your own conclusion in 4 sentences. Do not change the claim; test it.”

  • Name the evidence, not the link: “List 3 primary source types that would confirm or refute this claim, and the exact field/section I should inspect.”

  • Quantify uncertainty: “State a 3-point confidence scale (low/med/high) and justify the rating with what is known vs. assumed.”
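The four micro-prompts can be bundled into one reusable “verification kit” suffix you append to any question. The wording is lifted from the bullets above and is freely editable.

```python
# A reusable suffix that bolts the four honesty checks onto any question.
VERIFICATION_KIT = "\n".join([
    "After answering, also do the following:",
    "- Underline any sentence that depends on an unverified fact, "
    "then list those facts as checkboxes.",
    "- Argue against your own conclusion in 4 sentences.",
    "- List 3 primary source types that would confirm or refute the claim.",
    "- Rate confidence (low/med/high) and justify it with known vs. assumed.",
])

def with_verification(question: str) -> str:
    return f"{question}\n\n{VERIFICATION_KIT}"

p = with_verification("Summarize the return policy of Store Y.")
```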

Troubleshooting the Trickster

  • It keeps fabricating sources. Shift from “give me links” to “design me a verification protocol.” You can then run the actual search yourself or with a retrieval tool.

  • It doubles down when uncertain. Add a role constraint: “If confidence < high, you must propose at least one disconfirming test before concluding.”

  • It waffles and becomes useless. Over-skepticism can paralyze. Restore a two-step rhythm: free generation first, then critique. Don’t blend them into every sentence.

Mini Lab (5 minutes)

Goal: Feel the create–check rhythm on a harmless topic.

  1. Pick a mundane, checkable claim (e.g., the steps of renewing a library card in your city, or the warranty policy of a popular gadget).

  2. Paste the “Show, Don’t Tell” prompt with your {{QUESTION}}.

  3. From the output, choose one verification step you can execute quickly (e.g., skim an official page you find yourself).

  4. Return to the model: “Incorporate this evidence: {{SHORT NOTE}}. Where did your original answer overreach?”

Expected output shape:

  • A confident draft, a short list of ways it could be wrong, 1–2 concrete verification ideas, and a revised answer with caveats. If your real-world check contradicts a detail, the revised answer should explicitly narrow scope or shift language from “is” to “appears” with source-based limits.

Why This Works

The trickster pattern isn’t a flaw; it’s a feature of token-by-token reasoning under uncertainty. Confidence is a style, not a guarantee. By changing the job description of the model—first creator, then critic—you exploit two superpowers: effortless fluency and tireless red-teaming. You keep the creative gains while lowering the cost of being wrong.

Boundaries & Trade-offs

  • Cost/latency: Two-step prompting is slower and uses more tokens. Save it for decisions where error costs are non-trivial.

  • Human loop: Final verification often requires you (or a tool) to touch reality—datasets, registries, code execution. Don’t outsource that last inch to prose.

  • Domain variance: In stable domains with well-structured sources (finance filings, RFCs), verification scaffolds work beautifully. In emerging domains, expect more “Not confirmed” outcomes; make that an acceptable state, not a failure.


Summary & Conclusion

Treat the LLM as a trickster: bold, generative, and occasionally wrong with flair. Its confidence is a surface effect of probability, not a promise of truth. When you separate generation from verification, you turn that boldness into a productive asset. The rhythm—create, critique, corroborate—keeps your velocity without sacrificing accuracy.

Over time, you’ll feel where the model shines (style, structure, reframing) and where it must be tethered (facts with consequences). Make falsifiability your north star, ask for evidence types, and require the model to carry its own caveats.

You’re not muzzling the trickster. You’re giving it a stage and a spotlight—plus a safety net that you control.

Next steps

  • Try the Mini Lab on a real work task with low stakes; practice the create–check loop.

  • Build a tiny “verification kit” prompt you can reuse (assumptions list, disconfirming test, confidence rating).

  • For high-stakes areas, pair the model with a retrieval or execution tool so checks touch real data.


Reflection: The next time the model sounds perfectly sure, what single observation—if you could get it—would make you change your mind?
