Capabilities and Limitations of LLMs

Understanding LLM capabilities and limitations is essential for prompt engineering. Large Language Models can write, explain, summarize, translate, brainstorm, classify, code, and reason through many types of tasks. However, they also have limits. They can make mistakes, misunderstand context, produce unsupported claims, and fail when instructions are vague.

A skilled prompt engineer does not treat AI as magic. Instead, they understand what the model is good at, where it needs guidance, and when human review is necessary.

What LLMs Can Do Well

LLMs are especially useful for language-heavy tasks. They can transform rough ideas into structured content, explain complex topics, generate alternatives, summarize long text, and help users think through problems.

Writing and Editing

LLMs can draft emails, blogs, reports, captions, scripts, outlines, and polished text.

Explanation

They can simplify difficult concepts for different learner levels and audiences.

Summarization

They can condense long text into key points, executive summaries, or action items.

Ideation

They can generate ideas for campaigns, projects, research topics, lessons, and content plans.

More Practical Capabilities

Capability	What It Means	Example Prompt
Classification	Sorting text into categories.	Classify these customer reviews as positive, negative, or neutral.
Extraction	Finding specific information from text.	Extract names, company names, and email addresses from this text.
Rewriting	Changing tone, style, or clarity.	Rewrite this paragraph in a more professional tone.
Coding Support	Generating, explaining, or debugging code.	Explain this Python error and suggest a fix.
Structured Planning	Organizing goals into steps.	Create a 30-day learning plan for SQL beginners.

What LLMs Cannot Do Perfectly

LLMs can produce impressive answers, but they are not always correct. They may generate responses that sound confident even when the information is incomplete or inaccurate. They may also struggle with very fresh information, highly specialized facts, hidden context, or tasks requiring exact calculation unless tools are used properly.

High-Risk Mistake: Using an AI response as final truth without checking it can lead to wrong decisions, especially in legal, medical, financial, academic, or business-critical situations.

Common Limitations of LLMs

Limitation	What Can Go Wrong	Prompting Strategy
Hallucination	The model may invent facts, names, sources, or details.	Ask for uncertainty, source checking, or use verified reference material.
Outdated Information	The model may not know recent changes unless connected to current sources.	Use search or provide updated context when current facts matter.
Ambiguity	The model may guess what the user means.	Define the task, audience, domain, and expected output clearly.
Weak Numerical Precision	The model may make calculation or logic errors.	Ask for step-by-step checking or use calculation tools when accuracy matters.
Overgeneralization	The answer may sound broad and generic.	Add specific context, examples, constraints, and decision criteria.

Capability Depends on Prompt Quality

The same model can produce a poor answer or a strong answer depending on the prompt. If the prompt is vague, the model may give a broad response. If the prompt defines the goal, role, context, format, and constraints, the output becomes more useful.

Weak Prompt

“Give marketing ideas.”

Better Prompt

“Give ten Instagram content ideas for a beginner-friendly data analytics training brand. The audience is college students and early-career professionals. Use an educational tone and include one caption hook for each idea.”

The better prompt improves the result because it narrows the domain, audience, platform, tone, and required output.

When Human Review is Necessary

Human review is necessary whenever the output affects decisions, reputation, safety, money, legal responsibility, academic quality, or customer experience. AI can assist the work, but the final judgment should remain with a person who understands the context.

Important: AI should be treated as an assistant, not an unquestioned authority. Verification is part of responsible prompt engineering.

[Image/Diagram: A two-column visual showing LLM capabilities on one side and limitations on the other, with human review as the bridge between them.]

How Prompt Engineering Reduces Limitations

Prompt engineering cannot remove every limitation, but it can reduce many problems. A good prompt can ask the model to state assumptions, separate facts from suggestions, avoid unsupported claims, follow a format, ask clarifying questions, or use only provided material.

Risk	Prompting Technique
Generic response	Add audience, goal, examples, and constraints.
Unsupported claims	Ask the model to mark uncertain statements and avoid invented facts.
Wrong format	Specify the exact format and provide a sample structure.
Misunderstood task	State the role, task, background, and expected output clearly.

Key Takeaways

LLMs are strong at writing, summarizing, explaining, classifying, extracting, coding support, and ideation.
LLMs can still make mistakes, hallucinate, misunderstand context, or produce outdated information.
Prompt quality strongly affects output quality.
Human review is essential for high-stakes or decision-focused work.
Good prompts reduce risk by adding context, constraints, examples, and verification instructions.

2.3 Capabilities and Limitations