Prompt Engineering - Dev Weekends

December 2025 Update: Covers chain-of-thought, few-shot learning, system prompts, and the latest prompting techniques from OpenAI and Anthropic research.

Why Prompts Matter

The difference between a junior and senior AI engineer often comes down to prompt engineering. A well-crafted prompt can:

Turn a $0.10 GPT-4o call into a $0.001 GPT-4o-mini call
Reduce hallucinations substantially (the exact gain depends on the task and model)
Get structured, predictable outputs every time

The 80/20 Rule: 80% of prompt quality comes from clear instructions and examples. The remaining 20% is advanced techniques.

The Anatomy of a Great Prompt

┌─────────────────────────────────────────────────────────────┐
│                      SYSTEM PROMPT                          │
│  • Role/Persona definition                                  │
│  • Capabilities and constraints                             │
│  • Output format requirements                               │
│  • Rules and guidelines                                     │
└─────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────┐
│                      FEW-SHOT EXAMPLES                      │
│  • 2-5 input/output pairs                                   │
│  • Cover edge cases                                         │
│  • Show exact format expected                               │
└─────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────┐
│                      USER INPUT                             │
│  • Clear, specific request                                  │
│  • Relevant context included                                │
│  • Output format reminder (optional)                        │
└─────────────────────────────────────────────────────────────┘

System Prompts: Your AI’s DNA

Basic Structure

SYSTEM_PROMPT = """You are an expert {role} with deep knowledge of {domain}.

## Your Capabilities
- {capability_1}
- {capability_2}
- {capability_3}

## Rules
1. Always {rule_1}
2. Never {rule_2}
3. When uncertain, {uncertainty_behavior}

## Output Format
{format_specification}
"""

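As a rough illustration of how the placeholders might be filled, the sketch below builds a system prompt from a few values. The `build_system_prompt` helper, the shortened template, and the example values are hypothetical and only show the mechanics of `str.format`.

```python
# Shortened version of the template above, kept small for illustration.
SYSTEM_TEMPLATE = """You are an expert {role} with deep knowledge of {domain}.

## Rules
{rules}

## Output Format
{format_specification}
"""


def build_system_prompt(
    role: str, domain: str, rules: list[str], format_specification: str
) -> str:
    """Fill the template; rules are rendered as a numbered list."""
    numbered = "\n".join(f"{i}. {rule}" for i, rule in enumerate(rules, 1))
    return SYSTEM_TEMPLATE.format(
        role=role,
        domain=domain,
        rules=numbered,
        format_specification=format_specification,
    )


system_prompt = build_system_prompt(
    role="data analyst",
    domain="SQL and reporting",
    rules=["Always state assumptions", "Never invent column names"],
    format_specification="Plain text, at most 5 sentences.",
)
print(system_prompt)
```
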
Production System Prompt

CODE_REVIEW_PROMPT = """You are a senior software engineer performing code review.

## Your Expertise
- Python, JavaScript, TypeScript, Go
- Clean code principles and SOLID
- Security best practices
- Performance optimization

## Review Process
1. First, understand the code's purpose
2. Check for bugs and logic errors
3. Evaluate code quality and readability
4. Identify security vulnerabilities
5. Suggest performance improvements

## Rules
- Be constructive, not critical
- Explain WHY something is an issue
- Provide specific, actionable fixes
- Praise good patterns when you see them
- If code is good, say so briefly

## Output Format
Return a JSON object:
{
  "summary": "One-line summary of the code quality",
  "issues": [
    {
      "severity": "critical|major|minor|suggestion",
      "line": <line_number or null>,
      "issue": "Description of the problem",
      "fix": "Suggested solution with code"
    }
  ],
  "positive": ["List of things done well"],
  "score": <1-10>
}
"""

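A prompt that asks for JSON is usually paired with a check on the reply. The sketch below is one possible validator for the schema described in the prompt above. The `parse_review` name and the sample reply are made up for illustration, and a real pipeline might prefer a schema library instead.

```python
import json

ALLOWED_SEVERITIES = {"critical", "major", "minor", "suggestion"}


def parse_review(raw: str) -> dict:
    """Parse a model reply and check it against the schema in the prompt.

    Raises ValueError if the reply is not valid JSON or breaks the schema.
    """
    try:
        review = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"Reply is not valid JSON: {exc}") from exc

    if not isinstance(review, dict):
        raise ValueError("Reply must be a JSON object")
    for key in ("summary", "issues", "positive", "score"):
        if key not in review:
            raise ValueError(f"Missing key: {key}")

    for issue in review["issues"]:
        if not isinstance(issue, dict) or issue.get("severity") not in ALLOWED_SEVERITIES:
            raise ValueError(f"Invalid issue entry: {issue!r}")

    score = review["score"]
    # bool is a subclass of int in Python, so exclude it explicitly
    if isinstance(score, bool) or not isinstance(score, int) or not 1 <= score <= 10:
        raise ValueError(f"Score must be an integer from 1 to 10, got {score!r}")
    return review


# Hypothetical reply, standing in for real model output
sample_reply = (
    '{"summary": "Solid", "issues": [{"severity": "minor", "line": 3, '
    '"issue": "Unused import", "fix": "Remove it"}], '
    '"positive": ["Clear names"], "score": 8}'
)
review = parse_review(sample_reply)
```
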
Few-Shot Learning

Why Few-Shot Works

LLMs learn patterns from examples. 2-5 examples can:

Define exact output format
Show edge case handling
Reduce ambiguity dramatically

Few-Shot Template

def create_few_shot_prompt(task: str, examples: list[dict], query: str) -> str:
    prompt = f"Task: {task}\n\n"
    prompt += "Examples:\n"
    
    for i, ex in enumerate(examples, 1):
        prompt += f"\nExample {i}:\n"
        prompt += f"Input: {ex['input']}\n"
        prompt += f"Output: {ex['output']}\n"
    
    prompt += f"\nNow complete this:\nInput: {query}\nOutput:"
    return prompt

# Example: Sentiment Analysis
examples = [
    {"input": "This product is amazing!", "output": "positive"},
    {"input": "Terrible experience, want refund", "output": "negative"},
    {"input": "It's okay, nothing special", "output": "neutral"},
    {"input": "Love the design but shipping was slow", "output": "mixed"},
]

prompt = create_few_shot_prompt(
    task="Classify the sentiment of the review",
    examples=examples,
    query="Best purchase I've made this year, highly recommend!"
)
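
With chat-style APIs, the same examples can also be passed as alternating user and assistant messages instead of one long string. The sketch below assumes the `role`/`content` message format used elsewhere in this guide. The `few_shot_messages` helper is a hypothetical name.

```python
def few_shot_messages(task: str, examples: list[dict], query: str) -> list[dict]:
    """Build a chat message list: task as system prompt, examples as prior turns."""
    messages = [{"role": "system", "content": task}]
    for ex in examples:
        messages.append({"role": "user", "content": ex["input"]})
        messages.append({"role": "assistant", "content": ex["output"]})
    # The real query goes last, so the model continues the established pattern
    messages.append({"role": "user", "content": query})
    return messages


messages = few_shot_messages(
    task="Classify the sentiment of the review as positive, negative, neutral, or mixed.",
    examples=[
        {"input": "This product is amazing!", "output": "positive"},
        {"input": "Terrible experience, want refund", "output": "negative"},
    ],
    query="Best purchase I've made this year, highly recommend!",
)
```

The resulting list can be passed as the `messages` argument of a chat completion call.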

Chain-of-Thought (CoT)

The Problem

LLMs often fail at multi-step reasoning when asked to jump straight to the answer.

The Solution

Force the model to “show its work” before answering.

# ❌ Bad: Direct answer
prompt = "What is 23 * 47 + 156 / 4?"

# ✅ Good: Chain of thought
prompt = """What is 23 * 47 + 156 / 4?

Let's solve this step by step:
1. First, calculate 23 * 47
2. Then, calculate 156 / 4
3. Finally, add the results

Show your work:"""

Zero-Shot CoT

Append a step-by-step cue such as “Let’s think step by step” to any prompt:

REASONING_SUFFIX = "\n\nLet's approach this step by step:"

def add_cot(prompt: str) -> str:
    return prompt + REASONING_SUFFIX
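
Once the model reasons out loud, the final answer has to be separated from the reasoning. One common approach, sketched below under the assumption that the prompt asks for a closing `Answer:` line, is to parse that marker. The suffix wording and the `extract_final_answer` helper are illustrative, and the sample response is hand-written rather than real model output.

```python
import re
from typing import Optional

# Hypothetical suffix that requests a parseable final line
ANSWER_SUFFIX = (
    "\n\nLet's think step by step, then finish with a line "
    "of the form 'Answer: <value>'."
)


def extract_final_answer(response: str) -> Optional[str]:
    """Return the text after the last 'Answer:' line, or None if absent."""
    matches = re.findall(r"^Answer:\s*(.+)$", response, flags=re.MULTILINE)
    return matches[-1].strip() if matches else None


# Hand-written response for the arithmetic example above
sample_response = "23 * 47 = 1081\n156 / 4 = 39\n1081 + 39 = 1120\nAnswer: 1120"
final_answer = extract_final_answer(sample_response)
print(final_answer)  # 1120
```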

Structured CoT

COT_PROMPT = """
{question}

## Analysis Framework
1. **Understand**: What is being asked?
2. **Identify**: What information do we have?
3. **Plan**: What steps are needed?
4. **Execute**: Work through each step
5. **Verify**: Does the answer make sense?

## Solution
"""

Advanced Techniques

Self-Consistency

Run the same prompt multiple times and take the majority answer:

from collections import Counter
from openai import OpenAI

client = OpenAI()

def self_consistent_answer(prompt: str, n: int = 5) -> str:
    """Generate multiple answers and return the most common one"""
    answers = []
    
    for _ in range(n):
        response = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": prompt}],
            temperature=0.7  # Some randomness needed
        )
        answers.append(response.choices[0].message.content.strip())
    
    # Return the most common answer (exact string match, so ask for a short final answer)
    counter = Counter(answers)
    return counter.most_common(1)[0][0]
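
Because the vote compares strings, small differences in casing or whitespace can split it. The sketch below normalizes answers before counting and also reports the share of votes the winner received. It takes the sampling function as a parameter so it can run without an API key. `majority_vote` and the canned replies are illustrative only.

```python
from collections import Counter
from typing import Callable


def majority_vote(sample: Callable[[], str], n: int = 5) -> tuple[str, float]:
    """Call `sample` n times; return the most common normalized answer and its vote share."""
    answers = [sample().strip().lower() for _ in range(n)]
    answer, votes = Counter(answers).most_common(1)[0]
    return answer, votes / n


# Canned replies standing in for five sampled model outputs
canned = iter(["Positive", "positive ", "neutral", "positive", "Neutral"])
answer, agreement = majority_vote(lambda: next(canned), n=5)
print(answer, agreement)  # positive 0.6
```

A low agreement share can serve as a signal to route the input to a stronger model or to a human.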

Prompt Chaining

Break complex tasks into sequential prompts:

# Assumes llm_call is an async helper that sends one prompt and returns the reply text.
async def research_and_write(topic: str) -> str:
    """Chain: Research → Outline → Write → Edit"""
    
    # Step 1: Research
    research = await llm_call(f"""
    Research the topic: {topic}
    List 5-7 key points with sources.
    """)
    
    # Step 2: Outline
    outline = await llm_call(f"""
    Based on this research:
    {research}
    
    Create a detailed article outline with sections and subsections.
    """)
    
    # Step 3: Write
    draft = await llm_call(f"""
    Write a comprehensive article following this outline:
    {outline}
    
    Use the research for accuracy. Target: 1500 words.
    """)
    
    # Step 4: Edit
    final = await llm_call(f"""
    Edit this article for clarity, flow, and engagement:
    {draft}
    
    Fix any errors. Improve transitions. Make it compelling.
    """)
    
    return final
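
The same pattern can be written as a generic loop over prompt templates. The sketch below assumes each template has a single `{previous}` placeholder and takes the model call as a parameter. `run_chain` and `fake_llm` are hypothetical names, and the fake model only echoes its prompt so the example runs offline.

```python
import asyncio
from typing import Awaitable, Callable


async def run_chain(
    steps: list[str],
    first_input: str,
    llm: Callable[[str], Awaitable[str]],
) -> list[str]:
    """Run templates in order; each step receives the previous output as {previous}."""
    outputs = []
    previous = first_input
    for template in steps:
        previous = await llm(template.format(previous=previous))
        outputs.append(previous)  # keep intermediate results for debugging
    return outputs


async def fake_llm(prompt: str) -> str:
    """Stand-in for a real model call: wraps the first prompt line in brackets."""
    return f"[{prompt.splitlines()[0]}]"


steps = [
    "Research the topic: {previous}",
    "Outline based on: {previous}",
    "Write from outline: {previous}",
]
outputs = asyncio.run(run_chain(steps, "vector databases", fake_llm))
```

Keeping every intermediate output makes it easier to see which step in the chain went wrong.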

Role Prompting

Assign specific expertise for better outputs:

EXPERT_ROLES = {
    "security": "You are a cybersecurity expert with 15 years of experience at Google. You've reviewed thousands of codebases for vulnerabilities.",
    
    "performance": "You are a performance engineer who optimized systems handling 1M+ requests/second at Netflix. You think in terms of latency percentiles and resource efficiency.",
    
    "architecture": "You are a principal architect who designed microservices at scale for Amazon. You balance pragmatism with technical excellence.",
    
    "ml": "You are a machine learning researcher from DeepMind. You understand both theoretical foundations and practical implementation details."
}

def expert_review(code: str, expertise: str) -> str:
    role = EXPERT_ROLES.get(expertise, "You are a senior software engineer.")
    return f"{role}\n\nReview this code:\n```\n{code}\n```"

Constitutional AI (Self-Critique)

Have the model critique and improve its own output:

# Assumes llm_call here is a synchronous helper that sends one prompt and returns the reply text.
def constitutional_response(query: str, principles: list[str]) -> str:
    # Initial response
    response = llm_call(query)
    
    # Critique against principles
    critique_prompt = f"""
    Original query: {query}
    Response: {response}
    
    Evaluate this response against these principles:
    {chr(10).join(f'- {p}' for p in principles)}
    
    What could be improved?
    """
    critique = llm_call(critique_prompt)
    
    # Revise based on critique
    revision_prompt = f"""
    Original query: {query}
    Original response: {response}
    Critique: {critique}
    
    Provide an improved response addressing the critique.
    """
    
    return llm_call(revision_prompt)

# Usage
principles = [
    "Be helpful and accurate",
    "Avoid harmful content", 
    "Acknowledge uncertainty",
    "Cite sources when possible"
]

Prompt Templates Library

Summarization

SUMMARIZE_PROMPT = """Summarize the following text in {length} sentences.

Focus on:
- Main arguments/findings
- Key data points
- Actionable conclusions

Text:
{text}

Summary:"""

Data Extraction

EXTRACT_PROMPT = """Extract structured data from this text.

Text: {text}

Extract the following fields (use null if not found):
{fields}

Return as JSON:"""
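
Models sometimes wrap JSON in a Markdown code fence or omit a field, so the reply is worth parsing defensively. The sketch below is one way to do that. `parse_json_reply` is a hypothetical helper and the sample reply is hand-written.

```python
import json
import re

FENCE = "`" * 3  # a Markdown code fence, built here to keep this snippet readable
FENCED_RE = re.compile(rf"^{FENCE}(?:json)?\s*(.*?)\s*{FENCE}$", flags=re.DOTALL)


def parse_json_reply(raw: str, fields: list[str]) -> dict:
    """Parse a JSON object reply, tolerating a surrounding code fence.

    Fields missing from the reply are filled with None.
    """
    text = raw.strip()
    fenced = FENCED_RE.match(text)
    if fenced:
        text = fenced.group(1)
    data = json.loads(text)
    if not isinstance(data, dict):
        raise ValueError("Expected a JSON object")
    return {field: data.get(field) for field in fields}


# Hand-written reply standing in for model output
reply = FENCE + 'json\n{"name": "Ada", "email": null}\n' + FENCE
record = parse_json_reply(reply, ["name", "email", "phone"])
```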

Classification

CLASSIFY_PROMPT = """Classify the following into one of these categories: {categories}

Guidelines:
{guidelines}

Text: {text}

Category:"""
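
Classification replies are easier to trust when they are checked against the allowed labels. The sketch below shows one possible check. `normalize_category` and the category names are made up for illustration.

```python
from typing import Optional


def normalize_category(reply: str, categories: list[str]) -> Optional[str]:
    """Map a raw model reply onto one of the allowed categories, or None."""
    # Drop surrounding whitespace, then trailing periods and quote marks
    cleaned = reply.strip().strip(".\"'").lower()
    for category in categories:
        if cleaned == category.lower():
            return category
    return None


categories = ["billing", "technical", "other"]
matched = normalize_category(" Billing.\n", categories)
unmatched = normalize_category("refund request", categories)
```

A `None` result can trigger a retry or a fallback label rather than passing an unexpected value downstream.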

Translation with Context

TRANSLATE_PROMPT = """Translate the following from {source_lang} to {target_lang}.

Context: {context}
Tone: {tone}
Domain: {domain}

Original: {text}

Translation:"""

Debugging Prompts

Common Issues and Fixes

Problem	Cause	Solution
Too verbose	No length constraint	Add “in X sentences” or “max Y words”
Wrong format	Ambiguous instructions	Add few-shot examples
Hallucinations	Asking for unknown facts	Add “If unsure, say ‘I don’t know’”
Inconsistent	High temperature	Lower the temperature (e.g. temperature=0) for more repeatable outputs
Off-topic	Weak system prompt	Add explicit constraints

Prompt Testing Framework

from dataclasses import dataclass
from typing import Callable, Optional

@dataclass
class PromptTest:
    name: str
    input: str
    # Optional checks; leave as None to skip a check
    expected_contains: Optional[list[str]] = None
    expected_not_contains: Optional[list[str]] = None
    validator: Optional[Callable[[str], bool]] = None

def test_prompt(prompt_template: str, tests: list[PromptTest]) -> dict:
    results = {"passed": 0, "failed": 0, "details": []}
    
    for test in tests:
        prompt = prompt_template.format(input=test.input)
        response = llm_call(prompt)
        
        passed = True
        errors = []
        
        if test.expected_contains:
            for phrase in test.expected_contains:
                if phrase.lower() not in response.lower():
                    passed = False
                    errors.append(f"Missing: {phrase}")
        
        if test.expected_not_contains:
            for phrase in test.expected_not_contains:
                if phrase.lower() in response.lower():
                    passed = False
                    errors.append(f"Should not contain: {phrase}")
        
        if test.validator and not test.validator(response):
            passed = False
            errors.append("Custom validation failed")
        
        results["passed" if passed else "failed"] += 1
        results["details"].append({
            "test": test.name,
            "passed": passed,
            "errors": errors
        })
    
    return results
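
The `validator` field accepts any function from response text to `bool`. As a sketch of what such a function might look like, the factory below builds a validator that checks for a JSON object containing certain keys. `is_json_with_keys` is a hypothetical helper.

```python
import json
from typing import Callable


def is_json_with_keys(*required: str) -> Callable[[str], bool]:
    """Build a validator that passes only JSON objects containing all required keys."""

    def validator(response: str) -> bool:
        try:
            data = json.loads(response)
        except json.JSONDecodeError:
            return False
        return isinstance(data, dict) and all(key in data for key in required)

    return validator


check_review = is_json_with_keys("summary", "score")
print(check_review('{"summary": "ok", "score": 7}'))  # True
print(check_review("not json"))  # False
```

It could then be passed as `validator=is_json_with_keys("summary", "score")` when constructing a `PromptTest`.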

Example Prompts Library

Here’s a curated collection of proven prompts for various use cases. Adapted from Awesome ChatGPT Prompts.

Act as a Linux Terminal

I want you to act as a Linux terminal. I will type commands and you will reply 
with what the terminal should show. I want you to only reply with the terminal 
output inside one unique code block, and nothing else. Do not write explanations. 
Do not type commands unless I instruct you to do so. When I need to tell you 
something in English, I will do so by putting text inside curly brackets {like this}. 
My first command is pwd

Act as a Tech Interviewer

I want you to act as an interviewer. I will be the candidate and you will ask 
me the interview questions for the position of [Senior Backend Engineer]. 
I want you to only reply as the interviewer. Do not write all the conversation at once. 
I want you to only do the interview with me. Ask me the questions and wait for my 
answers. Do not write explanations. Ask me the questions one by one like an 
interviewer does and wait for my answers. My first sentence is "Hi"

Act as a SQL Expert

I want you to act as a SQL expert. I have a database with the following tables:
- users (id, name, email, created_at)
- orders (id, user_id, total, status, created_at)
- products (id, name, price, category)
- order_items (id, order_id, product_id, quantity)

When I describe what I want, write the SQL query to achieve it. 
Explain your query briefly. Optimize for readability first, then performance.

Act as a Code Reviewer

I want you to act as a senior code reviewer. Review the code I provide and:
1. Identify bugs and potential issues
2. Suggest improvements for readability and maintainability
3. Point out security vulnerabilities
4. Recommend performance optimizations

Be constructive and explain WHY something is an issue. Provide specific fixes.
Rate the overall code quality from 1-10.

Act as a UX/UI Developer

I want you to act as a UX/UI developer. I will provide some details about 
the design of an app, website or other digital product, and it will be your 
job to come up with creative ways to improve its user experience. This could 
involve creating prototypes, testing different designs and providing 
feedback on what works best. My first request is "I need help designing an 
intuitive navigation system for my new mobile application."

Act as a Regex Generator

I want you to act as a regex generator. Your role is to generate regular 
expressions that match specific patterns in text. You should provide the 
regex in a format that can be easily copied and pasted into a regex-enabled 
text editor or programming language. Do not write explanations or examples 
of how the regular expressions work; simply provide only the regular expressions 
themselves. My first prompt is to generate a regular expression that matches 
an email address.

Act as a Commit Message Generator

I want you to act as a commit message generator. I will provide you with 
information about the task and the prefix for the task code, and I would 
like you to generate an appropriate commit message using the conventional 
commit format. Do not write any explanations or other words, just reply 
with the commit message.

Format: <type>(<scope>): <subject>
Types: feat, fix, docs, style, refactor, test, chore
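
Generated commit messages can be checked against that format before use. The sketch below encodes the format and type list from the prompt above as a regular expression. Treating the scope as mandatory is an assumption taken from the format line, and `is_conventional_commit` is a hypothetical helper.

```python
import re

COMMIT_TYPES = ("feat", "fix", "docs", "style", "refactor", "test", "chore")
TYPE_PATTERN = "|".join(COMMIT_TYPES)
# <type>(<scope>): <subject>, with a non-empty scope and subject
COMMIT_RE = re.compile(rf"(?:{TYPE_PATTERN})\([^()\s]+\): \S.*")


def is_conventional_commit(message: str) -> bool:
    """Check the first line of a commit message against the format in the prompt."""
    lines = message.strip().splitlines()
    return bool(lines) and COMMIT_RE.fullmatch(lines[0]) is not None


print(is_conventional_commit("fix(api): handle empty payload"))  # True
print(is_conventional_commit("update stuff"))  # False
```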

Act as a Prompt Optimizer

I want you to act as a prompt engineer. I will provide you with a prompt, 
and your job is to improve it for better LLM performance. Consider:
1. Clarity and specificity
2. Adding relevant context
3. Including output format
4. Adding few-shot examples if helpful
5. Breaking complex tasks into steps

Explain what you changed and why. Then provide the optimized prompt.

Act as a Diagram Generator (Mermaid)

I want you to act as a Mermaid diagram generator. Create diagrams based 
on my descriptions using Mermaid syntax. Support flowcharts, sequence 
diagrams, class diagrams, and entity relationship diagrams. Output only 
the Mermaid code wrapped in a code block. Do not add explanations unless asked.

Act as a Technical Writer

I want you to act as a tech writer. You will act as a creative and engaging 
technical writer and create guides on how to do different things. I will 
provide you with a topic and you will write:
1. A clear introduction explaining the topic
2. Step-by-step instructions
3. Code examples where relevant
4. Common pitfalls and how to avoid them
5. A brief summary

Use markdown formatting. My first topic is: [topic]

Find 200+ more prompts at prompts.chat - an open-source collection of prompts for various use cases.

Key Takeaways

Be Specific

Vague prompts get vague answers. Specify format, length, tone, and constraints.

Show, Don't Tell

Few-shot examples are worth a thousand words of instruction.

Think in Steps

Chain-of-thought improves reasoning. Break complex tasks into chains.

Test and Iterate

Prompts need testing like code. Build a test suite for critical prompts.

What’s Next

OpenAI API

Apply your prompt engineering skills with the OpenAI API
