# Agentic Engineering ## Agenda 1. What is Agentic Engineering? 2. The Evolution: Coding โ Vibe Coding โ Agentic Engineering 3. The Four Agentic Design Patterns 4. Agent Architecture: How Agents Work 5. Agentic Coding Tools Landscape 6. Agentic Frameworks for Developers 7. The Protocols: MCP and A2A 8. Best Practices 9. Risks & Challenges 10. The Developer's New Role 11. Demo: Multi-Agent Workflow with Semantic Kernel 12. Key Takeaways & Resources --- ## 1. What is Agentic Engineering? **Agentic Engineering** is the practice of defining goals, constraints, and quality standards for AI agents, who then **autonomously plan, create, test, and refine** software and systems. Instead of writing every line of code, engineers: - ๐ฏ **Design workflows** and specify intentions - ๐ค **Delegate tasks** to AI agents - ๐ **Review and validate** outputs - ๐ **Iterate** with structured oversight --- ## 1. Key Characteristics | Characteristic | Description | | ------------------------ | --------------------------------------------------------------------- | | **Autonomy** | Agents act with minimal intervention, handling multi-step workflows | | **Goal-Oriented** | Engineers specify intentions and constraints, not procedural details | | **Iterative Validation** | Human oversight ensures outputs meet requirements | | **Collaboration** | Agents coordinate with each other and escalate to humans | | **Layered Oversight** | Spectrum from suggestion to full workflow execution under supervision | > ๐ Ref: > [Glide โ "What is Agentic Engineering?"](https://www.glideapps.com/blog/what-is-agentic-engineering) > ยท [Kilo.ai โ Agentic Engineering](https://path.kilo.ai/introduction/what-is-agentic-engineering/) --- ## 2. The Evolution: Coding โ Vibe Coding โ Agentic Engineering The way we build software has been transforming rapidly:
flowchart LR A["๐ฅ๏ธ Traditional
Coding"] --> B["๐ค AI-Assisted
Coding"] B --> C["๐ต Vibe
Coding"] C --> D["โ๏ธ Agentic
Engineering"]
### Key Milestones - **2021** โ GitHub Copilot technical preview (AI autocomplete enters mainstream) - **2024** โ Andrej Karpathy coins **"Vibe Coding"** โ "give in to the vibes," accept AI suggestions with minimal scrutiny - **2025** โ Limitations of vibe coding surface โ compounded bugs, chaotic codebases - **2025-2026** โ **Agentic Engineering** emerges โ from accepting suggestions to orchestrating fleets of AI agents --- ## 2. The Evolution โ Comparison | | Traditional Coding | Vibe Coding | Agentic Engineering | | ---------- | -------------------- | --------------------------- | ------------------------------------- | | Human Role | Writes code directly | Prompts AI, accepts output | Orchestrates agents, reviews, manages | | AI Role | Suggests/assists | Generates code autonomously | Plans, codes, tests, iterates | | Oversight | Full | Partial | Structured, layered | | Output | Deterministic | Stochastic, varied | Goal-oriented, quality-controlled | | Risk | Human bottleneck | Errors, lack of rigor | Oversight, governance | > ๐ Ref: > [Forbes โ "From Vibe Coding to Agentic Engineering"](https://www.forbes.com/councils/forbestechcouncil/2025/08/21/from-vibe-coding-to-agentic-engineering-redefining-sdlc-with-one-pizza-teams/) > ยท > [Morph LLM โ "The 2026 Paradigm Shift"](https://www.morphllm.com/blog/vibe-coding-to-agentic-engineering) --- ## 3. The Four Agentic Design Patterns Andrew Ng (DeepLearning.AI) identified **four foundational design patterns** that make AI agents far more capable than single-shot prompting. --- ## 3a. ๐ Reflection The agent **critiques, evaluates, and iteratively improves** its own output.
flowchart LR A[Generate] --> B[Evaluate] --> C[Revise] C -. "loop until
quality met" .-> A
- Code generators that review and debug their own code - Even simpler models, when used with reflection, **outperform advanced models** used without it --- ## 3b. ๐ง Tool Use Agents access **external tools** โ APIs, databases, web search, code execution โ extending beyond the LLM's built-in knowledge. - Research agents that search the web for up-to-date information - Agents that execute code, query databases, or interact with business software ## 3c. ๐ Planning Decompose a complex problem into a **sequence of actionable steps**, then execute them โ adjusting dynamically as new information emerges. - Break a project into: requirements โ design โ implementation โ testing - Dynamic replanning when intermediate steps fail --- ## 3d. ๐ฅ Multi-Agent Collaboration Distribute tasks among **multiple specialized agents** that interact, debate, critique, and combine their efforts. - "Planner" + "Coder" + "Reviewer" + "Integrator" agent teams - Debate systems where agents argue opposing sides to reach consensus > ๐ Ref: > [Andrew Ng โ Sequoia AI Ascent 2024](https://octetdata.com/blog/notes-andrew-ng-agentic-reasoning-2024/) > ยท [DeepLearning.AI โ Agentic AI Patterns](https://github.com/madeeha96/agentic-ai) ยท > [Continuum Labs โ Andrew Ng's presentation](https://training.continuumlabs.ai/agents/what-is-agency/andrew-ngs-presentation-on-ai-agents) --- ## 4. Agent Architecture: How Agents Work At their core, AI agents follow a **Perceive โ Plan โ Act โ Reflect** loop:
flowchart LR P["๐ Perceive"] --> PL["๐ Plan"] --> A["โก Act"] --> R["๐ Reflect"] R -. loop .-> P
--- ## 4. Memory Systems & Bounded Autonomy ### Memory Systems | Type | Purpose | Example | | -------------------------- | ------------------------- | ---------------------------------------------- | | **Short-term (Working)** | Current task context | Conversation history, current plan | | **Long-term (Persistent)** | Knowledge across sessions | Embeddings, vector stores, learned preferences | | **Episodic** | Past experiences | Previous task outcomes, error patterns | ### Key Principle: Bounded Autonomy Agents are **not fully autonomous**. They operate within defined boundaries: - What tools they can use - What actions require human approval - When to escalate vs. proceed independently > ๐ Ref: > [Google Cloud โ "What is Agentic AI?"](https://cloud.google.com/discover/what-is-agentic-ai) ยท > [arxiv โ "Agentic Software Engineering: Foundational Pillars"](https://arxiv.org/pdf/2509.06216) --- ## 5. Agentic Coding Tools Landscape The tools we use to write software are becoming **agentic** โ moving from autocomplete to autonomous coding: | Tool | Best For | Standout Feature | Autonomy | | ------------------------ | ----------------- | --------------------------------------------- | --------- | | **GitHub Copilot Agent** | Teams, compliance | Tightest IDE integration (VS Code, JetBrains) | High | | **Cursor** | Complex projects | Multi-file refactor, AI-native IDE | High | | **Windsurf** | Enterprises | Cascade engine, auto linter fixes | High | | **Claude Code** | DevOps, CLI users | Deep code reasoning, terminal-based | High | | **Devin** | Full automation | End-to-end autonomous engineer | Very High | | **OpenAI Codex** | CLI power-users | Multi-modal CLI, open models | Moderate | --- ## 5. What Makes Them "Agentic"? Unlike traditional autocomplete, these tools can: - โ **Plan** multi-step changes across files - โ **Execute** code, run tests, read errors - โ **Iterate** โ fix their own mistakes based on test/lint output - โ **Use tools** โ terminal, browser, file system - โ **Reflect** โ evaluate their output and improve it > ๐ Ref: > [aistatus.org โ "Best Agentic Coding Tools"](https://aistatus.org/best-agentic-coding-tools) ยท > [aiagents.bot โ "Best AI Coding Assistants"](https://aiagents.bot/blog/best-ai-coding-assistants-2025) --- ## 6. Agentic Frameworks for Developers Four major frameworks dominate the agentic AI space. --- ## 6a. LangGraph & AutoGen ### LangGraph (LangChain) - **Graph-based state machine** architecture for stateful, cyclical workflows - Enterprise-ready: error handling, checkpointing, durable execution - Used by Klarna, Uber, Replit - โ ๏ธ Steep learning curve ### AutoGen (Microsoft) - **Conversation-driven** multi-agent system - Agents delegate tasks and communicate through dialogue - Great for rapid prototyping of multi-agent capabilities --- ## 6b. CrewAI & Semantic Kernel ### CrewAI - **Role-based team** paradigm โ each agent specializes in a skill - Centralized task delegation and coordination - Lower token usage and latency in benchmarks ### Semantic Kernel (Microsoft) - **Plugin/skill architecture** for enterprise integration - Composable connectors for existing APIs and business processes - Production-grade orchestration, logging, monitoring --- ## 6. Framework Comparison | Framework | Architecture | Best For | Learning Curve | | ------------------- | -------------------------- | ----------------------------- | -------------- | | **LangGraph** | Graph-based state machines | Production, complex workflows | Steep | | **AutoGen** | Conversation-driven | Multi-agent prototyping | Moderate | | **CrewAI** | Role-based teams | Structured collaboration | Easy | | **Semantic Kernel** | Plugin/skill-based | Enterprise integration | Moderate | > ๐ Ref: [Turing โ "AI Agent Frameworks"](https://www.turing.com/resources/ai-agent-frameworks) ยท > [arxiv โ "Agentic AI Frameworks: Architectures, Protocols"](https://arxiv.org/html/2508.10146v1) ยท > [mem0.ai โ "Agentic Frameworks Guide"](https://mem0.ai/blog/agentic-frameworks-ai-agents) --- ## 7. The Protocols: MCP and A2A Two open protocols are standardizing how agents connect to the world and to each other. --- ## 7a. MCP โ Model Context Protocol > "The USB port for AI" โ a universal way for LLMs to connect to external data, tools, and services. Introduced by **Anthropic** in November 2024.
flowchart LR C["๐ค AI Client
(Claude, etc)"] <-- "JSON-RPC 2.0" --> S["๐ง MCP Server
(Tools/Data)"]
- **Resources** โ structured data (documents, DB rows) - **Tools** โ executable functions (APIs, scripts) - **Prompts** โ workflow templates - Inspired by the **Language Server Protocol** (LSP) - Adopted by OpenAI, Google DeepMind; donated to the **Linux Foundation** --- ## 7b. A2A โ Agent-to-Agent Protocol > Enables secure, interoperable communication **between agents**, regardless of vendor or framework. Introduced by **Google** in 2025.
flowchart LR A["๐ค Agent A
(Any vendor)"] <-- "A2A Protocol" --> B["๐ค Agent B
(Any vendor)"]
- **Agent Cards** โ JSON capability manifests at well-known endpoints - **Tasks** โ formal lifecycle: submitted โ working โ completed/failed - **Opaque execution** โ agents don't expose internal state - 150+ partners; governed by the **Linux Foundation** --- ## 7c. MCP vs A2A โ Complementary, Not Competing | Aspect | MCP | A2A | | ------------ | ------------------------------------------------------------------------- | ---------------------------------------- | | **Focus** | Agent โ Tools/Data | Agent โ Agent | | **Purpose** | Give an agent access to external capabilities | Let agents collaborate across boundaries | | **Analogy** | USB port (connect peripherals) | Network protocol (connect computers) | | **Together** | MCP enriches each agent's capabilities; A2A lets those agents collaborate | > ๐ Ref: > [Anthropic โ "Introducing the Model Context Protocol"](https://www.anthropic.com/news/model-context-protocol) > ยท > [Google Developers Blog โ "Announcing A2A"](https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/) > ยท > [Galileo โ "A2A Protocol Explained"](https://galileo.ai/blog/google-agent2agent-a2a-protocol-guide) --- ## 8. Best Practices for Agentic Engineering ### ๐ Identity & Scope - Treat each agent as a **distinct, non-human principal** - Assign narrow, well-defined roles and permissions - Avoid over-privileged service accounts ### ๐ฏ Bounded Autonomy - Limit actions agents can take **without human approval** - Especially for high-risk or business-impacting operations - Implement logging and rationale-tracking for significant actions ### ๐๏ธ Observability & Traceability - Embed robust monitoring to track agent actions and outputs - Maintain **full audit logs** for investigation and improvement - Know what your agents are doing at all times --- ## 8. Best Practices (continued) ### ๐ก๏ธ Policy Enforcement at Boundaries - Shift from "prompt engineering" to **hard controls** at boundaries - Where agents interact with identities, tools, data, and output - Regularly review and enforce boundary policies ### ๐ค Human-in-the-Loop - For high-stakes or regulated domains, **maintain human oversight** - The EU AI Act may mandate human oversight for certain applications - Design escalation paths for edge cases ### ๐ Lifecycle Vigilance - Address risks throughout the **entire lifecycle**: development, training, deployment, operation - Not just during initial deployment > ๐ Ref: > [MIT Technology Review โ "From Guardrails to Governance"](https://www.technologyreview.com/2026/02/04/1131014/from-guardrails-to-governance-a-ceos-guide-for-securing-agentic-systems/) > ยท > [McKinsey โ "Deploying Agentic AI with Safety and Security"](https://www.mckinsey.com/capabilities/risk-and-resilience/our-insights/deploying-agentic-ai-with-safety-and-security-a-playbook-for-technology-leaders) > ยท [IBM โ Agentic AI Security Guide](https://www.ibm.com/think/insights/agentic-ai-security) --- ## 9. Risks & Challenges ### โ ๏ธ Hallucination & Misalignment - Agents can **make mistakes** (hallucinate outputs) or misinterpret goals - Especially dangerous when agents take **real-world actions** based on wrong conclusions - Requires output verification and feedback loops ### ๐ Cybersecurity Threat Surfaces - Agents are **"digital insiders"** โ if compromised, system-wide impact - Expands attack surfaces beyond simple prompt injection - Must anticipate insider-style attacks (intentional or unintentional) ### ๐งฉ Orchestration Complexity - Multiple agents working in concert โ ensuring consistent behavior is hard - Avoiding conflicts and securing multi-agent workflows grows harder at scale - Need formal coordination protocols --- ## 9. Risks & Challenges (continued) ### โ๏ธ Legal & Ethical Risks - Misaligned output can create **significant liability** - Privacy violations, bias, regulatory infractions - Organizations can be held liable for agent errors ### ๐ Governance Gaps - Many organizations **lack mature frameworks** for agent governance - No clear inventories of deployed agents and their privileges - OWASP released **Top 10 Risks for Agentic AI** in December 2025 > ๐ Ref: > [MIT Sloan โ "Agentic AI, explained"](https://mitsloan.mit.edu/ideas-made-to-matter/agentic-ai-explained) > ยท > [Deloitte โ "Agentic AI Orchestration & Governance"](https://www.deloitte.com/us/en/what-we-do/capabilities/applied-artificial-intelligence/articles/agentic-ai-orchestration-governance.html) > ยท > [OWASP โ "Top 10 Risks for Agentic AI"](https://genai.owasp.org/2025/12/09/owasp-genai-security-project-releases-top-10-risks-and-mitigations-for-agentic-ai-security/) > ยท > [IAPP โ "AI Governance in the Agentic Era"](https://iapp.org/resources/article/ai-governance-in-the-agentic-era) --- ## 10. The Developer's New Role The role of a software developer is **not disappearing** โ it's **evolving**: ### From Code Writer โ Orchestrator
flowchart LR subgraph Before B1["Developer
writes code, runs tests,
debugs, deploys"] end subgraph Now N1["Developer
defines goals, designs workflows,
reviews output, validates quality"] end Before --> Now
--- ## 10. Skills to Develop | Traditional Skill | Agentic Equivalent | | ----------------- | ----------------------------------------- | | Writing code | Designing agent workflows | | Debugging | Evaluating agent outputs | | Code review | Agent output validation | | Architecture | System-of-agents design | | Testing | Defining quality constraints & guardrails | ### "One Pizza Teams" with AI Agents - Small teams (2-3 humans) + AI agents as team members - Humans set strategy, agents execute - **The engineer who can effectively orchestrate AI agents will be 10x more productive** > ๐ Ref: > [Forbes โ "From Vibe Coding to Agentic Engineering: Redefining SDLC"](https://www.forbes.com/councils/forbestechcouncil/2025/08/21/from-vibe-coding-to-agentic-engineering-redefining-sdlc-with-one-pizza-teams/) --- ## 11. Demo: Multi-Agent Workflow with Semantic Kernel A .NET console application demonstrating the **Multi-Agent Collaboration** pattern:
flowchart LR I["๐ User Input"] --> R["๐ Researcher"] R --> W1["โ๏ธ Writer"] W1 --> Rev["๐ Reviewer"] Rev --> W2["๐ Writer revises"]
--- ## 11. Patterns Demonstrated - โ **Multi-Agent Collaboration** โ three specialized agents - โ **Planning** โ Researcher decomposes the topic - โ **Reflection** โ Reviewer critiques, Writer improves - โ **Tool Use** โ Semantic Kernel plugin architecture ```bash cd demo/AgenticDemo dotnet run ``` --- ## 12. Key Takeaways | # | Takeaway | | --- | -------------------------------------------------------------------------------------------------- | | 1 | Agentic Engineering is the **next evolution** beyond vibe coding โ from prompting to orchestrating | | 2 | Andrew Ng's **4 patterns** (Reflection, Tool Use, Planning, Multi-Agent) are the building blocks | | 3 | Agents follow a **Perceive โ Plan โ Act โ Reflect** loop with bounded autonomy | | 4 | **MCP** (Anthropic) connects agents to tools; **A2A** (Google) connects agents to each other | | 5 | Frameworks like **Semantic Kernel**, LangGraph, AutoGen, and CrewAI make it practical | | 6 | Coding tools (Copilot Agent, Cursor, Windsurf) are **already agentic** | | 7 | **Best practices**: bounded autonomy, observability, human-in-the-loop, policy enforcement | | 8 | **Risks are real**: hallucination, security, governance โ OWASP published Top 10 for Agentic AI | | 9 | The developer's role is **evolving**, not disappearing โ from writer to orchestrator | --- ## Resources ### Articles & Reports - ๐ [MIT Sloan โ "Agentic AI, explained"](https://mitsloan.mit.edu/ideas-made-to-matter/agentic-ai-explained) - ๐ [McKinsey โ "Deploying Agentic AI with Safety and Security"](https://www.mckinsey.com/capabilities/risk-and-resilience/our-insights/deploying-agentic-ai-with-safety-and-security-a-playbook-for-technology-leaders) - ๐ [Forbes โ "From Vibe Coding to Agentic Engineering"](https://www.forbes.com/councils/forbestechcouncil/2025/08/21/from-vibe-coding-to-agentic-engineering-redefining-sdlc-with-one-pizza-teams/) - ๐ [Glide โ "What is Agentic Engineering?"](https://www.glideapps.com/blog/what-is-agentic-engineering) - ๐ [IBM โ Agentic AI Security Guide](https://www.ibm.com/think/insights/agentic-ai-security) --- ## Resources (continued) ### Academic Papers - ๐ [arxiv โ "Agentic Software Engineering: Foundational Pillars and a Research Roadmap"](https://arxiv.org/pdf/2509.06216) - ๐ [Springer โ "Agentic AI: a comprehensive survey of architectures"](https://link.springer.com/article/10.1007/s10462-025-11422-4) - ๐ [arxiv โ "Agentic AI Frameworks: Architectures, Protocols, and Design Challenges"](https://arxiv.org/html/2508.10146v1) ### Protocols & Frameworks - ๐ [Anthropic โ Model Context Protocol (MCP)](https://www.anthropic.com/news/model-context-protocol) - ๐ [Google โ Agent-to-Agent Protocol (A2A)](https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/) - ๐ [Microsoft โ Semantic Kernel](https://learn.microsoft.com/en-us/semantic-kernel/overview/) - ๐ [OWASP โ Top 10 Risks for Agentic AI](https://genai.owasp.org/2025/12/09/owasp-genai-security-project-releases-top-10-risks-and-mitigations-for-agentic-ai-security/) ### Talks & Courses - ๐ฅ [Andrew Ng โ Sequoia AI Ascent 2024: Agentic Reasoning](https://octetdata.com/blog/notes-andrew-ng-agentic-reasoning-2024/) - ๐ [DeepLearning.AI โ Agentic AI Patterns Course](https://github.com/madeeha96/agentic-ai) ### Demo - ๐ง [Demo Project โ `./demo/AgenticDemo`](./demo/AgenticDemo/)