Microsoft's AI orchestration SDK for building agents, multi-agent systems, and AI workflows with Python. Supports OpenAI, Azure, Hugging Face, and 20+ LLM providers.
Install and use Microsoft's Semantic Kernel SDK to build AI agents, multi-agent systems, and orchestrated AI workflows in Python.
Semantic Kernel is a flexible agent framework with built-in support for prompt templating, tool-calling agents, multi-agent orchestration, and structured workflows.
Install the base package:
```bash
pip install --upgrade semantic-kernel
```
Install with optional integrations:
```bash
pip install --upgrade "semantic-kernel[hugging_face]"
pip install --upgrade "semantic-kernel[all]"
```
**Requirements:**
Create a `.env` file in your project root or set environment variables:
```bash
OPENAI_API_KEY=sk-...
OPENAI_CHAT_MODEL_ID=gpt-4
AZURE_OPENAI_API_KEY=...
AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com/
AZURE_OPENAI_CHAT_DEPLOYMENT_NAME=gpt-4
```
Or pass configuration directly to the service constructor:
```python
from semantic_kernel.connectors.ai.open_ai import AzureChatCompletion
chat_service = AzureChatCompletion(
    api_key="your-api-key",
    endpoint="https://your-resource.openai.azure.com/",
    deployment_name="gpt-4",
)
```
Semantic Kernel supports OpenAI, Azure OpenAI, Hugging Face, and 20+ other LLM providers.
Use the `Kernel` to invoke templated prompts with variable substitution:
```python
import asyncio

from semantic_kernel import Kernel
from semantic_kernel.connectors.ai.open_ai import OpenAIChatCompletion
from semantic_kernel.functions import KernelArguments

kernel = Kernel()
kernel.add_service(OpenAIChatCompletion())

prompt = """
Summarize the following text in exactly {{$num_words}} words:
{{$text}}
"""

async def main():
    result = await kernel.invoke_prompt(
        prompt,
        arguments=KernelArguments(
            num_words=10,
            text="Semantic Kernel is a flexible AI orchestration framework...",
        ),
    )
    print(result)

asyncio.run(main())
```
**Use this when:** You need templated prompts with variable injection, or want centralized AI service management.
Call AI services directly without the Kernel abstraction for low-level control:
```python
import asyncio

from semantic_kernel.connectors.ai.open_ai import (
    OpenAIChatCompletion,
    OpenAIChatPromptExecutionSettings,
)
from semantic_kernel.contents import ChatHistory

async def main():
    service = OpenAIChatCompletion()
    settings = OpenAIChatPromptExecutionSettings(temperature=0.7)

    chat_history = ChatHistory(system_message="You are a helpful assistant.")
    chat_history.add_user_message("Write a haiku about AI agents.")

    response = await service.get_chat_message_content(
        chat_history=chat_history,
        settings=settings,
    )
    print(response.content)

asyncio.run(main())
```
**Use this when:** You need full control over chat history, settings, or streaming responses.
Create agents with custom Python functions as tools and structured outputs:
```python
import asyncio
from typing import Annotated

from pydantic import BaseModel

from semantic_kernel.agents import ChatCompletionAgent
from semantic_kernel.connectors.ai.open_ai import (
    AzureChatCompletion,
    OpenAIChatPromptExecutionSettings,
)
from semantic_kernel.functions import kernel_function, KernelArguments

class RestaurantPlugin:
    @kernel_function(description="Get today's menu specials")
    def get_specials(self) -> Annotated[str, "Returns menu specials"]:
        return "Today's specials: Lobster Bisque ($12), Caesar Salad ($8)"

    @kernel_function(description="Get price for a menu item")
    def get_price(
        self, item: Annotated[str, "Menu item name"]
    ) -> Annotated[str, "Returns price"]:
        prices = {"Lobster Bisque": "$12", "Caesar Salad": "$8"}
        return prices.get(item, "Item not found")

class MenuItem(BaseModel):
    name: str
    price: float

async def main():
    settings = OpenAIChatPromptExecutionSettings()
    settings.response_format = MenuItem  # Force structured output

    agent = ChatCompletionAgent(
        service=AzureChatCompletion(),
        name="Restaurant-Assistant",
        instructions="You help customers with menu questions.",
        plugins=[RestaurantPlugin()],
        arguments=KernelArguments(settings=settings),
    )

    response = await agent.get_response("What's the soup special price?")
    print(response.content)

asyncio.run(main())
```
**Use this when:** You need AI agents that can call external tools/APIs or return structured data (Pydantic models).
Coordinate multiple specialized agents to collaborate on complex tasks:
```python
import asyncio

from semantic_kernel.agents import (
    ChatCompletionAgent,
    GroupChatOrchestration,
    RoundRobinGroupChatManager,
)
from semantic_kernel.agents.runtime import InProcessRuntime
from semantic_kernel.connectors.ai.open_ai import AzureChatCompletion

async def main():
    # Define specialized agents
    writer = ChatCompletionAgent(
        name="Writer",
        instructions="Generate creative marketing copy.",
        service=AzureChatCompletion(),
    )
    reviewer = ChatCompletionAgent(
        name="Reviewer",
        instructions="Critically evaluate copy and suggest improvements.",
        service=AzureChatCompletion(),
    )

    # Set up group chat with round-robin orchestration
    group_chat = GroupChatOrchestration(
        members=[writer, reviewer],
        manager=RoundRobinGroupChatManager(max_rounds=3),
    )

    runtime = InProcessRuntime()
    runtime.start()

    result = await group_chat.invoke(
        task="Create a tagline for an eco-friendly water bottle.",
        runtime=runtime,
    )
    final_output = await result.get()
    print(f"Final Tagline: {final_output}")

    await runtime.stop_when_idle()

asyncio.run(main())
```
**Use this when:** You need multiple AI agents to iteratively collaborate (e.g., writer → reviewer → editor workflows).
Model structured business workflows with the Process Framework (separate from agents):
```python
from semantic_kernel.processes import Process, ProcessStep

class DataValidationStep(ProcessStep):
    async def execute(self, context):
        # Validation logic
        pass
```
Connect to vector stores for retrieval-augmented generation (RAG):
```python
from semantic_kernel.connectors.memory.azure_ai_search import AzureAISearchMemoryStore
```
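For local experimentation, here is a minimal end-to-end sketch using the in-memory `VolatileMemoryStore` with `SemanticTextMemory` (a hedged example assuming the classic memory API is available in your installed version; the embedding model ID, collection name, and documents below are illustrative):
```python
import asyncio

from semantic_kernel.connectors.ai.open_ai import OpenAITextEmbedding
from semantic_kernel.memory import SemanticTextMemory, VolatileMemoryStore

async def main():
    # In-memory store for local testing; swap in a vector store connector
    # (e.g., Azure AI Search) for production RAG workloads
    memory = SemanticTextMemory(
        storage=VolatileMemoryStore(),
        embeddings_generator=OpenAITextEmbedding(ai_model_id="text-embedding-3-small"),  # illustrative model ID
    )

    # Index a few documents (collection name and text are illustrative)
    await memory.save_information(collection="docs", id="1", text="Semantic Kernel orchestrates AI agents.")
    await memory.save_information(collection="docs", id="2", text="Plugins expose Python functions as tools.")

    # Retrieve the most relevant snippet for a query
    results = await memory.search(collection="docs", query="How do agents call tools?", limit=1)
    for result in results:
        print(result.text, result.relevance)

asyncio.run(main())
```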
Stream LLM responses token-by-token:
```python
# Reuses `service`, `chat_history`, and `settings` from the direct chat example above
async for message in service.get_streaming_chat_message_contents(chat_history, settings):
    print(message.content, end="", flush=True)
```
1. **Async-First Design**: All LLM calls use `asyncio` — wrap code in `async def main()` and run with `asyncio.run(main())`
2. **Rate Limits**: Respect provider rate limits (especially OpenAI) — implement retry logic or use built-in retry settings
3. **Token Costs**: Monitor token usage with `response.metadata["usage"]` — long chat histories increase costs (see the sketch after this list)
4. **Plugin Security**: Never expose sensitive operations in plugins without validation — treat plugins as external APIs
5. **Structured Outputs**: Use Pydantic models + `response_format` for reliable JSON extraction (requires OpenAI models with JSON mode)
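A hedged sketch of point 3, assuming the connector populates `response.metadata["usage"]` on the `ChatMessageContent` returned by the direct chat example above (the exact keys vary by provider and model):
```python
# Continuing the direct chat example: `response` is the returned ChatMessageContent
usage = (response.metadata or {}).get("usage")
if usage:
    # OpenAI-style responses report prompt and completion token counts here
    print(f"Token usage: {usage}")
else:
    print("No usage metadata returned by this connector.")
```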
**Use Semantic Kernel when:**
**Consider alternatives when:**
**Issue**: `ImportError: No module named 'semantic_kernel'`
**Fix**: Run `pip install --upgrade semantic-kernel` in your virtual environment
**Issue**: Authentication errors with Azure OpenAI
**Fix**: Verify `AZURE_OPENAI_API_KEY` and `AZURE_OPENAI_ENDPOINT` are set correctly. Check Azure portal for correct deployment name.
**Issue**: Agents not calling plugins
**Fix**: Ensure plugin methods have the `@kernel_function` decorator and a clear `description`. Use a function calling–capable model (e.g., GPT-4, GPT-3.5-turbo).
**Issue**: Structured outputs return plain text
**Fix**: Verify your model supports JSON mode (OpenAI GPT-4/3.5-turbo). Set `response_format=YourPydanticModel` in `OpenAIChatPromptExecutionSettings`.
1. **Start simple**: Begin with Pattern 1 (prompt engineering) to learn the Kernel API
2. **Add tools**: Experiment with Pattern 3 to give agents custom capabilities
3. **Scale to multi-agent**: Use Pattern 4 for complex workflows requiring specialization
4. **Explore examples**: Clone the [Semantic Kernel repo](https://github.com/microsoft/semantic-kernel) and run samples locally
5. **Join the community**: [Discord](https://aka.ms/SKDiscord) | [GitHub Discussions](https://github.com/microsoft/semantic-kernel/discussions)