Posts tagged with "Cost Optimization"

Found 4 posts

April 2, 2026

Cost-Optimizing Azure OpenAI: PTUs, Batch, Caching in 2026

A concrete playbook for reducing Azure OpenAI bills in 2026. Break-even math for Provisioned Throughput Units, prompt-cache economics, the Batch API 50 percent discount, Foundry IQ for retrieval, tiered model routing, and the telemetry that keeps the wins honest.

Microsoft Ignite 2025

November 28, 2025

Microsoft Foundry: The AI Platform for the Agentic Era - Ignite 2025

From scientific research to enterprise AI transformation, discover how Microsoft Foundry unifies models from OpenAI, Anthropic, Cohere, Meta, and more into one secure platform. Learn intelligent model routing, cost optimization, and the game-changing Claude integration.

AI & Cloud Infrastructure

November 28, 2025

Fine-Tuning in Microsoft Foundry: Building Production-Ready AI Agents - Microsoft Ignite 2025

Microsoft Ignite BRK188: Fine-tuning in Microsoft Foundry transforms generic models into production-ready agents. Synthetic data generation, supervised + reinforcement fine-tuning, 40-90% cost reduction, 95%+ accuracy. Real-world results: 2M docs/day, $27M savings.

Microsoft Ignite 2025

Microsoft Foundry

Fine-Tuning

Supervised Fine-Tuning

Reinforcement Fine-Tuning

Agentic RFT

Synthetic Data Generation

AI & Cloud Infrastructure

November 28, 2025

Running Open-Source AI Models at Scale: Azure Container Apps, AKS, and On-Premise Deployments - Microsoft Ignite 2025

Microsoft Ignite BRK117: Deploy open-source AI models (Llama 3.3, Mistral) with Azure Container Apps serverless GPUs, AKS with Kaido workflows, and on-premise infrastructure. Cost reduction 60-85%, data sovereignty, and hybrid architectures with Azure Arc.

Microsoft Ignite 2025

Azure Container Apps

Azure Kubernetes Service