Category Archives: Cloud & Datacenter Management

Is Your AI Safe? Protect It in Hybrid and Multicloud Environments with Microsoft Defender for Cloud

Security in hybrid and multicloud environments is no longer a marginal topic: it’s a strategic priority. The numbers are clear: the average cost of a breach has reached $4.44 million; 86% of decision-makers believe their cybersecurity strategy isn’t keeping pace with multicloud complexity; over 40% expect a skills shortage precisely in security administration roles. In this scenario, the attack surface expands, dependencies multiply, and SecOps teams must interpret fragmented signals coming from different platforms—often with limited resources.

A shift in perspective is needed, and AI itself makes it possible: an approach that combines real-time visibility, shared context, and intelligent automation, capable of keeping up with the speed of the cloud and the evolution of threats.

This article provides an overview of the evolutions of Microsoft Defender for Cloud and how the solution helps strengthen AI security in hybrid and multicloud environments.

How AI Enables a Paradigm Shift

AI is not simply a new tool: even in security, if adopted judiciously, it becomes an operational amplifier capable of transforming posture assessment, incident analysis, and collaboration across teams. In particular, it enables you to:

Continuously assess and improve security posture, with real-time visibility and context at “hyper-cloud” scale, thanks to automatic correlations between assets, identities, configurations, and risks.
Investigate and respond to threats with unprecedented speed and expertise, with AI-driven detections and strategies, risk-based prioritization, automated playbooks, and operational guidance.
Increase productivity and collaboration through natural-language workflows, using, for example, Copilot for triage, research, queries, runbooks, and reporting.

AI Attack Surface: Where Risks Lurk

Before implementing any controls, it’s essential to map the most exposed areas across the entire lifecycle of AI solutions—identities, network, data, models, supply chain, and operations—because that’s where risks accumulate and often go unnoticed.

Identity & access. Threats arise from unprotected keys, excessive privileges that pile up over time, and the absence of JIT/PIM mechanisms to limit access and permission duration.
Network. AI endpoints exposed to the internet, uncontrolled egress, and the lack of Private Endpoints open avenues an attacker can probe.
Data. In RAG architectures with unclassified sources, risk increases: loss of ACLs during indexing and leakage in prompts or logs can expose sensitive information.
Models. The use of unapproved families/versions, absence of content safety, and lack of anti-abuse testing expose you to harmful responses, jailbreaking, and non-compliant outputs.
ML supply chain. Dataset poisoning, unverified dependencies, and unsigned container images compromise upstream integrity, contaminating the entire training and release process.
Cost masking. Anomalous token/RPM usage, key scraping, and abuse by bots/scripts generate unexpected expenses and can mask fraudulent activity.
Operations. The lack of SLOs, absence of effective rollbacks, and weak BC/DR strategies make service continuity fragile and extend recovery times.

Mapping these weaknesses is not a theoretical exercise: it’s the prerequisite for designing targeted, measurable, and sustainable controls over time. It’s also about balancing costs and the level of security you aim to achieve.

How Microsoft Defender for Cloud Intervenes

To reduce risk and gain visibility in hybrid and multicloud environments, Defender for Cloud acts on multiple levels:

CSPM (Cloud Security Posture Management). It starts with posture: evaluates configurations, maps assets and dependencies, highlights deviations, and proposes concrete remediations. All with a unified multicloud view to compare criteria and priorities across different providers.
Workload protection (CWPP). Extends coverage to workloads—VMs, containers/Kubernetes, and PaaS services (databases, storage, app services)—combining hardening recommendations and detections on runtime and configurations.
AI detections and recommendations. Makes AI workloads visible and flags risks across configurations, identities, network, and logging, aligning with emerging best practices for AI security and governance.
SecOps integration. Closes the loop with operations: forwards events and alerts to Microsoft Sentinel and Defender XDR, enables automated playbooks, and supports guided investigations to reduce MTTD/MTTR.

The result is coordinated defense: from prevention to detection to response, with ready-to-use insights that speak the same language across all clouds.

AI Security Posture Management (CSPM): “Code-to-Cloud” Visibility for Generative AI

With the Defender Cloud Security Posture Management (CSPM) plan in Microsoft Defender for Cloud, security spans enterprise on-premises environments and hybrid/multicloud scenarios (Azure, AWS, Google Cloud), covering the entire lifecycle of generative AI applications: from code, to pipelines, to production runtime.

AI Bill of Materials (AI BOM)

Defender for Cloud discovers AI workloads and reconstructs the AI BOM: application components, data, and AI artifacts, from code to cloud. This end-to-end visibility makes it possible to identify vulnerabilities, prioritize risks, and protect generative applications with targeted interventions.

Continuous discovery of AI workloads is available for major services:

Azure OpenAI Service
Azure AI Foundry
Azure Machine Learning
Amazon Bedrock
Google Vertex AI (Preview)

In addition, Defender for Cloud detects vulnerabilities in dependencies of generative AI libraries (e.g., TensorFlow, PyTorch, LangChain) by analyzing source code (IaC misconfigurations) and container images (vulnerabilities).

Contextual Insights and Recommendations

Defender CSPM provides recommendations on identities, data security, and internet exposure, helping identify and prioritize critical issues.

DevOps security & IaC misconfigurations intercept misconfigurations that expose generative apps (excessive permissions, unintentionally published services), reducing breaches, unauthorized access, and compliance problems.

Examples of IaC controls for AI

Use of Private Endpoints for Azure AI Service.
Restricting Azure AI Service Endpoints.
Managed Identity for Azure AI service accounts.
Identity-based authentication for Azure AI service accounts.

In addition, the attack path analysis feature detects and helps mitigate risks to AI workloads, even when data and compute are distributed across Azure, AWS, and GCP.

What’s New: Defender for AI Services (Runtime Protection for Azure AI Services)

Defender for AI Services introduces runtime protection for Azure AI services (formerly threat protection for AI workloads). It is designed for risks specific to generative AI and combines Microsoft Threat Intelligence and Azure AI Content Safety (Prompt Shields) with real-time analytics to detect data leakage, data poisoning, jailbreaks, credential theft, wallet abuse, suspicious access patterns, and other malicious behaviors.

Overview — Protection Against AI Threats

The solution makes it possible to identify threats to generative AI applications in real time and assists in response with context-rich alerts and recommendations. It provides coverage for endpoints and AI resources present in subscriptions, highlighting risks that can impact applications.

Integration with Defender XDR

Protection for AI services integrates with Defender XDR, allowing you to centralize alerts related to AI workloads in the XDR portal and correlate alerts and incidents with identities, endpoints, network, and applications along the entire kill chain.

Evidence from User Prompts

With the protection plan active, it is optionally possible to include in alerts suspicious segments of user prompts and/or model responses originating from apps or AI resources. This evidence is customer data and helps with triage, classification, and intent analysis. It is available in the Azure portal, Defender portal, and via specific integrations.

Application and User Context in Alerts

To maximize actionability, the solution propagates to API calls to Azure AI the context of the user and application (e.g., userId, userIp, sessionId, appId, environment, requestId). This makes it possible to block users, correlate incidents, prioritize, and distinguish suspicious activity from expected behavior for a specific app.

Data and AI Security Dashboard: Unified View, Faster Decisions

The Data and AI Security Dashboard in Microsoft Defender for Cloud offers a centralized platform to monitor and manage data and AI resources, associated risks, and protection status. It highlights critical issues, resources requiring attention, and internet-exposed assets, enabling proactive mitigation. It also provides insights on sensitive data within data services and AI workloads.

Key Benefits

Unified view of all data and AI resources in a single interface.
Insights into data location and the types of resources that host it.
Assessment of protection coverage for data and AI resources.
Attack paths, recommendations, and data threat analysis in one place.
Mitigation of critical risks and continuous posture improvement.
Security explorer highlighting useful queries to uncover insights.
Identification and synthesis of sensitive data in cloud resources and AI assets.

Data Security with Microsoft Purview

To rigorously manage data used in AI applications, you can enable integration with Microsoft Purview. This feature requires a Microsoft Purview license and is not included in the Microsoft Defender for Cloud plan for AI services.

By enabling Purview, you allow the platform to access, process, and store request and response data—including associated metadata—originating from Azure AI services. In this way, you enable key data security and compliance scenarios, such as:

Sensitive Information Type (SIT) classification.
Analysis and reporting with Microsoft Purview DSPM for AI.
Insider risk management.
Communications compliance.
Microsoft Purview auditing.
Data lifecycle management.
Electronic discovery (eDiscovery).

In practice, this integration makes it possible to govern and monitor AI-generated data in alignment with corporate policies and regulatory requirements, fostering responsible, traceable, and compliant use of AI throughout the entire information lifecycle.

Conclusions

AI security in hybrid and multicloud environments requires a continuous, measurable, risk-oriented posture. Microsoft Defender for Cloud provides the tools to move from visibility to operational protection: discovery of workloads and AI BOM, contextual recommendations and attack path analysis, through to runtime protection with Defender for AI Services and incident correlation in Defender XDR and Microsoft Sentinel. Integration with Microsoft Purview makes it possible to govern the data that fuel models, ensuring traceability and compliance throughout the entire lifecycle.

The recommended path is clear: map the AI attack surface; enable CSPM and essential IaC controls; extend coverage to key workloads (VMs, containers, PaaS); activate runtime protection for Azure AI services; and centralize detection and response. Only then does AI become a multiplier of resilience rather than a new vector of risk. Finally, remember that absolute security in IT does not exist (except for systems that are powered off and completely isolated): it is therefore essential to balance costs, operational impact, and the desired level of protection, based on the value of assets and acceptable risk.

The 7 Pillars of AI Governance on Azure PaaS — A Practical Guide

AI is no longer theory; it’s everyday practice: pilot projects, enterprise chatbots, and new customer-facing features. Adoption is accelerating—often faster than an organization’s ability to govern it. In the midst of this race, Azure’s AI PaaS offerings provide a fast track to experiment and move services into production. But speed without guardrails comes at a cost: data exposure, unpredictable spend, opaque decision-making, and compliance risks that can slow innovation precisely when it should be accelerating.

Governance isn’t a brake on creativity—it’s the structure that lets AI become repeatable, safe, and measurable value. It means aligning investments with business goals, clarifying accountability, and defining controls, observability, and lifecycles; it means knowing where models live, who uses them, with what data, and at what cost. In Azure, where many capabilities are just “an API call away,” the line between a brilliant idea and an operational incident often comes down to the quality of your governance choices.

This article turns the Cloud Adoption Framework guidance into practical recommendations for governing Azure’s AI PaaS services. The journey is organized into seven complementary domains that together build a responsible AI posture: governing platforms, models, costs, Security, operations, regulatory compliance, and data.

In the chapters that follow, we’ll dive into each domain with an operational focus. The goal is simple: to lay the foundation for a governance framework that unlocks innovation, reduces risk, and keeps AI aligned with the business—today and as it evolves.

Governing AI Platforms

If the foundation isn’t consistent, every team ends up “doing its own thing.” Platform governance exists precisely to prevent that: to apply uniform policies and controls to Azure AI services so security, compliance, and operations stay aligned as architectures evolve.

Put this into practice:

Leverage built-in policies. With Azure Policy you’re not starting from scratch: there are ready-made definitions covering common needs—security setup, spending limits, compliance requirements—without custom development. Assign these policies to Azure AI Foundry, Azure AI Services, and Azure AI Search to standardize identity, Networking, logging, and required baseline configurations.
Enable Azure Landing Zone policy sets. Landing zones include curated, tested initiatives for AI workloads, already aligned with Microsoft recommendations. During deployment, select the Workload Specific Compliance category and apply the dedicated initiatives (e.g., Azure Openai, Azure Machine Learning, Azure AI Search, Azure Bot Service) to achieve broad, consistent coverage across environments.

Governing AI Models

A powerful but ungoverned model produces unpredictable results. Model governance ensures safe, reliable, and ethical outputs by setting clear rules for model inputs, outputs, and usage. Here’s what to implement:

Inventory agents and models.
Use Microsoft Entra Agent ID to maintain a centralized view of AI agents created with Azure AI Foundry and Copilot Studio. A complete inventory enables access enforcement and compliance monitoring.
Restrict approved models.
With Azure Policy, limit which model families/versions can be used in Azure AI Foundry. Apply model-specific policies to meet your organization’s standards and requirements.
Establish continuous risk detection. Before release and on a recurring basis:
- Enable AI workload discovery in Defender for Cloud to identify workloads and assess risks pre-deployment.
- Schedule regular red-team exercises on generative models to uncover weaknesses.
- Document and track identified risks to ensure accountability and continuous improvement.
- Update policies based on findings so controls stay effective and aligned with current risks.
Apply content-safety controls everywhere.
Configure Azure AI Content Safety to filter harmful content on both inputs and outputs. Consistent application reduces legal exposure and maintains uniform standards.
Ground your models.
Steer outputs with system messages and RAG (retrieval-augmented generation); validate effectiveness with tools like PyRIT, including regression tests for consistency, safety, and answer relevance.

Governing AI Costs

AI can burn through budget quickly if you don’t govern consumption, capacity, and usage patterns. The goal is predictable performance, controlled spend, and alignment with business objectives. Here’s what to put into practice:

Choose the right billing model for the workload.
For steady workloads, use commitment tiers / provisioned throughput. With Azure OpenAI, Provisioned Throughput Units (Ptus) offer more predictable costs than pay-as-you-go when usage is consistent. Combine PTU endpoints as primaries with consumption-based endpoints for spikes, ideally behind a gateway that routes traffic intelligently.
Select appropriately sized models—avoid overkill.
Model choice directly impacts cost; less expensive models are often sufficient. In Azure AI Foundry, review pricing and billing mechanics, and use Azure Policy to allow only models that meet your cost and capacity targets.
Set quotas and limits to prevent overruns.
Define per-model/per-environment quotas based on expected load and monitor dynamic quotas. Apply API limits (max tokens, max completions, concurrency) to avoid anomalous consumption.
Pick deployment options that are cost-effective and compliant.
Models in Azure AI Foundry support different deployment modes; prefer those that optimize both cost and regulatory requirements for your use case.
Govern client-side usage patterns.
Uncontrolled access makes spend explode: enforce network controls, keys, and RBAC; impose API limits; use batching where possible; and keep prompts lean (only the necessary context) to reduce tokens.
Auto-shut down non-production resources.
Enable auto-shutdown for VMs and compute in Azure AI Foundry and Azure Machine Learning for dev/test (and in production when feasible) to avoid costs during idle periods.
Introduce a generative gateway for centralized control.
A generative AI gateway enforces limits and circuit breakers, tracks token usage, throttles, and load-balances across endpoints (PTU/consumption) to optimize costs.
Apply cost best practices for each service.
Every Azure AI service has its own levers and pricing. Follow the service-specific guidance (e.g., for Azure AI Foundry) to choose the most efficient option for each workload.
Monitor consumption patterns and billing breakpoints.
Keep an eye on TPM (tokens per minute) and RPM (requests per minute) to tune models and architecture. Use fixed-price thresholds (e.g., image generation, hourly fine-tuning) and consider commitment plans when usage is steady.
Automate budgets and alerts.
In Azure Cost Management, set budgets and multi-threshold alerts to catch anomalies before they impact projects, maintaining financial control over AI initiatives.

Governing AI Security

Protecting data, models, and infrastructure requires consistent controls across identity, Networking, and runtime. The goal: reduce attack surface and preserve the reliability of your solutions. Here’s what to put into practice:

Enable end-to-end threat detection.
Turn on Microsoft Defender for Cloud on your subscriptions and enable protection for AI workloads. The service surfaces weak configurations and risks before they become vulnerabilities, with actionable recommendations.
Apply least privilege with RBAC.
Start everyone at Reader and elevate to Contributor only when truly needed. When built-in roles are too permissive, create custom roles that limit access to only the required actions.
Use managed identities for service authentication.
Avoid secrets in code or config. Assign a Managed Identity to every service that accesses model endpoints and grant only the minimum permissions required on application resources.
Enable just-in-time access for admin operations.
With Privileged Identity Management (PIM), elevation is temporary, justified, and approved—reducing privileged account exposure and improving traceability.
Isolate AI endpoint networking.
Prefer Private Endpoints and VNet integration to avoid Internet exposure. Where supported, use service endpoints or firewalls/allow-lists to permit access only from approved networks, and disable public network access on endpoints.

Governing AI Operations

Operations are what keep AI stable over time: without controls on lifecycle, continuity, and observability, even the best model stalls at the first hiccup. The objectives: reliability, clear recovery times, and steady business value.

Define model lifecycle policies.
Standardize versioning and compatibility with mandatory pre-rollout tests (functional, performance, and safety). Plan release strategies (shadow/canary/blue-green), rollback procedures, and deprecation/retirement rules valid across platforms (Azure AI Foundry, Azure Openai, Azure AI Services). Document dependencies, feature flags, and the version compatibility matrix.
Plan business continuity and disaster recovery.
Set RTO/RPO and configure baseline DR for resources exposing model endpoints: replicate across paired regions, use Infrastructure as Code (Bicep/Terraform) for rebuild, and place a gateway in front for failover and cross-instance/region routing. Where possible, enable zone redundancy; snapshot/backup configurations (prompts, safety settings, embeddings/vector stores); and run periodic tests to validate plans.
Configure monitoring and alerting for AI workloads.
Enable Azure Monitor / Log Analytics / Application Insights and set recommended alerts for Azure AI Search, Azure AI Foundry Agent Service deployments, and individual Azure AI Services. Track key SLIs (latency, 4xx/5xx error rates, timeouts, throughput, HTTP 429) and surface degradation before it impacts users. Centralize logs, Define slis, and create intervention runbooks with escalation paths and automated actions where feasible.

Governing Regulatory Compliance for AI

Regulatory compliance isn’t bureaucracy: it defines what’s acceptable, reduces legal risk, and builds trust. It requires a continuous, automated, and demonstrable process. Here’s what to put into practice:

Automate assessments and management.
Use Microsoft Purview Compliance Manager to centralize assessments and tracking, assign remediation actions, and maintain evidence. In Azure Policy, apply the Regulatory Compliance initiatives relevant to your sector to enforce controls and continuously monitor for deviations.
Build frameworks specific to your industry/country.
Rules differ by industry and geography: create targeted checklists and control mappings (privacy, Security, transparency, human oversight). Adopt standards such as ISO/IEC 23053:2022 to audit policies applied to machine learning workloads, and define a cadence for periodic reviews.
Make compliance auditable by design.
Define responsibilities (COOL), exception handling with expirations (waivers), and an evidence repository (policy assignments, change history, RBAC logs). Tie compliance KPIs to shared dashboards to demonstrate alignment and continuous improvement.

Governing AI Data

Without clear data rules, risks, costs, and inconsistent results grow. Data governance protects sensitive information and intellectual property, and underpins output quality. Here’s what to activate:

Centralized discovery and classification.
Use Microsoft Purview to scan, catalog, and classify data across the organization (data lakes, databases, storage, M365). Define consistent taxonomies/labels and leverage Purview SDKs to enforce policies directly in pipelines (e.g., block ingestion of “Confidential” data into noncompliant endpoints).
Maintain security boundaries across AI systems.
Indexing can decouple native source controls: require a security review before data flows into models, vector indexes, or prompts. Preserve and enforce ACLs/access metadata at the chunk level, limit exposure with Private Endpoints/VNet, and apply least privilege to indexing workflows. Accept only data that’s already classified and meets internal standards.
Prevent copyright violations.
Apply filters with Azure AI Content Safety — Protected Material Detection — on generative inputs and outputs. For training/fine-tuning, use only lawful sources and appropriate licenses, maintaining provenance and evidence (contracts, terms of use) for audits and disputes.
Version training and grounding (RAG) data.
Treat datasets like code: Snapshots, immutable versions, changelogs, and rollback. Align each model/endpoint version with the corresponding data version (documents, embeddings, filtering policies) to ensure consistency across environments and over time.

Conclusions

AI creates value when delivery speed is channeled within clear, measurable rules. Governance here doesn’t mean braking; it means scaling what works, knowing why it works, and proving it at every audit, incident, or business decision. The path is pragmatic: define a minimal, uniform baseline (identity, Networking, policy, logging), measure outcomes with a small set of shared indicators, automate as much as possible, and evolve controls at the same cadence as models and data. You don’t need perfection on the first try: you need short cycles, explicit accountability, and infrastructure as code to quickly replicate choices that prove effective. In this context, Azure’s PaaS platforms become reliable accelerators because they operate within predictable boundaries: rapid experimentation, yes—but with guardrails, observability, and continuity plans already built in. The result is innovation that stays aligned with the business, reduces risk and reliance on chance, and turns AI into a repeatable, sustainable enterprise asset.

RAG on Azure Local: the evolution of generative AI in hybrid environments

In the era of Artificial Intelligence, companies are required to combine computational power with distributed data management, as data is increasingly located across cloud environments, on-premises infrastructures, and edge settings. In this context, Azure Local emerges as a strategic solution, capable of extending the benefits of cloud computing directly into local data centers—where the most sensitive and critical workloads reside. After exploring this topic in the previous article, “AI from Cloud to Edge: Innovation Enabled by Azure Local and Azure Arc," this new piece focuses on a particularly significant evolution: the adoption of RAG Capabilities (Retrieval-Augmented Generation) within Azure Local environments. Thanks to Microsoft’s adaptive cloud approach, it is now possible to design, deploy, and scale AI solutions consistently and in a controlled manner, even in hybrid and multicloud scenarios. Azure Local thus becomes the enabler of a tangible transformation, bringing generative AI capabilities closer to the data, with clear benefits: reduced latency, preservation of data sovereignty, and greater accuracy and relevance of the generated results.

A Consistent AI Ecosystem from Cloud to Edge

Microsoft is building a consistent and distributed Artificial Intelligence ecosystem, designed to enable the development, deployment, and management of AI models wherever they are needed: in the cloud, on-premises environments, or at the edge.

This approach is structured into four key layers, each designed to address specific needs:

Application Development: With Azure AI Studio, developers can easily design and build intelligent agents and conversational assistants using pre-trained models and customizable modules. The development environment offers integrated tools and a modern interface, simplifying the entire AI application lifecycle.
AI Services: Azure offers a wide range of advanced AI services — including language models (based on OpenAI), machine translation, computer vision, and semantic search — which, until now, were limited to the cloud environment. With the introduction of RAG in Azure Local, these capabilities can now also be executed directly in local environments.
Machine Learning and MLOps: Azure Machine Learning Studio allows for efficient creation, training, optimization, and management of ML models. Thanks to the AML Arc Extension, all these features are now also available on local and edge infrastructures.
AI Infrastructure: Supporting all these layers is a solid and scalable technology foundation. Azure Local, together with Azure’s global infrastructure, provides the ideal environment for running AI workloads through containers and optimized virtual machines, ensuring high performance, Security, and compliance.

Microsoft’s goal is clear: to eliminate the boundary between the cloud and the edge, enabling organizations to harness the power of AI where the data actually resides.

What is Retrieval-Augmented Generation (RAG)

Within the unified AI ecosystem Microsoft is building, one of the most impactful innovations is Retrieval-Augmented Generation (RAG) — an advanced technique poised to revolutionize the approach to generative AI in the enterprise space. Unlike traditional models that rely solely on knowledge learned during training, RAG enriches model responses by dynamically retrieving up-to-date and relevant content from external sources such as documents, databases, or vector indexes.

RAG operates in two distinct but synergistic phases:

Retrieve: The system searches and selects the most relevant information from external sources, often built using enterprise data.
Generate: The retrieved content is used to generate more accurate responses, consistent with the context and aligned with domain-specific knowledge.

This architecture helps reduce hallucinations, increase response accuracy, and work with updated and specific data without retraining the model, thereby ensuring greater flexibility and reliability.

RAG on Azure Local: Generative AI Serving On-Premises Data

With the introduction of RAG Capabilities in Azure Local environments, organizations can now bring the power of generative AI directly to their data—wherever it resides: in the cloud, on-premises, or across multicloud infrastructures—without needing to move or duplicate it. This approach roots artificial intelligence in enterprise data and enables the native integration of advanced capabilities into local operational workflows.

The solution is available as a native Azure Arc extension for Kubernetes, providing a complete infrastructure for data ingestion, vector index creation, and querying based on language models. Everything is managed through a local portal, which offers essential tools for prompt engineering, monitoring, and response evaluation.

The experience is designed in a No-Code/Low-Code fashion, with an intuitive interface that allows even non-specialized teams to develop, deploy, and manage RAG applications.

Key Benefits

Data Privacy and Compliance: Sensitive data remains within corporate and jurisdictional boundaries, allowing the model to operate securely and in compliance with regulations.
Reduced Latency: Local data processing enables fast responses, which are crucial in real-time scenarios.
Bandwidth Efficiency: No massive data transfers to the cloud, resulting in optimized network usage.
Scalability and Flexibility: Thanks to Azure Arc, Kubernetes clusters can be deployed, monitored, and managed on local or edge infrastructures with the same operational experience as the cloud.
Seamless Integration with Existing Environments: RAG capabilities can be directly connected to document repositories, databases, or internal applications, enabling scenarios such as enterprise chatbots, intelligent search engines, or vertical digital assistants—natively and without invasive infrastructure changes.

This capability represents a fundamental element in Microsoft’s strategy: to make Azure the most open, extensible, and distributed AI platform, capable of enabling innovation wherever data resides and transforming it into a true strategic asset for the digital growth of organizations.

Advanced RAG Capabilities on Azure Local

The RAG capabilities available in Azure Local environments go beyond simply bringing generative AI closer to enterprise data—they represent a comprehensive set of advanced tools designed to deliver high performance, maximum flexibility, and full control, even in the most demanding scenarios. Thanks to continuous evolution, the platform is equipped to support complex and dynamic use cases, while keeping quality, Security, and responsibility at the forefront.

Here are the main advanced features available:

Hybrid Search and Lazy Graph RAG (coming soon): The combination of hybrid search with the upcoming support for Lazy Graph RAG enables the creation of efficient, fast, and low-cost indexes, providing accurate and contextual responses regardless of the nature or complexity of the query.
Performance Evaluation: Native evaluation pipelines allow structured testing and measurement of RAG system effectiveness. Multiple experimentation paths are supported—helpful for comparing different approaches in parallel, optimizing prompts, and improving response quality over time.
Multimodality: The platform natively supports text, images, documents, and—soon—videos. By leveraging the best parsers for each format, RAG on Azure Local can process unstructured data located on NFS shares, offering a unified and in-depth view across various content types.
Multilingual Support: Over 100 languages are supported during both ingestion and model interactions, making the solution ideal for organizations with a global presence or diverse language requirements.
Always-Up-to-Date Language Models: Each update of the Azure Arc extension provides automatic access to the latest models, ensuring optimal performance, enhanced security, and alignment with the latest advancements in generative AI.
Responsible and Compliant AI by Design: The platform includes built-in capabilities for managing security, regulatory compliance, and AI ethics. Generated content is monitored and filtered, helping organizations comply with internal policies and external regulations—without placing additional burden on developers.

Key Use Cases of RAG on Azure Local

The integration of RAG into Azure Local environments delivers tangible benefits across several sectors:

Financial Services: in the financial sector, RAG can analyze sensitive data that must remain on-premises due to regulatory constraints. It can automate compliance checks on documents and transactions, provide personalized customer support based on financial data, and create targeted business proposals by analyzing individual profiles and preferences.
Manufacturing: for manufacturing companies, RAG is a valuable ally for enhancing operational efficiency. It can offer real-time assistance in problem resolution through analysis of local production data, help identify process inefficiencies, and support predictive maintenance by anticipating failures through historical data analysis.
Public Sector: public administrations can leverage RAG to gain insights from the confidential data they manage. It’s useful for summarizing large volumes of information to support quick and informed decision-making, creating training materials from existing documentation, and enhancing public safety through predictive analysis of potential threats based on local data.
Healthcare: in the healthcare sector, RAG enables secure handling of clinical data, delivering value across multiple areas. It can support the development of personalized treatment plans based on patient data, facilitate medical research through clinical information analysis, and optimize hospital operations by analyzing patient flow and resource usage.
Retail: in the retail sector, RAG can enhance customer experiences and streamline business operations. It is effective for creating personalized marketing campaigns based on purchasing habits, optimizing inventory management through sales data analysis, and gaining deeper insights into customer behavior to refine product and service offerings.

Conclusion

The integration of RAG capabilities within Azure Local environments marks a significant milestone in the maturity of distributed Artificial Intelligence solutions. With an open, extensible, and cloud-connected architectural approach, Microsoft enables organizations to leverage the benefits of generative AI consistently—even in hybrid and on-premises scenarios. RAG capabilities, in particular, allow advanced language models to connect with the contextual knowledge stored in enterprise systems—without compromising governance, Security, or performance. This evolution makes it possible to create intelligent, secure, and customized applications across any operational context, accelerating the time-to-value of AI across multiple industries. Azure Local with RAG represents a strategic opportunity for businesses that want to govern Artificial Intelligence where data is born, lives, and generates value.

SQL Server Licensing: How Azure Arc Can Change the Rules

In my previous article, I explored how Azure Arc enables organizations to harness the power of the Azure cloud in managing SQL Servers, regardless of where the databases reside: on-premises, at the edge, or in other cloud environments. This extension of the Azure platform allows for centralized governance, enhanced security, and advanced features without requiring a full migration to the cloud.

But once this new management approach is enabled, what services are available, and how is licensing handled? What models are available, and how do they differ from traditional SQL Server licensing?

In this article, we’ll answer these questions by delving into the SQL Server licensing model enabled by Azure Arc and comparing the different approaches to help organizations choose the solution that best fits their needs.

Features Included at No Additional Cost

Azure Arc for SQL Server provides many features at no extra charge, depending on the type of license held. If the organization already has a SQL Server license with Software Assurance (L+that) or opts for the PAYG (Pay-As-You-Go) model, it can access advanced tools for free, such as:

Best practices assessment
Automated patching
Automated local backups
Point-in-time restore
TDE encryption via Azure Key Vault

For customers with a License-only (L-only) model, even without SA, key governance features are still included—such as resource inventory, failover cluster management, and support for Always-On Availability Groups.

These capabilities allow for a cloud-like management experience, even while keeping databases on local infrastructure.

Figures 1 – SQL Server enabled by Azure Arc pricing model

Value-Added Advanced Services

Naturally, Azure Arc also enables the extension of feature sets through optional paid services, which can be activated selectively based on need:

Microsoft Defender for SQL Server, for advanced protection
Log Analytics and Azure Monitor, for deep monitoring
Azure Policy, for configuration and compliance management
Purview, for data governance
Cluster-aware patching and long-term backups to Azure or Amazon S3, for resilient and modern operations

This modularity allows organizations to scale their management capabilities based on actual needs while maintaining control over costs.

A New Perspective on Licensing Management

Traditionally, SQL Server licensing has been based mainly on Enterprise Agreements and Software Assurance contracts, binding companies to three-year purchases and requiring accurate forecasting of future usage. However, this approach doesn’t align well with modern IT environments, which are marked by workload fluctuations, hybrid adoption, and the need for more dynamic cost optimization.

Limitations of Traditional Licensing

In the face of this new flexibility, it’s worth highlighting the shortcomings of the traditional model. In addition to rigid contracts and lack of flexibility for workloads, organizations often face:

Difficulty tracking actual usage
Risk of under- or over-provisioning
Unexpected and costly true-ups
Complexities in managing across multiple teams and locations

In hybrid and distributed scenarios, these limitations can slow down processes and increase costs.

This is exactly where Azure Arc comes in—not only to extend management functionalities but also to introduce new licensing models that overcome past limitations.

The PAYG Model: Licensing That Fits

To meet these needs, Azure Arc offers a Pay-As-You-Go (PAYG) model for SQL Server, allowing organizations to pay strictly for what they use—hourly or monthly.

The benefits are significant:

No upfront costs: Ideal for temporary environments, testing, or seasonal workloads.
Adaptability: Licensing follows actual usage, reducing waste.
Targeted billing: Costs can be broken down by project, department, or individual server.
Visibility and control: The Azure portal enables continuous monitoring, compliance checks, and role-based access.
Cost-saving opportunities: PAYG licenses can be included in MACC agreements and treated as OpEx, making spending more predictable.

Conclusion

The true value of Azure Arc for SQL Server lies not only in its technical capabilities but in the innovative operating model it enables: greater visibility, centralized control, process automation, and cost optimization.

Whether it’s environments under strict regulatory requirements, intermittent workloads, or gradual modernization journeys, Azure Arc offers a flexible licensing approach that aligns perfectly with real business needs.

Azure Arc truly revolutionizes SQL Server license management, moving beyond a traditional, often rigid and complex model, to embrace a dynamic, transparent model that is natively integrated with Azure cloud tools.

This evolution allows organizations to respond more agilely to the challenges of an increasingly distributed IT landscape, making the most of existing infrastructure and accelerating digital transformation.

On-Premises GPUs for AI and Virtual Desktop: How Azure Local is Changing the Game

In recent years, the adoption of technologies based on Artificial Intelligence, machine learning, and desktop virtualization has accelerated dramatically. However, behind the innovation visible to end users lies a fundamental requirement: high-performance IT infrastructures capable of efficiently handling complex workloads. While the cloud often appears to be the go-to solution, it is not always the only — or the most suitable — option for every need.

In certain scenarios, organizations are increasingly demanding local solutions that deliver high performance, low latency, and strict data control. This need is often driven by concerns related to security, regulatory compliance, and specific architectural constraints that require workloads to run directly in on-premises environments.

This is where Azure Local comes into play — a Microsoft offering that redefines the concept of hybrid infrastructure. Combining the power of Azure cloud with the flexibility of local deployment, Azure Local enables full utilization of GPUs directly within your datacenter, empowering AI and Virtual Desktop Infrastructure (VDI) scenarios with high performance and full operational control.

Why Use On-Premises GPUs?

Let’s start with a key point: enterprise-grade GPUs — those built for datacenters and heavy workloads—have become essential for handling complex tasks such as:

Training and inference of AI models
Real-time video and visual processing
Virtualization of graphic-intensive desktops and compute-heavy environments

But what happens when these workloads need to run in environments where:

Internet connectivity is absent, unstable, or unsuitable for continuous traffic
The data being processed is too sensitive or regulated to be moved to the public cloud
Applications require immediate response times without delays from round trips to Azure

In these cases, having on-premises GPUs fully integrated into your local infrastructure is not just a strategic choice — it’s often a necessity. This is where Azure Local steps in, enabling organizations to harness the power of enterprise GPUs right in their datacenter, with the simplicity and scalability of the Azure experience.

What is Azure Local?

Azure Local is essentially the extension of Microsoft’s cloud services directly into your own datacenter. It delivers a selection of Azure services, the same APIs, and the same management model—but with the option to run everything locally, wherever it’s needed: on-premises or at the edge.

With Azure Local, you can deploy applications, virtual desktops, and AI models within your own infrastructure while retaining complete data control — benefiting from the flexibility, scalability, and operational consistency of the cloud. No need to move sensitive data. No compromise on the Azure experience. Just the resources you need, right where you need them.

Azure Local + GPUs: A Powerful Combination

One of Azure Local’s most compelling features is its native GPU support, allowing you to tackle AI workloads and VDI environments with high performance and operational efficiency. You can choose between two usage modes:

DDA – Discrete Device Assignment: The GPU is exclusively assigned to a single virtual machine. This is the most powerful mode, ideal for scenarios requiring maximum compute power, such as AI model training, deep learning, or advanced rendering.
GPU-P – GPU Partitioning: In this mode, the GPU is divided into multiple virtual partitions, each assignable to a VM. Perfect for maximizing efficiency and supporting multi-user environments like VDI.

Both modes are fully compatible with NVIDIA drivers and support major compute and graphics libraries, including CUDA, OpenGL, and DirectX.

Which GPUs Are Supported?

Supported models currently include:

NVIDIA A2 and A16 – Supported in both DDA and GPU-P modes
Nvidia A10, A40, L4, L40, L40S – Supported in GPU-P mode

All are centrally manageable through Azure Arc, ensuring full control and visibility — even in the most distributed environments.

What About Virtual Desktops?

This is where things get even more interesting.

With Azure Virtual Desktop on Azure Local, you can deliver modern, high-performance, and secure desktop experiences directly within your on-premises environments. This means bringing the benefits of cloud-native VDI to where it truly matters, with session hosts physically close to end users.

The result? A significantly improved user experience, thanks to:

Ultra-low latency, ideal for on-site users or limited connectivity environments
Optimized performance for graphics applications and compute-intensive workloads
Data that remains on-premises, ensuring security and compliance
Full compatibility with Windows 11 and 10 in multi-session mode
Native integration with traditional Active Directory and Microsoft Entra ID

All orchestrated via the Azure portal, with the same provisioning, monitoring, and management tools—simplifying administration and ensuring operational consistency between cloud and datacenter.

AI Where It Truly Matters

When it comes to Artificial Intelligence, Azure Local is a game changer. You can now train, deploy, and manage AI models directly on-premises or at the edge, without relying on the cloud for every step of the process.

How? With two key technologies:

Edge RAG (Retrieval-Augmented Generation): Enhances generative models by integrating your local data—without ever moving it out of your environment. An ideal solution for highly secure and confidential use cases such as healthcare, government, or regulated industries.
Azure Machine Learning with Azure Arc: A unified platform for managing the entire lifecycle of AI models — from training to deployment — whether in the cloud or on-premises, using the same tools, APIs, and capabilities.

The result? In hybrid, secure, scalable, and fully localized AI ecosystem, designed to bring intelligence right where it’s needed: close to your data, your users, and your business-critical processes.

But Is It Complicated to Set Up?

Absolutely not. One of Microsoft’s main goals has been to simplify the configuration and management experience of Azure Local — even in GPU scenarios.

To get started, you simply need:

GPU-compatible physical hosts (over 100 validated models available)
VMs configured according to recommended technical requirements for DDA or GPU-P
Connection to Azure Arc for centralized, consistent, and secure management

Once the environment is up and running, you can operate just as you would in the Azure cloud — but with your own data, your own network, and full infrastructure control. No added complexity—just greater operational flexibility.

Conclusion

In a constantly evolving tech landscape, where performance, Security, and compliance demands are increasingly strict, Azure Local stands out as a true game changer. The ability to bring enterprise GPUs directly into local datacenters—while preserving the experience, scalability, and consistency of Azure cloud — empowers organizations to effectively tackle AI, VDI, and high-performance workloads.

Whether it’s about achieving ultra-low latency, protecting sensitive data, or operating in limited-connectivity environments, Azure Local offers a modern and tangible solution, with a flexible, manageable, and most importantly, accessible hybrid approach.

This is not simply about “bringing the cloud on-premises.” It’s about redefining how IT infrastructure supports core business processes, enabling advanced scenarios without compromise in control, performance, or security.

Ultimately, Azure Local is an excellent choice for those looking to bring innovation exactly where it’s needed most: close to the data, the users, and the everyday operational needs.

Bringing the Power of the Cloud to SQL Server Anywhere with Azure Arc

In today’s IT landscape, characterized by increasingly complex and distributed infrastructures, Azure Arc represents a strategic solution to address the challenges of multi-cloud and hybrid management. Azure Arc is an extension of the Microsoft Azure platform that allows managing, governing, and securing distributed resources across on-premises, multi-cloud, and edge environments, ensuring a unified experience. With Azure Arc, hybrid infrastructures can be managed as native Azure resources, simplifying governance and automation. Furthermore, Azure Arc-enabled services allow Azure workloads to be deployed and executed outside the public cloud while maintaining centralized control. This article explores how Azure Arc for SQL Server enables organizations to modernize the governance and management of their databases without necessarily migrating to the cloud.

Azure Arc for SQL Server

Azure Arc for SQL Server extends Azure capabilities to SQL databases residing on-premises, in other clouds, or edge environments. This solution allows managing, securing, and governing SQL Server just like a native Azure database without requiring data migration to the cloud. Thanks to this technology, businesses can ensure high performance and security while adapting to their specific needs. Additionally, it enables organizations to adopt a continuous modernization strategy, leveraging advanced cloud tools to enhance SQL Server database lifecycle management.

Figure 1 – Extending Azure Services to SQL Server

Specifically, Azure Arc connects SQL servers to the Azure control plane, enabling:

Monitoring and managing SQL instances from a single interface, regardless of their location.
Applying governance policies uniformly across all databases, ensuring compliance and security.
Integrating advanced features such as Defender for SQL Server and Azure Policy to enhance data protection.
Automating updates and patching, reducing security vulnerability risks.

Enabling SQL Server with Azure Arc

The following procedure connects SQL Server instances to the Azure control plane:

1. Script Generation and Execution on the Server

A configuration script must be executed on the virtual or physical machine hosting SQL Server.
This script installs the necessary components to integrate the server with Azure Arc.

2. Creating Local Services

The Arc agent is installed, allowing the discovery of SQL Server instances and integration with Azure services.
The Azure extension for SQL Server is enabled to provide advanced management features.

3. Resource Registration in Azure

The server is registered as an Arc-enabled server, while each SQL instance is registered as an Arc-enabled SQL Server.
This integration allows SQL Server to be managed as a native Azure resource while keeping the database on local or multi-cloud infrastructures.

4. Enabling the Azure Monitoring Agent

The monitoring agent collects event logs, performance metrics, and resource usage data.
These data can be analyzed in Azure Monitor for performance tracking and security.

5. Onboarding Scalability

The onboarding process can be automated at scale using tools like Group Policy or System Center Configuration.
This allows managing a large number of SQL servers with a single centralized configuration.

Figure 2 – SQL Server Architecture Enabled by Azure Arc

Key Benefits

Azure Arc for SQL Server is ideal for organizations looking to keep their databases on-premises or in hybrid environments while leveraging Azure’s power and advanced management tools without fully migrating to the cloud. The key benefits include:

Simplified Management – A single Azure interface enables centralized governance of distributed databases.
Enhanced Security – Advanced governance, encryption, and access management features protect data uniformly.
High Reliability – Support for High Availability and Disaster Recovery (HA/DR) ensures operational continuity.
Seamless Integration with Azure – Direct connection with tools like Defender for SQL, Azure Monitor, and Purview.
Reduced Downtime – Advanced automation minimizes the risk of downtime and operational disruptions.
Improved Compliance Management – Enhanced ability to meet corporate and industry regulations.

Management Capabilities

The following image compares management capabilities across three SQL Server environments:

Traditional SQL Server (on customer infrastructure or third-party cloud)
Azure Arc-enabled SQL Server
SQL Server on an Azure Virtual Machine

Figure 3 – Management Capabilities Comparison

Key Differences and Benefits of Azure Arc for SQL Server

Traditional SQL Server lacks most advanced management and security features, such as automatic patching, backup, monitoring, and compliance. Azure Arc introduces numerous advanced capabilities for SQL Server on-premises or in other clouds, including:

Inventory Management
Automated Patching and Backup
Monitoring and Security with Defender for SQL Server
TDE Encryption with Azure Key Vault
Point-in-time Restore
HA/DR Management and Cluster-aware Updates

SQL Server on an Azure VM offers a high level of functionality, including integration with Azure Policy, long-term backup, and Purview Premium—some of which are planned as future features for Arc.

Conclusion

Azure Arc for SQL Server represents a strategic solution for organizations seeking to modernize their database management without necessarily migrating to the cloud. By integrating with the Azure control plane, this technology provides a unified and centralized view of all SQL Server instances, regardless of their location, simplifying governance, security, and compliance operations.

AI from Cloud to Edge: Innovation Powered by Azure Local and Azure Arc

In the era of Artificial Intelligence, which is significantly transforming business models, the adoption of local and distributed infrastructures is crucial for managing specific and mission-critical workloads. In this context, Azure Local emerges as an innovative solution capable of bridging the gap between cloud and edge computing, delivering applications, data, and AI services exactly where they are needed. This article will explore real-world scenarios where Azure Local, combined with Azure Arc, enables real-time data processing “at the source” and the deployment of advanced AI solutions. We will also delve into the new Azure AI services designed for Azure Local, focusing on maximizing the potential of on-premises data.

Real-World Scenarios of Local and Distributed Infrastructure with Azure Local

In the following sections, we will examine concrete examples that demonstrate how Azure Local, in synergy with Azure Arc, effectively addresses the needs of distributed infrastructure, ensuring low latency, security, and operational continuity across various business and industrial contexts.

Figure 1 – Real-World Scenarios for Local and Distributed Infrastructure with Azure Local

Local AI Inferencing

In many situations, analyzing data in real-time directly at the edge (e.g., within a retail store or an industrial facility) provides significant advantages in terms of latency and reduced bandwidth usage. Azure Local enables on-site data processing, eliminating the need to transfer all data to the cloud before performing critical analyses. Here are some examples:

Retail Loss Prevention: With AI integrated locally, suspicious behaviors and potential thefts can be identified in real-time, allowing retailers to act immediately and reduce losses.
Smart Self-Checkout: Video surveillance and visual analysis facilitate automatic item recognition, improving customer experience and reducing wait times.
Pipeline Monitoring: In sectors like oil & gas, real-time video monitoring of infrastructure helps detect anomalies and leaks, reducing environmental risks and ensuring timely interventions.

Operational Continuity in Mission-Critical Environments

The ability to ensure business continuity during network or power outages is a crucial aspect. With Azure Local, robust systems can be implemented to preserve operations even when cloud connectivity is limited or unavailable. Examples include:

Factory and Warehouse Operations: Production and inventory management cannot stop; having a local solution ensures that production lines and management systems continue functioning despite network disruptions.
Stadiums and Event Venues: Services like security, ticketing, and lighting must remain operational to safeguard both spectator experience and safety.
Transport Hubs: Constant operation of ticketing systems, scheduling, and communications is essential for passenger flow and safety in large transit hubs.

Control Systems and Near Real-Time Processing

Some industrial, financial, and healthcare environments demand extremely low response times to avoid errors, ensure safety, or maximize performance. Azure Local, combined with Azure Arc, can meet these latency requirements:

Manufacturing Execution Systems (MES): Continuous synchronization and monitoring of production machinery optimize processes and minimize downtime.
Industrial Quality Assurance (QA): Immediate quality checks and verifications identify defects before they reach the final stage of production, increasing compliance and reducing waste.
Financial Infrastructures: Low-latency transaction processing and rapid risk assessment are critical for market competitiveness and stability.

Regulatory Compliance and DDIL Connectivity (Disconnected, Degraded, Intermittent, Limited)

For many organizations (governmental, military, or those operating critical infrastructures), data protection and secure management, even in the absence of reliable connectivity, are top priorities. Azure Local supports the need for on-premises data and control:

Government and Military Sectors: Data confidentiality is paramount, requiring local management to ensure continuous access even in compromised network scenarios.
Energy Infrastructures: The stability of distribution networks and control of pipelines and refineries require resilience under limited connectivity conditions, while adhering to stringent regulations.

Azure’s Adaptive Cloud Approach

Microsoft’s adaptive cloud approach, enabled by Azure Arc, helps organizations unify hybrid, multicloud, and edge infrastructures within Azure. With Azure Arc, the same cloud-native experiences and capabilities—such as security, updates, management, and scalability—can be extended anywhere, from on-premises data centers to distributed locations.

Figure 2 – Adaptive Cloud Approach

Azure Local, connected to the cloud through Azure Arc, enables:

Operating and scaling distributed infrastructure via the Azure portal and the same APIs.
Running fundamental compute, network, storage, and application services locally, choosing hardware from the preferred vendor.
Strengthening the security of apps and data with Azure technologies, protecting them against advanced threats.

A key feature is the presence of Azure Kubernetes Service (AKS), Microsoft’s managed Kubernetes solution. On Azure Local, AKS can be configured and updated automatically, providing everything needed (storage drivers, container images for Linux and Windows, etc.) to support containerized applications. Moreover, each cluster is automatically enabled with Azure Arc, allowing integration with services like Microsoft Defender for Containers, Azure Monitor, and GitOps for continuous delivery.

Figure 3 – Bring Azure Apps, Data, and AI Anywhere

New Azure AI Services with Azure Local and Azure Arc

On-Premises Data Search with Generative AI

In recent years, generative AI has made significant strides, driven by the introduction of language models (like GPT) capable of interpreting and generating natural language text. Public tools like ChatGPT work well for general knowledge queries but cannot address questions about private business data on which they have not been trained. To bridge this gap, the concept of Retrieval Augmented Generation (RAG) was introduced, a technique that “enhances” language models with proprietary data, enabling more advanced and customized use cases.

Within the Azure Local framework, Microsoft has announced a new service that brings generative AI and RAG directly to the edge, where the data resides. Within minutes, organizations can deploy (via an Azure Arc extension) everything needed to query their on-premises data, including:

Small and large language models (SLM/LLM) running locally, with support for both CPUs and GPUs.
An end-to-end data ingestion and RAG pipeline that keeps all information on-premises, with RBAC (Role-Based Access Control) ensuring secure access.
An integrated tool for prompt engineering and result evaluation to optimize model settings and performance.
APIs and interfaces aligned with Azure standards, facilitating integration into enterprise applications, plus a preconfigured UI for immediate service use.

This feature is now available in private preview for Azure Local customers, with Microsoft planning to expand availability to other Arc-enabled platforms in the near future.

“Edge RAG”: The Local Retrieval-Augmented Generation Ecosystem

This new service, known as “Edge RAG”, integrates seamlessly into the Azure ecosystem and supports various input components, such as:

Azure AI Search: Provides document search and indexing functionality, enabling quick identification of relevant content within large datasets.
Azure OpenAI: Offers advanced AI models (like GPT) capable of generating, understanding, and summarizing text in natural language.
Azure AI Studio: A platform for developing and managing AI assets (datasets, models, pipelines) centrally.

Together, these components power an integrated flow—from data ingestion to inference and result presentation via chat or other development interfaces. This enables the creation of chatbots, knowledge discovery tools, and other AI-driven solutions that leverage internal business data in a secure, customizable, and compliant environment.

Deploying Open-Source AI Models via Azure Arc

Another key feature of Azure AI is the availability of a catalog of AI models tested, validated, and guaranteed by Microsoft. These models are ready for deployment and provide consistent inference endpoints. This functionality is now extended to the edge, where Microsoft makes selected models available directly from the Azure portal:

Phi-3.5 Mini (language model with 3.8 billion parameters)
Mistral 7B (language model with 7.3 billion parameters)
MMDetection YOLO (object detection)
OpenAI Whisper Large (speech-to-text recognition)
Google T5 Base (automatic translation)

These models can be deployed in just a few steps on an Arc AKS cluster running on-premises. Most models require only a CPU, but Phi-3.5 and Mistral 7B also support GPUs for enhanced performance in intensive inference scenarios.

Azure AI Offerings: From Cloud to Edge

Microsoft’s approach spans the full spectrum of AI capabilities, offering services and tools that can be delivered in the Azure cloud or extended to on-premises and edge environments via Azure Arc. The offering consists of four main pillars:

Application Development
- Azure AI Studio: A development environment for AI applications (e.g., chatbots, virtual agents) with a complete set of APIs and interfaces for seamless AI integration.
AI Services
- Azure AI Language and Model Services: Preconfigured services for NLP, computer vision, and other AI functionalities.
- Solutions like Edge RAG, Video Indexer, and Managed AI Containers for local deployment of “ready-to-use” AI models.
Machine Learning & ML Ops
- Azure Machine Learning Studio: A comprehensive platform for creating, training, optimizing, and managing machine learning models.
- With Azure Arc, ML Ops capabilities can extend to the edge via extensions like the AML Arc Extension, enabling Azure ML tools on on-premises and edge infrastructures.
Infrastructure
- Azure Global Infrastructure: Azure’s cloud foundation, including compute, storage, and networking resources.
- Arc-Enabled Edge Infrastructure: Extends Azure capabilities to data centers or edge devices, managed as if they were cloud resources.

Conclusion

Microsoft’s strategy is built on delivering the best of the cloud “anywhere.” Azure Local epitomizes this vision: a solution that brings all the benefits of the cloud—agility, scalability, security—directly to local environments, meeting the needs for low latency, operational continuity, and regulatory compliance.

Thanks to Azure Arc, organizations can leverage Azure AI services such as advanced language models, Retrieval-Augmented Generation (RAG) pipelines, and ML Ops tools in a hybrid mode. Applications range from factory quality control to retail theft prevention, from critical government data centers to energy infrastructure monitoring.

In a world where data continues to grow exponentially and the need for on-site analysis becomes increasingly urgent, solutions like Azure Local represent the next step toward a new generation of distributed infrastructures. This is how Microsoft meets the challenge of uniting cloud potential with on-premises reality, creating opportunities for innovation and growth across all sectors.

The Evolution of High Availability and Disaster Recovery in Modern Infrastructures: The Azure Local Case

High availability and disaster recovery solutions are playing an increasingly central role in modern infrastructure adoption strategies. Azure Local, Microsoft’s on-premises cloud-connected platform, exemplifies this transformation.

Starting with version 23H2, Azure Local introduces a new generation of features, moving away from the traditional Stretched Cluster model to propose more modern and flexible approaches designed to optimize resilience and simplify management. Through new configurations such as Rack Aware Cluster and disaster recovery support via Azure Site Recovery, Azure Local positions itself as a strategic platform for organizations seeking robust, scalable solutions aligned with the Azure ecosystem. In this article, we will explore the key features introduced in Azure Local version 23H2, analyzing the new high-availability options, disaster recovery strategies, and the impact of transitioning from Stretched Clusters to a more advanced model.

Azure Local, Version 23H2: An Arc-Enabled Evolution

The new version 23H2 marks a significant leap forward, transforming from a simple cloud-connected operating system to an Azure Arc-enabled solution with integrated features such as Arc Resource Bridge, Arc VM, and AKS. This transformation expands the possibilities for managing and controlling distributed environments, providing a unified administrative experience. Moreover, multi-site management extends beyond the operating system level, rendering the functionality of previous Stretched Clusters obsolete and introducing new paradigms of resilience and reliability.

High Availability Options

Rack Aware Cluster: High Availability for Short Distances

The standout feature for short-distance scenarios is the Rack Aware Cluster, a configuration that enables:

Deploying the cluster across two racks or rooms within the same Layer-2 network (e.g., within a manufacturing plant or campus).
Functioning as a local availability zone, ensuring fault isolation and optimal workload placement.

Figures 1 – Rack Aware Cluster: Network Architecture

This configuration offers an ideal solution for combining efficiency and ease of management in local environments. By leveraging a single storage pool, it reduces complexity and enhances overall efficiency, avoiding the overhead caused by excessive data replication. The Rack Aware Cluster is particularly suited for edge locations and can scale up to 8 nodes (4 per rack). Currently in private preview, public availability is expected by 2025.

Notably, even within Azure Local, the concept of availability zones has been introduced, aligning significantly with the established Azure model to ensure maximum reliability and operational continuity.

Disaster Recovery Options

Cloud Replication with Azure Site Recovery

For long-distance disaster recovery scenarios, Azure Local leverages Azure Site Recovery (ASR) to replicate on-premises virtual machines to the Azure cloud. This solution enables:

Replication: Transferring VM disks to an Azure storage account, safeguarding data from potential disasters.
Failover: Running replicated VMs directly in Azure during a disaster, ensuring operational continuity.
Re-protect: Replicating VMs back to the local cluster, maintaining a continuous protection cycle.
Failback: Bringing workloads back from the cloud to the on-premises system with minimal disruption.

These operations are managed centrally through the Azure portal, ensuring simplicity and efficiency for system administrators.

Local Replication with Hyper-V Replica

For workloads that cannot be moved to the cloud, Azure Local supports Hyper-V Replica, a solution that replicates Arc VMs to a secondary site. This approach allows:

Ensuring operational continuity by replicating data to a remote location.
Managing VM recovery as Hyper-V virtual machines at the secondary site and reverting to Arc VMs upon restoration on the primary cluster.

This feature, integrated into the Hyper-V role, represents an essential option for resilience in multi-site scenarios.

The Transition from Stretched Clusters

Introduced with Azure Local version 22H2, Stretched Clusters utilized Storage Replica to ensure resilience between two node groups located in distinct sites. This configuration:

Required at least two nodes per site and replicated storage synchronously to ensure data integrity in the event of failures.
Supported live migration of VMs between sites, facilitating smooth transitions for planned maintenance.

However, this solution required manual operations to reverse the direction of storage replication, a process that could introduce complexity and impact performance. With version 23H2, Stretched Clusters are no longer supported. Clusters configured with version 22H2 can still be partially upgraded to the 23H2 operating system, maintaining compatibility but without benefiting from the new features of the latest version.

For customers still using this configuration, it is advisable to consider adopting the new high availability and disaster recovery options offered by Azure Local, which guarantee greater efficiency and reliability.

Conclusions

The new features in Azure Local version 23H2 reflect a significant evolution toward more flexible, modern management aligned with the Azure ecosystem. With solutions like Rack Aware Cluster and integration with Azure Site Recovery, organizations can enhance the resilience of their local environments and ensure scalable and integrated disaster recovery options. Furthermore, abandoning the Stretched Cluster model paves the way for more efficient and streamlined configurations, enabling customers to fully leverage the potential offered by Azure technologies.

Ladies and Gentlemen, Welcome Azure Local!

Microsoft Ignite 2024 brought several exciting announcements, but one of the most significant was undoubtedly Azure Local. This is not merely a rebranding of Azure Stack HCI; it is a platform that redefines how we think about hybrid and on-premises infrastructures. Azure Local is designed to bring the essence of the cloud directly to local datacenters, offering a rich experience highly integrated with Azure services. With a suite of innovative features and a flexible approach, Azure Local promises to redefine the future of local infrastructures. Below, we explore all the updates on this solution.

A Name that Reflects a Vision

The name Azure Local is straightforward and on point. It represents the idea of having core Azure services—compute, networking, storage, and applications—available directly in local datacenters. This vision materializes through a cloud-connected platform that offers flexibility, scalability, and operational control.

Hardware: Choice, Flexibility, and New Opportunities

One of the most intriguing features of Azure Local is its wide range of supported hardware. With over 100 validated platforms, including major vendors like Dell and Lenovo, businesses can select solutions that best meet their needs and budget. Compatibility with GPUs like Nvidia A2, A16, and L40 makes Azure Local ideal for advanced workloads like artificial intelligence and virtual desktops.

Cost-Effective Options for the Edge

For environments with lighter compute requirements or tighter budgets, Azure Local supports micro, tower, and rugged hardware. This is a great opportunity for companies operating in edge or industrial environments. The minimum requirements include a compatible machine with an additional SSD and a 1 Gbps Ethernet network, eliminating the need for expensive switches. These options open new possibilities for deployments in remote or hard-to-reach locations, ensuring performance and consistency even in challenging operating conditions.

Simplified Provisioning

Thanks to the FIDO Device Onboard (FDO) protocol, onboarding machines is automated, greatly simplifying the activation of new edge nodes or IoT devices. This approach eliminates the need for complex manual interventions, making infrastructure deployment faster and more efficient.

Identity Management: With or Without Active Directory

Azure Local introduces long-awaited flexibility in identity management. If you don’t want to use on-premises Active Directory, the new “Local Identity” feature is available. This solution uses local accounts and certificates while retaining advanced functionalities like live VM migration. Additionally, local secrets are safeguarded with Azure Key Vault, ensuring high security levels even without external identity systems.

Centralized Management and Monitoring

One of Azure Local’s key strengths is its integration with Azure Arc, which extends Azure services to on-premises and other cloud environments. Infrastructure management happens directly from the Azure portal, where you can configure clusters, networking, and storage. For those seeking operational consistency, Azure Local allows configurations to be defined using ARM (Azure Resource Manager) templates, ensuring scalable and repeatable management. Furthermore, the Infrastructure-as-Code approach simplifies deployment in distributed environments, ensuring consistency and reducing errors.

Simplified Updates

Azure Local software updates come in a single monthly package, including drivers, firmware, and software stacks. This method enables sequential updates of physical machines while ensuring workload continuity. The ability to automatically orchestrate updates in multi-node environments is a significant advantage for organizations needing to minimize downtime.

Integrated Monitoring

Azure Local integrates natively with Azure Monitor, providing a unified view of all distributed resources. With over 50 standard metrics, preconfigured dashboards, and alert rules, businesses can monitor CPU, memory, storage, and network usage, setting up email notifications or automated actions in case of failures. Furthermore, data collection rules can be customized, and advanced dashboards can be created via Workbooks.

Figure 2 – Centralized visibility across all your locations

New Features and Services

Azure Local doesn’t stop at enhancing infrastructure—it also introduces new features and services that expand its usability.

Figure 3 – Azure Apps, Data, and AI in Azure Local

Migration from VMware

For organizations looking to move away from VMware, Azure Local offers a migration solution (in preview) via Azure Migrate. This tool enables the transfer of VMDKs to Azure Local, eliminating dependence on Broadcom and its associated costs. The migration process uses the same portal and APIs as Azure, ensuring a seamless experience for those already familiar with Azure tools.

Figure 4 – Migrating from VMware to Azure Local

PaaS and AI Services

Azure Local enables the use of Azure PaaS services like Azure Virtual Desktop and SQL Managed Instance. Additionally, the new Azure IoT Operations service provides a unified platform for edge data collection and analysis. For companies interested in AI, Azure Local introduces local AI search capabilities (preview) that leverage advanced language models to analyze on-premises data. This innovation opens new opportunities for process automation and data valorization.

Figure 5 – Azure AI Services with Azure Local

Disconnected Operations

For customers who cannot connect to the cloud due to regulatory or other reasons, Azure Local offers a disconnected option (in preview). In this configuration, Azure services, including the portal and Azure Resource Manager, are hosted locally, ensuring a consistent experience even without connectivity.

Figure 6 – Disconnected operations

Advanced Security

Security is a cornerstone of Azure Local, with new features enhancing resource protection.

Network Security Groups (NSG)

This functionality allows granular access rules between resources, filtering traffic based on parameters like source IP, port, and protocol. NSGs offer precise control over network traffic, reducing the risk of unauthorized access.

Figure 7 – Network Security Group in Azure Local

Trusted Launch

Azure Local introduces Trusted Launch, which protects VMs from rootkits and bootkits through Secure Boot and BitLocker encryption. This feature also ensures secure VM migration within the cluster, preserving data integrity and enhancing infrastructure resilience. Azure’s attestation services will also provide continuous system integrity monitoring, offering advanced security and visibility.

Getting Started

Existing Customers

Existing Azure Stack HCI customers need to do nothing—software updates will ensure a smooth transition to Azure Local, granting immediate access to new features.

New Installations

Azure Local is available in version 2411 for new deployments.

Virtual Sandbox

For those wanting to try Azure Local without dedicated hardware, Azure Arc Jumpstart offers a virtual sandbox environment, accessible via an Azure subscription. This option is ideal for testing features before deploying in production environments.

Conclusion

Microsoft Ignite 2024 highlighted a significant milestone in the hybrid infrastructure landscape with Azure Local. It’s not just an evolution of Azure Stack HCI but a platform that redefines how businesses leverage the cloud in their datacenters. With a focus on flexibility, integration, and security, Azure Local combines the best of the on-premises and cloud worlds, enabling organizations to adopt a truly connected and coherent hybrid strategy.

Its distinctive features, such as simplified provisioning, centralized management with Azure Arc, and support for disconnected scenarios, make it an ideal solution for addressing complex business needs.

Moreover, its attention to specific workloads like AI and virtual desktops, along with advanced security features like Trusted Launch and NSGs, strengthens Azure Local’s ability to adapt to diverse operational contexts.

Azure Local represents a significant step toward the future of hybrid infrastructures, delivering a seamless cloud experience directly to local datacenters. For both existing and new customers, this solution marks the beginning of a new era in IT resource management, bringing the cloud closer to business needs.

Unveiling the future: key insights from Microsoft Ignite 2024 on Azure IaaS and Azure Local

In this article, I delve into the latest technological advancements and strategic updates unveiled at the recent Microsoft Ignite 2024 event. With a specific focus on Azure Infrastructure as a Service (IaaS) and Azure Local, I aim to provide a comprehensive and insightful overview of the innovative solutions and initiatives introduced by Microsoft. As a cornerstone event in the tech world, Microsoft Ignite continues to shape the industry by presenting groundbreaking features, enhancements, and visionary developments. Join me as I explore these transformative updates in detail, offering my personal insights on their potential to redefine the future of cloud infrastructure and services. This article examines the implications of these transformative updates, analyzing their impact on the evolution of cloud infrastructure and services, and their significance for businesses navigating the digital future.

Azure

Silicon Updates for Azure Infrastructure

Microsoft Azure is advancing its infrastructure with end-to-end silicon innovations to meet the growing demands of cloud and AI workloads. Azure Integrated Hardware Security Module (HSM) ensures robust security across datacenter hardware, while Azure Boost Data Processing Units (DPUs) provide efficiency in networking, storage, and acceleration for scale-out workloads. Additionally, Azure’s innovative liquid cooling technology is tailored for large-scale AI systems, ensuring efficiency and sustainability within its datacenters. By integrating CPUs, AI accelerators, and DPUs, alongside cutting-edge hardware security and cooling technologies, Azure continues to optimize every layer of its infrastructure for the AI-driven era.

Azure HBv5 Virtual Machines Built for High Performance and Cost Efficiency (preview)

Azure HBv5 virtual machines are designed to redefine high-performance computing (HPC) in the cloud by delivering exceptional performance and cost efficiency. Powered by AMD EPYC™ 9V64H processors and the latest NVIDIA InfiniBand networking technologies, these VMs promise up to 8x the performance of leading bare metal and cloud alternatives, and up to 35x the speed of legacy on-premises systems. HBv5 VMs are optimized for demanding workloads such as computational fluid dynamics, weather modeling, and aerospace simulation. With enhanced data movement capabilities, high-bandwidth memory, and a co-designed platform to overcome bottlenecks, HBv5 will empower researchers and businesses to accelerate insights and reduce costs, with availability in preview by 2025.

Azure ND GB200 V6 VMs Powered by NVIDIA Blackwell Platform (preview)

Microsoft Azure has announced the preview of its Azure ND GB200 V6 virtual machines, powered by NVIDIA Blackwell GB200 Superchips. These VMs represent a breakthrough in AI computing, offering unparalleled performance and scalability for AI model training and inference. Co-developed and co-optimized with NVIDIA and other AI innovators, the Azure ND GB200 V6 series sets a new standard for AI supercomputing in the cloud. The integration of NVIDIA GB200 Superchips ensures accelerated capabilities for the most advanced AI workloads, enabling faster, more efficient AI innovation.

Microsoft Continues Transition to Reliable Logical Qubits

Microsoft is pioneering advancements at the intersection of AI and quantum computing by transitioning toward reliable logical qubits. In collaboration with Atom Computing, Microsoft is developing the world’s largest neutral atom commercial system with entangled logical qubits, offering breakthrough 2-qubit gate fidelity. These advancements will enable deeper, more complex quantum computations, surpassing classical computing capabilities. The co-designed commercial quantum machine, expected to launch by the end of 2025, will support faster AI training and accelerate scientific discovery, marking a significant leap in quantum innovation.

Azure Local

Azure Expands Adaptive Cloud, Introducing the Azure Local Infrastructure Solution

Microsoft Azure continues to innovate with its adaptive cloud approach, supporting global infrastructure across cloud and edge environments. This expansion offers unified management, enhanced security, simplified application deployment, and a consistent data foundation across hybrid, multicloud, and edge ecosystems. As part of this evolution, Azure Local—a cloud-controlled hybrid infrastructure platform powered by Azure Arc—is now generally available. Azure Arc acts as a bridge, extending Azure platform services like Azure Local across hybrid, multicloud, and edge locations.

What is Azure Local?

Azure Local enables customers to extend Azure services to distributed locations, empowering them to run mission-critical workloads, cloud-native applications, and AI solutions with unparalleled flexibility and scalability. Through partnerships with OEMs like Dell, HP, and Lenovo, Azure Local integrates secure, pre-validated hardware with cloud-based services. Supporting a variety of infrastructure setups, from compact industrial PCs to enterprise-grade servers, Azure Local also addresses disconnected scenarios, meeting rigorous regulatory and compliance requirements.

Azure Local’s Role in Azure’s Global Infrastructure

This new platform underscores Azure’s commitment to providing customers with unmatched options tailored to their unique needs. Whether leveraging Azure’s global presence in over 60 regions or third-party infrastructure enabled by Azure Arc, customers benefit from centralized management, advanced security features, and AI-driven insights. These capabilities accelerate app development and scaling while offering a unified experience across centralized and distributed environments.

Key Features and Benefits

Azure Local integrates and expands upon the Azure Stack product family, offering broader capabilities and a more streamlined experience. Existing Azure Stack HCI customers will automatically transition to Azure Local, which includes features like:

Customizable cloud-based operations and security
Support for cloud-native and traditional applications
Azure Virtual Desktop integration

New customers can explore validated partner solutions on the Azure Local webpage to get started today.

Azure Local vs. Azure Arc

Azure Local: Designed for customers seeking new or refreshed infrastructure at distributed locations, with Azure Arc capabilities seamlessly built-in.
Azure Arc: A bridge to extend Azure services to existing infrastructure or other cloud environments.

Azure Local’s Relationship with Azure Stack HCI

Azure Local now encompasses Azure Stack HCI, maintaining all its features and adding significant new functionality:

Support for lower-spec hardware (preview)
Disconnected operations (preview)
Enhanced services and flexibility

Existing customers need only continue applying updates to transition smoothly to Azure Local.

Transition for Azure Stack Hub and Azure Stack Edge

Microsoft recommends Azure Local for most distributed infrastructure scenarios. Once preview features such as lower-spec hardware and disconnected operations become generally available, Azure Local will offer equivalent capabilities to previous Azure Stack solutions. Until then, Azure Stack Hub and Azure Stack Edge remain available as standalone products.

Windows Server Integration

Azure Local also brings added value to Windows Server customers. Those with Software Assurance or active subscriptions can access Azure management tools like:

Azure Update Manager
Azure Policy Guest Configuration
Disaster Recovery
Change Tracking and Inventory

This integration incurs no additional cost, further enhancing Azure’s value proposition.

Getting Started

Azure Local is now available for production use (version 2411). New customers can browse the solutions catalog for their preferred vendor’s hardware and read the deployment guide to initiate their journey. Additional low-spec, cost-effective options are expected to launch soon.

Stay Informed with Microsoft Ignite: The Book of News

For more information, you can refer to “The Book of News,” the guide to Microsoft’s announcements for Microsoft Ignite. This resource is designed to streamline your access to the latest updates and provide essential insights into the topics that matter most to you.

Conclusion

The innovations unveiled at Microsoft Ignite 2024 mark a transformative leap in cloud infrastructure and hybrid solutions. From groundbreaking advancements in Azure IaaS with next-generation silicon, high-performance virtual machines, and pioneering AI capabilities to the introduction of Azure Local as a unified platform for distributed environments, Microsoft continues to redefine the standards of scalability, flexibility, and security.

These updates emphasize Azure’s commitment to empowering businesses with the tools needed to navigate the evolving digital landscape. Whether through enhanced performance for demanding workloads, seamless hybrid integration, or cutting-edge developments in quantum computing, Microsoft’s vision aligns with the growing demand for adaptive and resilient cloud solutions.

Azure Local’s seamless integration of Azure Stack HCI and the broader Azure ecosystem offers a compelling solution for organizations seeking a consistent and secure approach to managing workloads across centralized, hybrid, and edge environments. By bridging cloud-native and traditional applications, Azure Local simplifies infrastructure management while addressing complex compliance and operational needs.

As we look ahead, the innovations discussed at Microsoft Ignite 2024 set the stage for a future where cloud technologies continue to drive business transformation. By staying informed and embracing these advancements, organizations can unlock new levels of agility, innovation, and growth in an increasingly connected world.