We are now part of the NVIDIA Inception Program.Read the announcement
MX4 Atlas Platform

Sovereign AI infrastructure

Deploy the full AI stack on your infrastructure—air-gapped, zero external calls, infrastructure activity journal. Sovereignty enforced at the architecture layer.

Capabilities

What you can build

Foundational blocks to run any generative AI workload inside your sovereign perimeter.

Secure RAG Pipelines

Ingest millions of documents into a sovereign vector store. Connect SharePoint, OneDrive, file shares, or custom sources.

Automated Intelligence

Agents that analyze reports, process forms, and summarize contracts at scale.

Coding Assistants

Host an on-prem copilot fine-tuned on your repos—no IP leakage.

Data Analysis

Query SQL and warehouses in natural language to generate insights and charts.

Architecture

System architecture

A zero-trust request path from gateway to the infrastructure activity journal.

1. Atlas Core
Authenticate and route requests by routing rules
2. Atlas Runtime
Enforce zero-trust isolation and residency boundaries
3. Atlas Serve
Run inference on local GPUs
4. Activity Journal
Record infrastructure events locally

Components

Platform components

Five components, one sovereign platform.

The Unified Gateway

Atlas Core

Unified API gateway that routes prompts by cost, latency, and sovereignty. Handles auth, rate limits, and orchestration.

Tech Stack

FastAPIRedispgvector
https://mx4.ai/platform/atlas core
Atlas Core
The Isolation Engine

Atlas Runtime

Isolation layer enforcing mTLS, activity journaling, and data residency at the infrastructure level.

Tech Stack

RusteBPFOPA
https://mx4.ai/platform/atlas runtime
Atlas Runtime
The Inference Engine

Atlas Serve

Optimized inference for MENA workloads on A100/H100 with vLLM and high concurrency.

Tech Stack

vLLMTritonRay Serve
https://mx4.ai/platform/atlas serve
Atlas Serve
The Infrastructure Manager

Atlas Deploy

Air-gapped and private-cloud deployments with offline bundles and immutable rollout tracking.

Tech Stack

KubernetesHelmTerraform
https://mx4.ai/platform/atlas deploy
Atlas Deploy
The Operations Console

Atlas Studio

Web console for models, costs, traces, and fine-tuning with full RTL and Arabic UI.

Tech Stack

Next.jsClickHouseTremor
https://mx4.ai/platform/atlas studio
Atlas Studio

Operational Assurance

Built for sovereignty, cost, and uptime

Operational controls that keep sovereignty measurable and predictable.

Default deny

Zero-egress enforcement

Outbound traffic is blocked by default; exceptions require operator approval and activity journaling.

Append-only

Infrastructure activity journal

Append-only activity journal retained locally per your policy.

Versioned

Model governance

Approved model registry with version pinning and rollback across environments.

Budgeted

Cost & usage controls

Per-team budgets, token caps, and automated routing to stay within spend.

Deployment

Deployment flexibility

Air-gapped, sovereign private cloud, or hybrid managed—choose what fits.

Air-Gapped / Offline

Complete isolation from the public internet. Updates are delivered via secure physical media or local registry. Ideal for Defense & National Security.

  • No Outbound Connections
  • Offline Model Bundles
  • Local Activity Journal
  • Physical Key Authentication

Sovereign Private Cloud

Deployed within your VPC (AWS Outposts, Azure Stack, or Local Data Center). Data never leaves your controlled perimeter.

  • Data Residency by Design
  • VPC Peering
  • Custom IAM Integration
  • Regional Failover

Hybrid Managed

Atlas Studio assisted by MX4; data stays in your infrastructure. Balanced operational ease and data security.

  • Optional Monitoring Integration
  • Scheduled Updates
  • Data residency by design
  • Usage-Based Billing

Infrastructure

Infrastructure requirements

Reference hardware profiles for dev, POC, and production at scale.

ProfileCPURAMGPUStorage
Development (1 Node)4 vCPU16 GBNone (CPU only)100 GB NVMe
Proof of Concept (2 Nodes)16 vCPU64 GB2x NVIDIA A100 (40GB)500 GB NVMe
Production HA (3+ Nodes)32+ vCPU256+ GB4-8x NVIDIA A100 (80GB)2 TB+ NVMe RAID

Performance

Performance benchmarks

Atlas leads Arabic tasks while running entirely on your infrastructure.

MetricMX4 Atlas (Falcon-7B)GPT-4 (OpenAI)ImprovementContext
Arabic QA Accuracy91.2%76.8%+14.4%Falcon-7B vs GPT-4 on SQuAD-style Arabic dataset
Cultural Context94.7%68.2%+26.5%Understanding of local idioms, laws, and customs
Dialect Handling89.3%72.1%+17.2%Performance on MSA, Gulf, and Levantine dialects
Inference Speed (Tokens/s)45 t/s18 t/s2.5x FasterFalcon-7B vs GPT-4 API response time

Sovereignty

Data sovereignty by design

Infrastructure-level isolation plus data residency controls you can verify.

Data Localization

Implementation: Customer-controlled regional storage
Verification: Activity journal

Access Controls

Implementation: RBAC + mTLS enforcement
Verification: Access reviews

Activity Journaling

Implementation: Append-only infrastructure journal
Verification: Cryptographic signatures

Data Encryption

Implementation: AES-256 at rest/transit
Verification: Key management controls

Scenarios

Deployment scenarios

Representative examples of how sovereign teams deploy Atlas.

Scenario: Government Ministry

Government

Focus: Data residency

Atlas Runtime keeps Arabic document workflows on‑prem with zero external calls and infrastructure activity journaling.

Digital transformation team

Outcome: Faster approvals

Scenario: Regional Bank

Financial Services

Focus: Cost control

Atlas Core routes requests across local and premium models to balance cost and latency while staying on‑prem.

AI innovation team

Outcome: Faster onboarding

Scenario: National Healthcare System

Healthcare

Focus: PHI boundaries

Clinical support workflows run inside the hospital perimeter with strict access controls and local models.

Clinical informatics team

Outcome: Operational efficiency

Scenario: Critical Infrastructure Operator

Critical Infrastructure

Focus: Air‑gapped

Air‑gapped deployments enable offline inference with sealed update bundles.

Security architecture team

Outcome: Offline operations

FAQ

Frequently asked questions

Answers for procurement, deployment, and security teams.

Need specifics?

Talk to our engineers and get a deployment blueprint.

Book a technical call

Can Atlas run completely offline in an air-gapped environment?

Yes. Atlas Deploy supports full offline deployment with bundled models, dependencies, and local container registries. Updates are delivered via secure physical media or dedicated secure channels. No outbound internet connection required.

What models are supported and can I bring my own?

Atlas supports all major open-source models (Llama, Falcon, Jais, Qwen, Mistral) and can route to OpenAI/Anthropic APIs when appropriate. You can also fine-tune models on your data and host them through Atlas Serve.

How long does deployment typically take?

Dev environments come online in ~10 minutes. The Free POC Pilot typically installs in ~30 minutes. Production deployments complete in under ~2 hours (including infrastructure checks); broader rollout timing depends on internal readiness.

What level of Arabic language support is provided?

Atlas is optimized for Modern Standard Arabic and major dialects (Gulf, Levantine, Egyptian). We route Arabic queries to native models like JAIS-13B and Falcon-7B which outperform GPT-4 on regional context by 25%+.

Is Atlas compatible with our existing ML infrastructure?

Yes. Atlas integrates with Kubernetes, supports standard OIDC/SAML authentication, works with existing observability stacks (Prometheus, Grafana), and provides OpenAI-compatible APIs for seamless migration.

Do you provide governance tooling?

Atlas focuses on infrastructure. Governance and policy enforcement live in your own stack; Atlas delivers residency controls, isolation, and activity journaling by design.

Can I deploy incrementally across environments?

Yes. Atlas is designed for incremental deployment: start with a POC (30 days, limited scale), validate use cases, then deploy production workloads on your infrastructure. The API remains identical across environments.

What kind of support and SLAs do you offer?

Enterprise customers get dedicated support channels, 4-hour response SLA for critical issues, quarterly business reviews, and direct access to our engineering team for architecture guidance.

Get Started

Ready to deploy sovereign AI?

Launch a free POC on your infrastructure. 30 days, zero external dependencies.

Start with a 30-day POC • Typical deployment: 2–4 weeks