Deploy the full AI stack on your own infrastructure: air-gapped, with zero external calls and an infrastructure-level activity journal. Sovereignty enforced at the architecture layer.
Capabilities
Foundational blocks to run any generative AI workload inside your sovereign perimeter.
Ingest millions of documents into a sovereign vector store. Connect SharePoint, OneDrive, file shares, or custom sources.
Agents that analyze reports, process forms, and summarize contracts at scale.
Host an on-prem copilot fine-tuned on your repos—no IP leakage.
Query SQL and warehouses in natural language to generate insights and charts.
Architecture
A zero-trust request path from gateway to the infrastructure activity journal.
Components
Five components, one sovereign platform.
Atlas Core: Unified API gateway that routes prompts by cost, latency, and sovereignty. Handles auth, rate limits, and orchestration.

Atlas Runtime: Isolation layer enforcing mTLS, activity journaling, and data residency at the infrastructure level.

Atlas Serve: Optimized inference for MENA workloads on A100/H100 GPUs with vLLM and high concurrency.

Atlas Deploy: Air-gapped and private-cloud deployments with offline bundles and immutable rollout tracking.

Atlas Studio: Web console for models, costs, traces, and fine-tuning with full RTL and Arabic UI support.
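The gateway's cost/latency/sovereignty routing can be sketched as a simple policy function. This is an illustrative sketch only, not the Atlas API: the `Model` type, catalog entries, and figures are assumptions for the example.

```python
from dataclasses import dataclass

@dataclass
class Model:
    name: str
    cost_per_1k_tokens: float  # USD, illustrative figures
    p50_latency_ms: int
    sovereign: bool  # True if the model runs inside the perimeter

# Hypothetical catalog; names and numbers are placeholders.
CATALOG = [
    Model("falcon-7b-local", 0.0004, 120, True),
    Model("jais-13b-local", 0.0009, 180, True),
    Model("gpt-4-api", 0.03, 900, False),
]

def route(require_sovereign: bool, max_latency_ms: int) -> Model:
    """Pick the cheapest model that satisfies the sovereignty and latency constraints."""
    candidates = [
        m for m in CATALOG
        if (m.sovereign or not require_sovereign) and m.p50_latency_ms <= max_latency_ms
    ]
    if not candidates:
        raise ValueError("no model satisfies the routing policy")
    return min(candidates, key=lambda m: m.cost_per_1k_tokens)
```

With a sovereignty requirement and a 200 ms budget, the router selects the cheapest local model; relaxing both constraints still prefers local models on cost, which is why external APIs are only reached when policy explicitly allows them.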

Operational Assurance
Operational controls that keep sovereignty measurable and predictable.
Outbound traffic is blocked by default; exceptions require operator approval and activity journaling.
Append-only activity journal retained locally per your policy.
Approved model registry with version pinning and rollback across environments.
Per-team budgets, token caps, and automated routing to stay within spend.
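Per-team budgets and token caps come down to a ledger check before each request is dispatched. The class below is a minimal sketch of that idea, assuming fixed caps per team; it is not the Atlas implementation.

```python
class TeamBudget:
    """Track a team's spend and token usage against fixed caps."""

    def __init__(self, usd_cap: float, token_cap: int):
        self.usd_cap = usd_cap
        self.token_cap = token_cap
        self.usd_spent = 0.0
        self.tokens_used = 0

    def try_consume(self, tokens: int, usd: float) -> bool:
        """Admit the request only if both caps still hold after it."""
        if (self.usd_spent + usd > self.usd_cap
                or self.tokens_used + tokens > self.token_cap):
            return False  # caller can reject, or reroute to a cheaper model
        self.usd_spent += usd
        self.tokens_used += tokens
        return True
```

A gateway would consult this check per request; a `False` result is what triggers the "automated routing to stay within spend" behavior, e.g. falling back to a lower-cost local model.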
Deployment
Air-gapped, sovereign private cloud, or hybrid managed—choose what fits.
Air-Gapped: Complete isolation from the public internet. Updates are delivered via secure physical media or a local registry. Ideal for Defense & National Security.
Sovereign Private Cloud: Deployed within your VPC (AWS Outposts, Azure Stack, or a local data center). Data never leaves your controlled perimeter.
Hybrid Managed: Atlas Studio assisted by MX4; data stays in your infrastructure. Balances operational ease with data security.
Infrastructure
Reference hardware profiles for dev, POC, and production at scale.
| Profile | CPU | RAM | GPU | Storage |
|---|---|---|---|---|
| Development (1 Node) | 4 vCPU | 16 GB | None (CPU only) | 100 GB NVMe |
| Proof of Concept (2 Nodes) | 16 vCPU | 64 GB | 2x NVIDIA A100 (40GB) | 500 GB NVMe |
| Production HA (3+ Nodes) | 32+ vCPU | 256+ GB | 4-8x NVIDIA A100 (80GB) | 2 TB+ NVMe RAID |
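The GPU sizes in the table track model memory requirements: a common rule of thumb is 2 bytes per parameter for FP16 weights, plus headroom for the KV cache and batching. A quick estimate for a 7B-parameter model shows why the 40 GB A100 is the POC floor:

```python
def fp16_weight_gb(params_billion: float) -> float:
    """Approximate FP16 weight footprint in GB (2 bytes per parameter)."""
    return params_billion * 1e9 * 2 / 1e9  # i.e. 2 GB per billion params

# Falcon-7B weights alone need ~14 GB, so a 40 GB A100 leaves room
# for KV cache and concurrent batches; a CPU-only dev node cannot
# serve it at production throughput.
print(fp16_weight_gb(7))   # 14.0
print(fp16_weight_gb(13))  # 26.0 (e.g. a 13B model such as JAIS-13B)
```

This is a weights-only lower bound; actual serving memory depends on context length, batch size, and the inference engine's cache strategy.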
Performance
Atlas leads Arabic tasks while running entirely on your infrastructure.
| Metric | MX4 Atlas (Falcon-7B) | GPT-4 (OpenAI) | Improvement | Context |
|---|---|---|---|---|
| Arabic QA Accuracy | 91.2% | 76.8% | +14.4 pts | Falcon-7B vs GPT-4 on a SQuAD-style Arabic dataset |
| Cultural Context | 94.7% | 68.2% | +26.5 pts | Understanding of local idioms, laws, and customs |
| Dialect Handling | 89.3% | 72.1% | +17.2 pts | Performance on MSA, Gulf, and Levantine dialects |
| Inference Speed (Tokens/s) | 45 t/s | 18 t/s | 2.5x faster | Local Falcon-7B throughput vs GPT-4 API |
Sovereignty
Infrastructure-level isolation plus data residency controls you can verify.
Scenarios
Representative examples of how sovereign teams deploy Atlas.
Government
“Atlas Runtime keeps Arabic document workflows on-prem with zero external calls and infrastructure activity journaling.”
Digital transformation team
Outcome: Faster approvals
Financial Services
“Atlas Core routes requests across local and premium models to balance cost and latency while staying on-prem.”
AI innovation team
Outcome: Faster onboarding
Healthcare
“Clinical support workflows run inside the hospital perimeter with strict access controls and local models.”
Clinical informatics team
Outcome: Operational efficiency
Critical Infrastructure
“Air-gapped deployments enable offline inference with sealed update bundles.”
Security architecture team
Outcome: Offline operations
FAQ
Answers for procurement, deployment, and security teams.
**Can Atlas run fully air-gapped, with no internet access?** Yes. Atlas Deploy supports full offline deployment with bundled models, dependencies, and local container registries. Updates are delivered via secure physical media or dedicated secure channels. No outbound internet connection is required.
**Which models does Atlas support?** Atlas supports all major open-source models (Llama, Falcon, Jais, Qwen, Mistral) and can route to OpenAI/Anthropic APIs when appropriate. You can also fine-tune models on your own data and host them through Atlas Serve.
**How long does deployment take?** Dev environments come online in about 10 minutes, and the Free POC Pilot typically installs in about 30 minutes. Production deployments complete in under 2 hours (including infrastructure checks); broader rollout timing depends on internal readiness.
**How well does Atlas handle Arabic?** Atlas is optimized for Modern Standard Arabic and major dialects (Gulf, Levantine, Egyptian). We route Arabic queries to native models like JAIS-13B and Falcon-7B, which outperform GPT-4 on regional context by 25+ percentage points.
**Does Atlas integrate with our existing infrastructure?** Yes. Atlas integrates with Kubernetes, supports standard OIDC/SAML authentication, works with existing observability stacks (Prometheus, Grafana), and provides OpenAI-compatible APIs for seamless migration.
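Because the API surface is OpenAI-compatible, an existing client typically only needs a new base URL and key. The sketch below builds a standard chat-completions request using only the standard library; the endpoint `https://atlas.internal/v1` and the model name are placeholder assumptions, not real Atlas values.

```python
import json
import urllib.request

def build_chat_request(base_url: str, api_key: str, model: str, prompt: str):
    """Construct (but do not send) an OpenAI-style chat completion request."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(body).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

# Existing OpenAI SDK code migrates the same way: point the client's
# base_url at the local gateway and keep the request shape unchanged.
req = build_chat_request("https://atlas.internal/v1", "local-key", "falcon-7b", "Hello")
```

The same payload shape works whether the gateway routes to a local model or (where policy permits) an external API, which is what makes the migration path seamless.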
**Does Atlas handle governance and compliance?** Atlas focuses on infrastructure. Governance and policy enforcement live in your own stack; Atlas delivers residency controls, isolation, and activity journaling by design.
**Can we start with a small pilot?** Yes. Atlas is designed for incremental deployment: start with a POC (30 days, limited scale), validate use cases, then deploy production workloads on your infrastructure. The API remains identical across environments.
**What support do enterprise customers receive?** Enterprise customers get dedicated support channels, a 4-hour response SLA for critical issues, quarterly business reviews, and direct access to our engineering team for architecture guidance.
Get Started
Launch a free POC on your infrastructure. 30 days, zero external dependencies.
Start with a 30-day POC • Typical deployment: 2–4 weeks