Skip to main content
On-Premise AI Infrastructure

Run GPT-5-class AI
on hardware
you own.

On-premise LLM appliance for German SMEs. EU AI Act compliant. From €2,950. Pays back in under 18 months.

Your AI. Your data. Your server room.

EU AI Act takes effect August 2, 2026— Is your AI infrastructure ready?
GPU-Powered AI
8B–70B+
Parameter Models
Zero Cloud Dependency
30–100+
Users Supported
Complete Data Sovereignty
100%
EU AI Act & GDPR Ready
Pays for Itself
<18mo
Full Payback

Built on NVIDIA Blackwell hardware

NVIDIASupermicro
Who This Is For

Built for the right kind of buyer

We've kept the scope tight so the value is honest. If you're outside this profile, we'd rather tell you up front than waste your time.

Built for

Companies with sensitive data and real cloud-AI spend.

  • 10–500 employees
  • Legal, medical, financial, defense supply chain, R&D-heavy manufacturing
  • Currently paying €15,000+ per year for cloud AI
  • Or barred from cloud AI for compliance reasons

Not for

If any of these sound like you, we're not a fit — yet.

  • Solo consultants and one-person shops
  • Companies under €2M annual revenue
  • Teams that need fewer than 5 AI users
  • Anyone looking for a cheap experiment, not a workhorse
The Problem

Cloud AI is draining your budget — every single month

Most companies don't realize how fast AI subscription costs compound. Here's what 100 users actually costs you per year.

With the EU AI Act taking effect August 2, 2026, companies using AI for high-risk tasks face mandatory compliance — or fines up to €35M.

ChatGPT Business
€25–30 / / user / month
€30,000–36,000
per year for 100 users
Claude Team
€25 / / user / month
€30,000
per year for 100 users
Enterprise Tier
€60+ / / user / month
€72,000+
per year for 100 users

Data Leaves Your Building

Every prompt, every document, every trade secret — sent to US servers

Unpredictable Costs

API bills spike with usage. Heavy months can 3–5× your estimate

Regulatory Risk

The EU AI Act (Aug 2026) classifies AI in law, healthcare, and finance as high-risk — cloud AI cannot meet the mandatory compliance requirements

Vendor Outages

When OpenAI or Anthropic goes down, your entire team stops working
The Solution

Enterprise AI that lives in your server room

A compact, GPU-powered appliance pre-loaded with state-of-the-art open-source AI models. Plug in, connect to your network, and start using AI in hours — not weeks.

WerkHub AI appliance — GPU-powered on-premise server
Compact. Silent. Powerful.
Fits in any standard server rack or on a desk
What your team sees
Summarize the Q1 revenue report and flag any anomalies.
Q1 revenue was €2.4M (+12% YoY). Two anomalies detected: DACH region declined 8% despite expansion, and SaaS churn spiked to 4.2% in March...
Running locally on your WerkHub AI appliance — no data leaves your network

NVIDIA Professional GPUs

DGX Spark (GB10, 128 GB unified) or RTX PRO 6000 Blackwell (96 GB GDDR7) — datacenter-grade, commercially licensed, 24/7 rated

8B – 120B+ Parameter Models

Llama 4, Gemma 4, Qwen 3.6, GLM-4.7, Devstral 2, Nemotron 3 Super — quantized for speed, tuned for business tasks. Swap models anytime.

Up to 100 Concurrent Users

Optimized inference stack (vLLM) handles 10–15 simultaneous requests with sub-second latency

RAG-Ready Architecture

Connect your documents, knowledge base, and internal data. The AI knows your business.

Pre-configured & Secure

Ships with everything installed. Air-gapped capable. No internet required for operation.
How It Works

From order to AI in three steps

No cloud accounts, no complex setup, no ongoing vendor management. We handle the hard parts so your team can focus on using AI.

01

We Configure

Tell us your team size, use cases, and compliance needs. We select the right GPU, models, and software stack — tailored to your workload.

Typical configuration takes 1–2 business days
02

We Deploy

Your pre-configured appliance ships ready to plug in. We assist with network integration, user onboarding, and initial model tuning — remotely or on-site.

From order to running AI: 2–4 weeks
03

Your Team Uses AI

Your employees access AI through a familiar chat interface — no training needed. Documents stay local, costs stay flat, and you stay in control.

Zero per-user fees, unlimited usage from day one
Why On-Premise

Four reasons to bring AI in-house

Data Sovereignty

Your prompts, documents, and trade secrets never leave your building. EU AI Act and GDPR compliant by architecture, not by promise. Full audit trails and data governance built in.

Predictable Costs

One-time hardware investment. No per-user fees, no per-token charges, no surprise bills. Your 101st request costs the same as your first — zero.

No Vendor Lock-in

Run Llama, Mistral, Qwen, or any open-source model. When the next breakthrough drops, download and deploy — no waiting for a provider to support it.

Always Available

No cloud outages. No API rate limits. No degraded performance during peak hours. Your AI runs on your network, on your schedule.
AI Agent Templates

240+ pre-built agent templates — ready for every department

Curated automation recipes across 22 industries — all running on-premise, behind your firewall, fully GDPR-compliant.

What we mean by "agent": a pre-built workflow template — system prompt + RAG over your documents + optional tool use (email, CRM, ERP, web). Not fully autonomous AI; you keep approval control at every step.
Core Capabilities
Document Drafting
Gemma 4 31B · Qwen 3.6 35B
Internal Knowledge Q&A
Nemotron 3 Super · Qwen3-Coder-Next 80B
Email Triage
Gemma 4 31B · Mistral Small 3.1
Technical Documentation
Devstral 2 · GLM-4.7 30B
Agentic Coding
Claude Code · Codex · OpenCode · OpenClaw
Multi-language Translation
Qwen 3.6 35B · Nemotron 3 Super
Featured Templates
Cash-Flow Runway Forecaster13-week forecast + runway view to anticipate shortfalls
Compliance RadarCentralized compliance calendar with owners and backups
Customer Inbox TriageTags, SLAs, templates, and escalation rules
Hiring Pipeline KitJob posts, screening steps, interview scorecards, offer checklists
Featured Templates
Reconciliation AutopilotPull source data, propose matches, isolate exceptions, produce audit-ready packs
Document-to-Ledger PipelineExtract structured fields from invoices/statements into posting-ready entries
Tax Change RadarTrack regulatory updates and generate firm-specific checklists
Featured Templates
13-Week Cash Flow ForecasterRolling forecast with scenario analysis and variance explanations
Uncertainty Scenario BudgetConvert uncertainty into 3 scenarios with trigger thresholds
Expense Compliance EngineReceipt capture, policy enforcement, and card reconciliation
Featured Templates
Customer Inbox TriageTriage system with tags, SLAs, templates, escalation rules
Review Response WorkflowAuto-response templates for customer reviews
CRM-Lite Lead Follow-UpSimple pipeline with automated follow-ups
Featured Templates
CRM-Lite Lead Follow-UpTrack leads with automated reminders
Source HunterAutomated market research and comparables
Review Response WorkflowRespond to client and guest reviews
Featured Templates
Scope Change ControlManage requirement volatility with structured change requests
Context Switching GuardrailsReduce interruptions and batch shallow work
Task WhispererTrack tasks, deadlines, and dependencies
All templates run locally on your WerkHub AI appliance. Your data never leaves your network.

240+ pre-built workflow templates across 24 industries — customize, combine, or build your own.

Early Access

Built for real deployments, not slide decks

WerkHub AI is engineered for SMEs that need GPT-5-class capability inside their own network. Early-access partners get hands-on engineering support and pricing locked at today's BOM.

5–500
User deployments by design
24/7
Designed for on-premise operation
100%
EU AI Act & GDPR ready
Frühzugang offen

Early-access program now open

First 10 customers receive 12 months of premium support and a free on-site discovery session (up to 500 km from our Haskovo/Sofia office) — no extra cost.

  • 12 months premium support included
  • Free on-site discovery session
  • Pricing locked at current BOM
Request early access
Case Study

See a real deployment

Get the detailed PDF — hardware specs, integration approach, QA process, and project timeline from a recent build.

Want the detailed PDF with full specs?

Component list, architecture diagrams, and project timeline.

Hardware Tiers

Choose your configuration

Every appliance ships pre-configured with the full software stack — ready to deploy in hours. Custom configurations available on request.

All prices are estimated BOM + assembly. Final pricing depends on configuration, GPU availability, and current component market conditions.
Includes: hardware, pre-installed software stack, initial model deployment, and setup documentation.

Government Funding

Up to 50% government subsidies

German funding programs can significantly reduce your investment. Here are the key programs at a glance.

Digitalbonus Bayern

Plus
up to €30,000
Rate: 50%

Grant covering up to 50% of digitalization project costs (max €60k). Covers the full AI solution — software, deployment, integration, training, and appliance hardware.

Eligible: Bavarian SMEs, <50 employees, <€10M turnover

Innovationsgutschein

Bayern
up to €49,750
Rate: 50–60%

For custom AI development and innovation. Standard tier: up to €22,800 (60%). Special tier: up to €49,750 (50%). Covers R&D services, not hardware.

Eligible: Bavarian SMEs with R&D projects

Forschungszulage

Federal
25–35%
Rate: Tax credit

Federal R&D tax credit. 25% of eligible costs (35% for SMEs), up to €1M/year. No application deadline — claimed with your annual tax return.

Eligible: All German companies, any size, any state
FAQ

Common questions

Everything you need to know about deploying on-premise AI with WerkHub AI.

Every appliance ships with enterprise-grade components rated for 24/7 operation. We offer optional maintenance contracts with next-business-day hardware replacement. The system uses RAID storage, so a single drive failure won't cause data loss. For critical deployments, we recommend a redundant configuration.

You have full control over model updates. When a new open-source model is released, we provide a tested update package that you can apply at your convenience — no forced updates, no surprise changes. Your team can also download and deploy models independently. We offer an optional managed update service if you prefer hands-off maintenance.

Yes. The appliance runs standard inference frameworks (vLLM, Ollama) that support any GGUF or safetensors model. If you have fine-tuned models from your own training pipeline, you can deploy them directly. We also offer fine-tuning services if you want to customize models for your specific domain.

From order to a fully running AI system: typically 2–4 weeks. The appliance ships pre-configured with your chosen models and software stack. On-site setup usually takes less than a day — connect power, connect to your network, and your team can start using AI immediately.

The EU AI Act (effective August 2, 2026) classifies many business AI uses as high-risk, requiring full transparency, audit trails, and data governance. With on-premise AI, your data never leaves your building — eliminating cross-border data transfer risks. You have complete control over model behavior, logging, and compliance documentation. We provide compliance documentation templates as part of the deployment.

Every appliance includes 12 months of technical support covering software configuration, model deployment assistance, and troubleshooting. Extended support contracts are available with guaranteed response times. We also provide initial user training and onboarding documentation for your team.

For a 100-person team, cloud AI subscriptions cost €30,000–72,000+ per year. The WerkHub AI appliance is a one-time investment starting at €5,900 (from €2,950 with Digitalbonus) and scaling to €65,000 for enterprise deployments. Minimal ongoing costs (electricity and optional maintenance). Most customers see full ROI within 12–18 months. Use our ROI calculator for a personalized comparison.

The appliance runs completely offline — no internet connection required for AI inference. This makes it suitable for high-security environments. Internet is only needed if you want to download new models or receive software updates, and even then, models can be transferred via USB for fully air-gapped setups.

Get Started

Book a free consultation

Tell us about your team and AI needs. We'll follow up with a tailored solution, configuration recommendation, and quote.

Required