How long does deployment take?

From order to a fully running AI system: typically 2–4 weeks. On-site setup usually takes less than a day.

On-Premise AI Infrastructure

Run GPT-5-class AI
on hardware
you own.

On-premise LLM appliance for German SMEs. EU AI Act compliant. From €2,950. Pays back in under 18 months.

Your AI. Your data. Your server room.

EU AI Act takes effect August 2, 2026— Is your AI infrastructure ready?

GPU-Powered AI

8B–70B+

Parameter Models

Zero Cloud Dependency

30–100+

Users Supported

Complete Data Sovereignty

100%

EU AI Act & GDPR Ready

Pays for Itself

<18mo

Full Payback

Book a Consultation

Try the ROI Calculator

Built on NVIDIA Blackwell hardware

Who This Is For

Built for the right kind of buyer

We've kept the scope tight so the value is honest. If you're outside this profile, we'd rather tell you up front than waste your time.

Built for

Companies with sensitive data and real cloud-AI spend.

10–500 employees
Legal, medical, financial, defense supply chain, R&D-heavy manufacturing
Currently paying €15,000+ per year for cloud AI
Or barred from cloud AI for compliance reasons

Not for

If any of these sound like you, we're not a fit — yet.

Solo consultants and one-person shops
Companies under €2M annual revenue
Teams that need fewer than 5 AI users
Anyone looking for a cheap experiment, not a workhorse

The Problem

Cloud AI is draining your budget — every single month

Most companies don't realize how fast AI subscription costs compound. Here's what 100 users actually costs you per year.

With the EU AI Act taking effect August 2, 2026, companies using AI for high-risk tasks face mandatory compliance — or fines up to €35M.

ChatGPT Business

€25–30 / / user / month

€30,000–36,000

per year for 100 users

Claude Team

€25 / / user / month

€30,000

per year for 100 users

Enterprise Tier

€60+ / / user / month

€72,000+

per year for 100 users

Data Leaves Your Building

Every prompt, every document, every trade secret — sent to US servers

Unpredictable Costs

API bills spike with usage. Heavy months can 3–5× your estimate

Regulatory Risk

The EU AI Act (Aug 2026) classifies AI in law, healthcare, and finance as high-risk — cloud AI cannot meet the mandatory compliance requirements

Vendor Outages

When OpenAI or Anthropic goes down, your entire team stops working

The Solution

Enterprise AI that lives in your server room

A compact, GPU-powered appliance pre-loaded with state-of-the-art open-source AI models. Plug in, connect to your network, and start using AI in hours — not weeks.

WerkHub AI appliance — GPU-powered on-premise server

Compact. Silent. Powerful.

Fits in any standard server rack or on a desk

What your team sees

Summarize the Q1 revenue report and flag any anomalies.

Q1 revenue was €2.4M (+12% YoY). Two anomalies detected: DACH region declined 8% despite expansion, and SaaS churn spiked to 4.2% in March...

Running locally on your WerkHub AI appliance — no data leaves your network

NVIDIA Professional GPUs

DGX Spark (GB10, 128 GB unified) or RTX PRO 6000 Blackwell (96 GB GDDR7) — datacenter-grade, commercially licensed, 24/7 rated

8B – 120B+ Parameter Models

Llama 4, Gemma 4, Qwen 3.6, GLM-4.7, Devstral 2, Nemotron 3 Super — quantized for speed, tuned for business tasks. Swap models anytime.

Up to 100 Concurrent Users

Optimized inference stack (vLLM) handles 10–15 simultaneous requests with sub-second latency

RAG-Ready Architecture

Connect your documents, knowledge base, and internal data. The AI knows your business.

Pre-configured & Secure

Ships with everything installed. Air-gapped capable. No internet required for operation.

How It Works

From order to AI in three steps

No cloud accounts, no complex setup, no ongoing vendor management. We handle the hard parts so your team can focus on using AI.

We Configure

Tell us your team size, use cases, and compliance needs. We select the right GPU, models, and software stack — tailored to your workload.

Typical configuration takes 1–2 business days

We Deploy

Your pre-configured appliance ships ready to plug in. We assist with network integration, user onboarding, and initial model tuning — remotely or on-site.

From order to running AI: 2–4 weeks

Your Team Uses AI

Your employees access AI through a familiar chat interface — no training needed. Documents stay local, costs stay flat, and you stay in control.

Zero per-user fees, unlimited usage from day one

Why On-Premise

Four reasons to bring AI in-house

Data Sovereignty

Your prompts, documents, and trade secrets never leave your building. EU AI Act and GDPR compliant by architecture, not by promise. Full audit trails and data governance built in.

Predictable Costs

One-time hardware investment. No per-user fees, no per-token charges, no surprise bills. Your 101st request costs the same as your first — zero.

No Vendor Lock-in

Run Llama, Mistral, Qwen, or any open-source model. When the next breakthrough drops, download and deploy — no waiting for a provider to support it.

Always Available

No cloud outages. No API rate limits. No degraded performance during peak hours. Your AI runs on your network, on your schedule.

AI Agent Templates

240+ pre-built agent templates — ready for every department

Curated automation recipes across 22 industries — all running on-premise, behind your firewall, fully GDPR-compliant.

What we mean by "agent": a pre-built workflow template — system prompt + RAG over your documents + optional tool use (email, CRM, ERP, web). Not fully autonomous AI; you keep approval control at every step.

Core Capabilities

Document Drafting

Gemma 4 31B · Qwen 3.6 35B

Internal Knowledge Q&A

Nemotron 3 Super · Qwen3-Coder-Next 80B

Email Triage

Gemma 4 31B · Mistral Small 3.1

Technical Documentation

Devstral 2 · GLM-4.7 30B

Agentic Coding

Claude Code · Codex · OpenCode · OpenClaw

Multi-language Translation

Qwen 3.6 35B · Nemotron 3 Super

Featured Templates

Cash-Flow Runway Forecaster— 13-week forecast + runway view to anticipate shortfalls

Compliance Radar— Centralized compliance calendar with owners and backups

Customer Inbox Triage— Tags, SLAs, templates, and escalation rules

Hiring Pipeline Kit— Job posts, screening steps, interview scorecards, offer checklists

Featured Templates

Reconciliation Autopilot— Pull source data, propose matches, isolate exceptions, produce audit-ready packs

Document-to-Ledger Pipeline— Extract structured fields from invoices/statements into posting-ready entries

Tax Change Radar— Track regulatory updates and generate firm-specific checklists

Featured Templates

13-Week Cash Flow Forecaster— Rolling forecast with scenario analysis and variance explanations

Uncertainty Scenario Budget— Convert uncertainty into 3 scenarios with trigger thresholds

Expense Compliance Engine— Receipt capture, policy enforcement, and card reconciliation

Featured Templates

Customer Inbox Triage— Triage system with tags, SLAs, templates, escalation rules

Review Response Workflow— Auto-response templates for customer reviews

CRM-Lite Lead Follow-Up— Simple pipeline with automated follow-ups

Featured Templates

CRM-Lite Lead Follow-Up— Track leads with automated reminders

Source Hunter— Automated market research and comparables

Review Response Workflow— Respond to client and guest reviews

Featured Templates

Scope Change Control— Manage requirement volatility with structured change requests

Context Switching Guardrails— Reduce interruptions and batch shallow work

Task Whisperer— Track tasks, deadlines, and dependencies

All templates run locally on your WerkHub AI appliance. Your data never leaves your network.

Deploy 240+ templates on your hardware

240+ pre-built workflow templates across 24 industries — customize, combine, or build your own.

Early Access

Built for real deployments, not slide decks

WerkHub AI is engineered for SMEs that need GPT-5-class capability inside their own network. Early-access partners get hands-on engineering support and pricing locked at today's BOM.

5–500

User deployments by design

24/7

Designed for on-premise operation

100%

EU AI Act & GDPR ready

Frühzugang offen

Early-access program now open

First 10 customers receive 12 months of premium support and a free on-site discovery session (up to 500 km from our Haskovo/Sofia office) — no extra cost.

12 months premium support included
Free on-site discovery session
Pricing locked at current BOM

Request early access

Case Study

See a real deployment

Get the detailed PDF — hardware specs, integration approach, QA process, and project timeline from a recent build.

Want the detailed PDF with full specs?

Component list, architecture diagrams, and project timeline.

Hardware Tiers

Choose your configuration

Every appliance ships pre-configured with the full software stack — ready to deploy in hours. Custom configurations available on request.

Basic

Plug & play

from 2.950 €

5.900 € without subsidy

Blackwell (integrated)

1–10 users

3–5 simultaneous users

View Configuration

Entry

Small business

from 5.500 €

11.000 € without subsidy

2× Blackwell (integrated)

10–20 users

6–10 simultaneous users

View Configuration

Business

Enterprise

Maximum scale

65.000 €

Custom subsidy structuring — contact for details

4× NVIDIA RTX PRO 6000 Blackwell Max-Q

100–500 users

50–100 simultaneous users

View Configuration

All prices are estimated BOM + assembly. Final pricing depends on configuration, GPU availability, and current component market conditions.
Includes: hardware, pre-installed software stack, initial model deployment, and setup documentation.

Government Funding

Up to 50% government subsidies

German funding programs can significantly reduce your investment. Here are the key programs at a glance.

Digitalbonus Bayern

Plus

up to €30,000

Rate: 50%

Grant covering up to 50% of digitalization project costs (max €60k). Covers the full AI solution — software, deployment, integration, training, and appliance hardware.

Eligible: Bavarian SMEs, <50 employees, <€10M turnover

Innovationsgutschein

Bayern

up to €49,750

Rate: 50–60%

For custom AI development and innovation. Standard tier: up to €22,800 (60%). Special tier: up to €49,750 (50%). Covers R&D services, not hardware.

Eligible: Bavarian SMEs with R&D projects

Forschungszulage

Federal

25–35%

Rate: Tax credit

Federal R&D tax credit. 25% of eligible costs (35% for SMEs), up to €1M/year. No application deadline — claimed with your annual tax return.

Eligible: All German companies, any size, any state

All funding programs in detail

FAQ

Common questions

Everything you need to know about deploying on-premise AI with WerkHub AI.

Every appliance ships with enterprise-grade components rated for 24/7 operation. We offer optional maintenance contracts with next-business-day hardware replacement. The system uses RAID storage, so a single drive failure won't cause data loss. For critical deployments, we recommend a redundant configuration.

You have full control over model updates. When a new open-source model is released, we provide a tested update package that you can apply at your convenience — no forced updates, no surprise changes. Your team can also download and deploy models independently. We offer an optional managed update service if you prefer hands-off maintenance.

Yes. The appliance runs standard inference frameworks (vLLM, Ollama) that support any GGUF or safetensors model. If you have fine-tuned models from your own training pipeline, you can deploy them directly. We also offer fine-tuning services if you want to customize models for your specific domain.

From order to a fully running AI system: typically 2–4 weeks. The appliance ships pre-configured with your chosen models and software stack. On-site setup usually takes less than a day — connect power, connect to your network, and your team can start using AI immediately.

The EU AI Act (effective August 2, 2026) classifies many business AI uses as high-risk, requiring full transparency, audit trails, and data governance. With on-premise AI, your data never leaves your building — eliminating cross-border data transfer risks. You have complete control over model behavior, logging, and compliance documentation. We provide compliance documentation templates as part of the deployment.

Every appliance includes 12 months of technical support covering software configuration, model deployment assistance, and troubleshooting. Extended support contracts are available with guaranteed response times. We also provide initial user training and onboarding documentation for your team.

For a 100-person team, cloud AI subscriptions cost €30,000–72,000+ per year. The WerkHub AI appliance is a one-time investment starting at €5,900 (from €2,950 with Digitalbonus) and scaling to €65,000 for enterprise deployments. Minimal ongoing costs (electricity and optional maintenance). Most customers see full ROI within 12–18 months. Use our ROI calculator for a personalized comparison.

The appliance runs completely offline — no internet connection required for AI inference. This makes it suitable for high-security environments. Internet is only needed if you want to download new models or receive software updates, and even then, models can be transferred via USB for fully air-gapped setups.

Get Started

Book a free consultation

Tell us about your team and AI needs. We'll follow up with a tailored solution, configuration recommendation, and quote.

Run GPT-5-class AIon hardwareyou own.

Built for the right kind of buyer

Built for

Not for

Cloud AI is draining your budget — every single month

Data Leaves Your Building

Unpredictable Costs

Regulatory Risk

Vendor Outages

Enterprise AI that lives in your server room

NVIDIA Professional GPUs

8B – 120B+ Parameter Models

Up to 100 Concurrent Users

RAG-Ready Architecture

Pre-configured & Secure

From order to AI in three steps

We Configure

We Deploy

Your Team Uses AI

Four reasons to bring AI in-house

Data Sovereignty

Predictable Costs

No Vendor Lock-in

Always Available

240+ pre-built agent templates — ready for every department

Small Business

Accountants

Finance

Ecommerce

Real Estate

Project Managers

Built for real deployments, not slide decks

Early-access program now open

See a real deployment

Want the detailed PDF with full specs?

Choose your configuration

Basic

Entry

Business

Enterprise

Up to 50% government subsidies

Digitalbonus Bayern

Innovationsgutschein

Forschungszulage

Common questions

Book a free consultation

Run GPT-5-class AI
on hardware
you own.