4bis. AI, models, and inference portability
The CTO saw it first in the weekly dashboard: median inference latency had drifted from 1.4 to 2.8 seconds over three weeks. No alerts. No incident. The provider's status page was green. But two days later, the internal eval suite came back: contract extraction quality had dropped four points, and the provider had announced nothing. Customer-reported accuracy incidents had tripled over the same window. The Head of Customer Success was forwarding one complaint a day from enterprise clients in Germany and the Netherlands, until one of them terminated the contract for breach. Within two days, a competitor in Austin announced it had won that customer. That was the last straw.
The call with the LLM provider was professionally apologetic. A "Priority access" tier had been added to the enterprise offering; unfortunately, it was not available outside the US "yet". Would it ever be?
The product was built on GPT-4o. The prompts were tuned for it. The retrieval pipeline's embeddings lived in the provider's vector space, so no alternative model could query them; re-embedding the corpus would take three weeks of compute. Migrating off was a six-month project.
In this chapter:
- Failure modes
- Objectives
- Solutions
  - Abstract every production AI call (sketched below)
  - Add an EU-hosted provider alongside the primary
  - Build an open-weights fallback for MSS-critical flows
  - Treat prompt portability as an engineering discipline
  - Preserve the underlying assets
  - Use BYOK and EU residency where available
  - Govern workspace AI deliberately
  - Alignment with the AI Act
- Conclusion
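
To make the first of those solutions concrete before the chapter walks through it, here is a minimal sketch of what abstracting every production AI call can look like. It is an illustration under assumptions, not any vendor's real SDK: the names (`InferenceProvider`, `StubPrimary`, `StubOpenWeights`, `complete_with_fallback`) are all hypothetical, and a real adapter would wrap a vendor client where these stubs return canned answers or simulated failures.

```python
import time
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass
class Completion:
    text: str
    model: str         # which model actually answered
    latency_ms: float  # recorded per call, so drift like 1.4 s -> 2.8 s is visible


class InferenceProvider(ABC):
    """Application code depends on this interface, never on a vendor SDK."""

    @abstractmethod
    def complete(self, prompt: str) -> Completion:
        ...


class StubPrimary(InferenceProvider):
    """Stands in for the primary hosted model; a real adapter would call
    the vendor's client here."""

    def complete(self, prompt: str) -> Completion:
        raise TimeoutError("simulated degradation at the primary provider")


class StubOpenWeights(InferenceProvider):
    """Stands in for an EU-hosted or self-hosted open-weights fallback."""

    def complete(self, prompt: str) -> Completion:
        start = time.perf_counter()
        answer = f"[fallback answer to: {prompt[:40]}]"
        return Completion(answer, "open-weights-local",
                          (time.perf_counter() - start) * 1000)


def complete_with_fallback(primary: InferenceProvider,
                           fallback: InferenceProvider,
                           prompt: str) -> Completion:
    """Try the primary; on failure, route to the fallback instead of
    waiting out a vendor incident."""
    try:
        return primary.complete(prompt)
    except Exception:
        return fallback.complete(prompt)


if __name__ == "__main__":
    result = complete_with_fallback(
        StubPrimary(), StubOpenWeights(),
        "Extract the termination clause from this contract.")
    print(result.model, f"{result.latency_ms:.2f} ms", result.text)
```

The point is the dependency direction: application code imports only `InferenceProvider`, never a vendor SDK, so adding or swapping a provider means writing one new adapter rather than running the six-month migration from the story above.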