All guides
AISaaSGDPR

Data Protection & GDPR for AI

Your lawful basis for using personal data in training and inference, and how you handle automated decisions — where most AI products are exposed.

6 min read

A privacy policy is one piece. The harder questions — your lawful basis for using personal data in training and inference, and how you handle automated decisions — are where most AI products are exposed. This guide works through them.

Dossier

For
AI and SaaS companies handling personal data, especially those training on user data, making automated decisions, or selling to enterprise customers with data-protection requirements.
Covers
Lawful basis & data flows · training-data governance · automated decisions · DPAs · DPIAs.
The overlap
The EU AI Act governs the system; the GDPR governs the personal data it uses — AI products usually need both, kept consistent.
Reference points
GDPR (Regulation (EU) 2016/679) and the EDPB’s Opinion 28/2024 on AI models.

Establish your lawful basis and map your data flows

Under the GDPR, every use of personal data needs a lawful basis. Article 6 sets out six: consent, performance of a contract, a legal obligation, vital interests, a public-interest task, and legitimate interests. The exposure for AI products is usually not "do we have a policy" but "what is our basis for each use of personal data — collection, training, inference — and can we evidence it."

  • Map how personal data moves through your product and model, and attach a lawful basis to each distinct use.
  • Where you process special-category data (Article 9) — health, biometrics, and similar — you need an Article 9 condition on top of your Article 6 basis.
  • The basis you rely on for training can differ from the one for inference or deployment; treat them separately.

Get training-data governance right

The data behind your models needs sourcing, consent where relevant, and documentation you can stand behind. This has become one of the most scrutinised areas in AI data protection.

In December 2024 the European Data Protection Board (EDPB) issued Opinion 28/2024 on processing personal data in the context of AI models. Three points from it are worth building around:

  • Model anonymity isn’t automatic. Whether a trained model is “anonymous” is assessed case by case, and the threshold is high — you’d need to show the likelihood of extracting personal data about individuals from the model (directly or through queries) is insignificant.
  • Legitimate interest can be a valid basis for development and deployment, but only via a structured three-step test: a genuine legitimate interest, the necessity of the processing for it, and a balancing against the rights and reasonable expectations of the people whose data is used.
  • Mitigations strengthen the balance — measures like pseudonymisation, genuine transparency about data sources, and an opt-out all weigh in your favour in that assessment.

Handle automated decisions and profiling

Where your product makes or supports decisions about people, additional GDPR obligations attach. Article 22 gives individuals the right not to be subject to a decision based solely on automated processing — including profiling — that produces legal effects or similarly significantly affects them, save in defined circumstances and with safeguards.

  • Identify where your system makes or materially drives decisions about individuals (credit, eligibility, employment-related screening, and the like).
  • Where Article 22 is engaged, build in the transparency, the meaningful information about the logic involved, and the safeguards — such as human review — that the GDPR requires.
  • Tie this to your transparency notices under Articles 13–14, which are where individuals learn that such processing happens.

Write notices and DPAs built for an AI product

Two sets of documents do real work here, and neither should be lifted from a generic SaaS template:

  • Privacy notices (Articles 13–14) — user-facing information written for how your AI product actually uses data, including, where you rely on it, the enhanced transparency the EDPB associates with a legitimate-interest basis for AI models.
  • Data-processing agreements (Article 28) — the contracts your enterprise customers and vendors will require, governing processing carried out on their behalf. Enterprise buyers increasingly won’t sign without a DPA that fits an AI use case.

Run a DPIA and keep GDPR and the AI Act consistent

Where processing is likely to result in a high risk to individuals — often the case for AI involving large-scale or sensitive data, or automated decisions — the GDPR requires a Data Protection Impact Assessment (Article 35) before you start. It’s both an obligation and a useful forcing function for the analysis in steps 01–03.

Finally, keep the two regimes aligned. The EU AI Act governs the AI system; the GDPR governs the personal data that system uses. They overlap but aren’t the same, and AI products usually need both — so the smart move is to make your AI-governance and data-protection workstreams consistent rather than running them in separate silos that contradict each other.

Get your data protection in order

Book a consultation

Sources checked

GDPR — Regulation (EU) 2016/679, in particular Articles 5, 6, 9, 13–14, 22, 28 and 35; EDPB Opinion 28/2024 on certain data-protection aspects related to the processing of personal data in the context of AI models (adopted 17 December 2024), and EDPB Guidelines 1/2024 on Article 6(1)(f) legitimate interest referenced within it; interaction with the EU AI Act, Regulation (EU) 2024/1689. Application of Article 22 continues to be shaped by case law and supervisory guidance; confirm current interpretation for your facts.

guideAISaaSGDPR

More from the journal

See all

Google Engineer Charged with Commodities Fraud on Polymarket Using Confidential Data, May 2026

On 27 May 2026, the U.S. Attorney for the Southern District of New York unsealed a criminal complaint charging Michele Spagnuolo, a Google staff software engineer, with commodities fraud, wire fraud, and money laundering. Spagnuolo allegedly used confidential internal Google Search data to place approximately $2.75 million in bets on Polymarket event contracts tied to Google's Year in Search report between October and December 2025, netting roughly $1.2 million in profit. The CFTC filed a parallel civil action seeking penalties and trading bans.

Georgia Enacts Payment Stablecoin Act, Establishing Issuer Licensing Regime, May 2026

On 11 May 2026, Georgia signed HB 1272 (Act 452), the Georgia Payment Stablecoin Act, into law. The statute directs the Georgia Department of Banking and Finance to license stablecoin issuers incorporated under Georgia or foreign law. Licensed issuers must maintain one-to-one reserves of eligible assets and may only engage in stablecoin issuance, redemption, reserve management, and related custodial activities. The law takes effect on the earlier of 18 January 2027 or 120 days after federal GENIUS Act implementing regulations are finalized.

European Commission Opens Targeted Consultation on MiCA Review, May 2026

On 20 May 2026, the European Commission launched a targeted consultation on Regulation (EU) 2023/1114, the Markets in Crypto-Assets Regulation (MiCA), running alongside a parallel public consultation. The targeted consultation spans 86 questions across four thematic blocks and invites responses from industry representatives and public authorities. Submissions close 31 August 2026, with results feeding into the Commission's review reports under Articles 140 and 142 of MiCA.

Ready to launch without the regulatory guesswork?

Book a 30-minute consultation. We'll map your AI or licensing path and tell you exactly what's required, in plain language.