Writing

Local LLM vs ChatGPT API in Enterprise Mobile Apps: The Complete Risk and Cost Analysis for US Companies 2026

43% of US enterprises are unaware that default ChatGPT API usage may include data training rights. Here is the risk and cost analysis CISOs and CTOs need before choosing an approach.

Praveen Kumar · Technical Lead, Wednesday Solutions
9 min read·Published Apr 24, 2026·Updated Apr 24, 2026
4.8 on Clutch
Trusted by teams at American Express, Visa, Discover, EY, Smarsh, Kalshi, BuildOps

43% of US enterprises are unaware that default ChatGPT API usage may include data training rights. Under the current API terms, OpenAI does not use API data for training by default, but those terms have changed before and could change again. For regulated industries, "currently opt-out" is not the same as a contractual commitment, and being one policy change away from a compliance problem is not an acceptable posture for a HIPAA-covered app or a SOC 2-audited platform.

This guide covers the risk and cost analysis that CISOs and CTOs need before deciding whether to use ChatGPT API, a local LLM, or a combination in their enterprise mobile apps.

Key findings

43% of US enterprises are unaware that default ChatGPT API usage may include data training rights. Regulated industries need contractual data handling commitments, not default policies that can change.

Local LLMs on current flagship devices (Snapdragon 8 Gen 3, Apple A17 Pro) support models up to 7B parameters with 200 to 500ms inference latency, which is sufficient for most enterprise text use cases.

ChatGPT API on the standard tier does not include a Business Associate Agreement. Healthcare apps sending PHI to the API without a BAA carry direct HIPAA exposure.

The cost of a local LLM feature breaks even against cloud API fees in 12 to 24 months at moderate user volumes. At enterprise scale, on-device inference is significantly cheaper over a 3-year horizon.

The comparison CISOs are running right now

The board mandate to "add AI" has landed on thousands of enterprise CTOs and CISOs in the past 18 months. The path of least resistance is the ChatGPT API: well-documented, familiar to engineers, fast to integrate, and capable of genuinely impressive results. Most enterprise mobile AI features can be prototyped with ChatGPT API integration in a week.

The question is whether that prototype path leads to a production-ready enterprise app, or to a compliance and cost problem that surfaces 12 months after launch.

The CISO's concern is not primarily cost. It is data. Every query sent to a cloud API contains user data that leaves the enterprise's control, is processed by a third-party system, and is subject to the API provider's data handling policies — policies that the enterprise did not negotiate and cannot unilaterally enforce.

For a retail app with non-sensitive user data, this risk is manageable. For a healthcare app with patient information, a financial services app with account data, or an HR app with employee information, the risk is a compliance exposure that auditors will find.

The local LLM alternative — running an AI model directly on the device — keeps data local. The inference happens on the phone. Nothing leaves the device. The compliance review of the AI feature is internal, not dependent on a third-party API provider's policies.

The tradeoff is engineering complexity and model capability. Local LLMs are harder to build and deploy than API integrations, and they are limited to models small enough to run on a phone.

ChatGPT API: what you get and what you give up

The ChatGPT API (and equivalent APIs from Anthropic, Google, and others) gives access to large foundation models via a simple network call. The engineering integration is straightforward — an HTTP request with the prompt, credentials, and parameters, returning a text response.
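To make the shape of that call concrete, here is a minimal sketch in Python using only the standard library. The endpoint and field names follow OpenAI's published chat completions format; the API key, model choice, and parameter values are illustrative placeholders, not a production configuration.

```python
import json
import urllib.request

# OpenAI's chat completions endpoint.
API_URL = "https://api.openai.com/v1/chat/completions"

def build_chat_request(prompt: str, api_key: str,
                       model: str = "gpt-4o-mini") -> urllib.request.Request:
    """Build (but do not send) a chat completion request for a prompt."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 256,    # cap response length to bound per-query cost
        "temperature": 0.2,   # low temperature for predictable enterprise features
    }).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )

# Sending it is one more line:
#   response = urllib.request.urlopen(build_chat_request("Summarize ...", key))
```

The simplicity is the point: this is roughly the entire AI integration, which is why prototypes ship in days. It is also the risk: every `prompt` in that request body leaves the device.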

What you get:

Access to models with hundreds of billions of parameters. GPT-4o, Claude 3.5 Sonnet, and Gemini 1.5 Pro are capable of complex reasoning, long-form generation, multi-step analysis, and tasks that exceed the capability of any on-device model. For enterprise use cases that genuinely require this level of capability, cloud API is the practical choice.

Fast integration. An API integration can be built and tested in days. The engineering investment is low compared to on-device deployment.

Model improvements without app updates. When OpenAI improves GPT-4o, the improvement is available to all API callers without any action on the enterprise's part. The app always uses the current model version.

What you give up:

Data control. Every query leaves the device and is processed by OpenAI's (or Anthropic's, or Google's) infrastructure. The enterprise's data handling obligations do not transfer to the API provider without explicit contractual arrangements.

Offline capability. API features require connectivity. In environments where connectivity is intermittent, API-based AI features are unreliable.

Cost certainty. API costs scale directly with usage. A successful product with growing user adoption produces growing API costs that are difficult to cap.

Local LLM: what it actually means on mobile

A local LLM is a language model downloaded to the device and run using the device's processor and neural processing unit. The model lives in the app's storage. Inference happens locally. No network call is required.

The practical constraints on local LLMs for mobile are hardware and model size. Current flagship devices support models up to approximately 7B parameters with acceptable performance. Quantized models (reduced from full float32 precision to int4 or int8) run faster and use less memory, at a small quality penalty. A quantized 7B parameter model takes 4 to 6GB of device storage and 3 to 4GB of RAM during inference.
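The storage figures above can be sanity-checked with simple arithmetic: on-disk model size is roughly parameter count times bits per weight divided by 8, and real runtimes add some overhead for metadata and the KV cache. A quick sketch:

```python
def model_size_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate model size in decimal GB: params * bits-per-weight / 8 bytes."""
    bytes_total = params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9

# A 7B model at common precisions:
for bits in (16, 8, 4):
    print(f"7B @ {bits}-bit: ~{model_size_gb(7, bits):.1f} GB")
# float16 ~14 GB (impractical on a phone), int8 ~7 GB, int4 ~3.5 GB
```

The 4 to 6GB range quoted above sits between the int4 and int8 results, which is where mixed-precision quantization schemes typically land once runtime overhead is included.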

The use cases that 7B parameter models handle well include text classification, entity extraction, summarization, template completion, code assistance, and conversational interaction on defined topics. The use cases that exceed 7B model capability include complex multi-step reasoning, broad knowledge retrieval, and tasks that require the full capability of a frontier model.

For most enterprise mobile AI features — smart form filling, document summarization, workflow assistance, contextual search — a well-tuned 7B or smaller model performs at a level that users find genuinely useful. The gap between a 7B local model and a 70B cloud model is significant in benchmark terms. In practical terms, for the specific use cases a mobile enterprise app addresses, the gap often matters less than the privacy and offline benefits of the local option.

Talk to a Wednesday engineer about which AI architecture fits your enterprise mobile app's use case and compliance requirements.

Get my recommendation

Data handling and the training rights question

Under OpenAI's current API terms, API data is not used to train models by default. An enterprise can opt in to sharing data for training, but not sharing is the default. This is the current state of the terms, not a contractual guarantee.

The concern is not what OpenAI currently does. It is whether the terms provide the level of contractual certainty that regulated industries require, and whether a policy change could create a compliance problem.

For enterprises with legal or compliance teams reviewing the AI architecture, "currently opt-out by default" is typically insufficient. Regulated industries need:

  • A signed agreement specifying data handling obligations
  • A clear statement of where data is processed and stored
  • A BAA for PHI (for healthcare clients)
  • A DPA under GDPR (for apps with EU users)
  • Commitments about data retention and deletion

OpenAI's enterprise tier provides these commitments through a negotiated agreement. The standard API tier does not.

The practical implications:

  • Healthcare apps sending PHI to the standard ChatGPT API tier: HIPAA exposure without a BAA
  • Financial services apps sending account data to the API: SOC 2 audit scope questions
  • Apps with EU users sending personal data to the API: GDPR assessment required
  • Any regulated app: legal review of the API provider's DPA or enterprise agreement before production deployment

A local LLM bypasses this category of risk entirely. No data leaves the device. No third-party data handling agreement is required for the AI inference layer. The compliance review is internal.

Risk analysis by regulated industry

| Industry | ChatGPT API risk | Local LLM path |
| --- | --- | --- |
| Healthcare (HIPAA) | PHI exposure without a BAA on the standard tier; a BAA is available only on an enterprise agreement. | Local LLM for features processing PHI. Cloud API (with a BAA) only for non-PHI use cases. |
| Financial services (SOC 2, PCI) | API falls in SOC 2 audit scope. Account data requires a DPA. | Local LLM for account-level queries. Cloud API for non-sensitive queries with a DPA. |
| Insurance | State regulatory review of third-party data processors. | Local LLM default for policyholder data. |
| HR / workforce apps | Employee data protected under state privacy laws (CPRA, VCDPA). | Local LLM strongly preferred for employee data. |
| Field operations | Connectivity unreliable; API features fail offline. | Local LLM required for offline AI features. |

The pattern across regulated industries is consistent: use local LLM for features that process sensitive data or need to work offline, and use cloud API (with appropriate data agreements) only for features where cloud model capability is genuinely required and data sensitivity permits.
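That routing pattern can be written down as a small decision function. The sketch below is illustrative, not a standard API: the feature attributes and their names are assumptions chosen to mirror the four factors in the table above.

```python
from dataclasses import dataclass

@dataclass
class Feature:
    name: str
    processes_sensitive_data: bool    # PHI, account data, employee data, ...
    needs_offline: bool
    needs_frontier_model: bool        # exceeds what a ~7B on-device model can do
    has_enterprise_dpa: bool = False  # signed DPA/BAA covering this data flow

def route(f: Feature) -> str:
    """Return 'local' or 'cloud' following the regulated-industry pattern."""
    if f.needs_offline:
        return "local"   # API features fail without connectivity
    if f.processes_sensitive_data and not f.has_enterprise_dpa:
        return "local"   # no agreement in place, so data must not leave the device
    if f.needs_frontier_model:
        return "cloud"   # capability genuinely requires a large model
    return "local"       # default: private, and cheaper at scale

print(route(Feature("PHI summarizer", True, False, True)))        # -> local (no BAA)
print(route(Feature("market research Q&A", False, False, True)))  # -> cloud
```

The ordering encodes the priority: offline and data sensitivity veto the cloud before capability is even considered, which matches how a CISO-driven review usually runs.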

Cost comparison

| Approach | Engineering investment | Per-query cost | Annual cost at 50K DAU, 5 queries/day |
| --- | --- | --- | --- |
| ChatGPT API (GPT-4o mini) | Low (days) | $0.002-$0.006 | $182,000-$547,000 |
| ChatGPT API (GPT-4o) | Low (days) | $0.01-$0.02 | $912,000-$1,825,000 |
| Claude 3.5 Haiku | Low (days) | $0.001-$0.004 | $91,000-$365,000 |
| Local LLM (7B, on-device) | High ($50K-$90K build cost) | $0 | $0 |

The local LLM breaks even against the cheapest cloud API (Claude Haiku at the low end) in 9 to 12 months at 50,000 DAU. Against GPT-4o at enterprise scale, the break-even is under 6 months.

The three-year total cost of ownership comparison at 50,000 DAU:

  • GPT-4o API (midpoint): $1,370,000 per year x 3 years = $4,110,000 plus the initial build cost
  • Local LLM: $70,000 build cost plus ongoing model updates ($10,000 to $20,000 per year) = $100,000 to $130,000 over three years
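The figures above follow from straightforward arithmetic, and it is worth rerunning them with your own numbers. A sketch that reproduces the article's cases (the DAU, query volume, per-query prices, and build cost are the article's assumptions, not fixed quantities):

```python
def annual_api_cost(dau: int, queries_per_day: float, cost_per_query: float) -> float:
    """Annual cloud API spend: users * queries per day * 365 * unit price."""
    return dau * queries_per_day * 365 * cost_per_query

def breakeven_months(build_cost: float, annual_cloud_cost: float) -> float:
    """Months until a one-time local-LLM build cost beats recurring API fees."""
    return build_cost / (annual_cloud_cost / 12)

gpt4o_mid = annual_api_cost(50_000, 5, 0.015)  # GPT-4o midpoint, ~$1.37M/year
haiku_low = annual_api_cost(50_000, 5, 0.001)  # Claude Haiku low end, ~$91K/year

print(f"GPT-4o (mid): ${gpt4o_mid:,.0f}/yr over 3 yrs: ${3 * gpt4o_mid:,.0f}")
print(f"Break-even vs Haiku (low): {breakeven_months(70_000, haiku_low):.1f} months")
```

At a $70,000 build cost the model gives roughly 9 months to break even against the cheapest API, consistent with the 9 to 12 month figure above; against GPT-4o pricing the break-even lands in weeks.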

The cost difference at enterprise scale is decisive for use cases where a local LLM delivers sufficient quality.

The Wednesday approach

Wednesday evaluates every enterprise mobile AI feature against four questions before recommending a local LLM vs cloud API approach:

  • What does the feature need to do, and can a 7B parameter model do it well enough?
  • What data does the feature process, and what are the compliance implications of that data leaving the device?
  • Does the feature need to work offline?
  • What is the projected query volume at the expected user scale?

For healthcare and financial services clients, the starting position is local LLM where model capability is sufficient, with cloud API considered only for features where the capability gap is significant and the data can be processed under an appropriate enterprise data agreement.

For non-regulated clients, the analysis is driven by cost and capability. Cloud API for features that need frontier model capability. Local LLM for features that run well on smaller models, with a cost model that improves over the engagement lifetime.

The architecture decision is documented and reviewed with the CISO or technical buyer before development starts.

The data handling decision for your AI features should be made with legal and compliance input before development starts. Talk to Wednesday about the right architecture.

Book my 30-min call
4.8 on Clutch
4x faster with AI · 2x fewer crashes · 100% money back

Not ready for a call yet? Browse AI architecture guides, compliance analyses, and decision frameworks for enterprise mobile development.

Read more decision guides

About the author

Praveen Kumar

LinkedIn →

Technical Lead, Wednesday Solutions

Praveen leads mobile architecture at Wednesday Solutions, with a focus on AI feature integration for regulated-industry enterprise mobile apps.

Four weeks from this call, a Wednesday squad is shipping your mobile app. 30 minutes confirms the team shape and start date.

Get your start date

Shipped for enterprise and growth teams across US, Europe, and Asia

American Express
Visa
Discover
EY
Smarsh
Kalshi
BuildOps
Ninjavan
Kotak Securities
Rapido
PharmEasy
PayU
Simpl
Docon
Nymble
SpotAI
Zalora
Velotio
Capital Float
Buildd
Kunai
Kalsi