I need you to sit with an uncomfortable truth for a moment: your staff are almost certainly sending confidential business data to third-party AI servers. Right now. Today.
They’re pasting client contracts into ChatGPT to summarise them. They’re uploading financial spreadsheets to Claude to analyse trends. They’re feeding proprietary strategy documents into Gemini to get feedback. They’re sharing customer data, internal communications, competitive intelligence, and trade secrets with AI services hosted on servers they don’t control, in jurisdictions they haven’t considered, under terms they haven’t read.
They’re not doing this maliciously. They’re doing it because these tools are genuinely useful, and nobody told them not to.
The question isn’t whether this is happening in your organisation. It’s whether you know about it.
The scale of the problem
Let me paint the picture concretely. In a typical mid-market business with 50 to 200 employees, I’d estimate:
- 70-80% of knowledge workers use AI tools at least weekly
- The majority use personal accounts or free tiers, not enterprise plans
- Almost none have been given explicit guidance on what data they can and cannot share
- Very few understand how their data is handled by the AI provider
This isn’t speculation. Every organisation I’ve worked with on AI governance has discovered the same pattern: widespread AI adoption with virtually no oversight.
Here’s what that looks like in practice:
| Role | What they’re sharing | Where it’s going |
|---|---|---|
| Sales team | Client proposals, pricing, deal terms | ChatGPT (free tier) |
| Finance | Revenue figures, forecasts, budgets | Claude, ChatGPT |
| Legal/compliance | Contracts, legal opinions, regulatory docs | Various AI tools |
| HR | Employee records, performance reviews, salary data | ChatGPT, Gemini |
| Product/engineering | Source code, architecture diagrams, roadmaps | Copilot, Claude, ChatGPT |
| Executives | Strategy documents, board papers, M&A materials | Whatever’s convenient |
Every row in that table is a data leakage vector. And in most organisations, there’s no policy, no monitoring, and no awareness.
Why this matters more than you think
“But these AI companies are trustworthy.” Maybe. But trust isn’t the only issue.
Data training and retention
On consumer and free-tier plans, most AI providers reserve the right to use your inputs to improve their models. OpenAI’s consumer ChatGPT plans, for instance, use conversations for training by default unless you specifically opt out. Even when providers say they don’t train on your data, they typically retain inputs for some period for abuse monitoring, debugging, or legal compliance.
The implication: the confidential data your sales manager pasted into ChatGPT last Tuesday might be influencing model outputs for other users. Your competitive intelligence might be, in some diffuse way, accessible to your competitors.
Regulatory exposure
If your organisation handles personal data — and nearly every organisation does — you likely have obligations under privacy legislation. The Australian Privacy Act, GDPR (if you deal with EU residents), and various industry-specific regulations all have requirements around how personal data is processed and where it’s transferred.
When your HR manager pastes employee performance reviews into a US-based AI service, that’s likely an international data transfer. When your legal team uploads a contract containing client personal information, that’s third-party data processing. Both have regulatory implications that your organisation is probably not addressing.
Contractual obligations
Many B2B contracts contain confidentiality clauses that restrict how shared information can be processed. When your team feeds a client’s proprietary data into a third-party AI tool, you may be in breach of your contractual obligations. The client didn’t consent to their data being processed by OpenAI or Anthropic.
Competitive risk
Your proprietary knowledge — pricing strategies, product roadmaps, technical architectures, client lists — is what differentiates you. Every time that information enters a third-party AI system, you’re accepting some degree of risk that it could be exposed, retained, or influence outputs that benefit others.
What to do about it
The wrong response is to panic and ban all AI tools. That’s counterproductive, and your staff will simply use them on personal devices where you have even less visibility. The right response is controlled enablement: make it easy to use AI safely, and hard to use it dangerously.
1. Establish enterprise AI accounts
The single most impactful step: get your team onto enterprise AI plans.
- ChatGPT Team/Enterprise — OpenAI does not train on data from these plans
- Claude for Business/Enterprise — Anthropic does not train on business plan data
- Gemini for Google Workspace — enterprise-grade data handling within Google’s ecosystem
Enterprise plans typically cost $20-60 per user per month. That’s trivially cheap compared to the cost of a data breach or regulatory fine. Make these the default, and make personal AI accounts for work purposes explicitly prohibited.
2. Create a data classification policy for AI
Not all data carries the same risk. Create a simple, practical classification system:
Green — safe to share with approved AI tools:
- Publicly available information
- Generic questions and research
- Non-sensitive internal processes
- De-identified data
Amber — share with caution (enterprise AI plans only):
- Internal business documents
- Non-sensitive client work (with client awareness)
- Financial summaries (not raw data)
- General strategy discussions
Red — never share with cloud AI tools:
- Personal data (employee or customer)
- Raw financial data and detailed forecasts
- Legal opinions and active litigation materials
- M&A and competitive intelligence
- Source code for proprietary systems
- Board papers and confidential executive communications
For red-classified data, the answer is either “don’t use AI for this” or “use local AI that never leaves your premises.”
3. Deploy local AI for sensitive work
This is where edge computing and local AI become genuinely important for enterprise use. For work involving confidential data, local LLMs running on your own hardware ensure that data never leaves your control.
Tools like Ollama make this practical today. A capable local model running on an Apple Silicon Mac can handle summarisation, classification, drafting, and analysis without any data ever touching an external server.
It won’t be as capable as GPT-4 or Claude for complex reasoning tasks. But for the majority of sensitive-data tasks — summarise this contract, classify these documents, draft a response to this email — it’s more than adequate. And the data stays yours.
4. Publish an approved tools list
Your team needs a clear, short list of approved AI tools and what each can be used for. Not a 40-page policy document that nobody reads. A single page:
- Approved tools: [List with links to enterprise logins]
- What you can use them for: [Clear examples]
- What you cannot share: [Specific, concrete examples relevant to your business]
- For sensitive data: [Point to local AI tools or alternatives]
- Questions? [Who to contact]
Pin it in Slack. Put it on the intranet. Reference it in onboarding. Make it impossible to miss.
5. Train your team with real examples
Generic “data security awareness” training is useless. Your team needs specific, relevant examples:
- “Don’t paste the Johnson & Co contract into ChatGPT to summarise it. Use [approved local tool] instead.”
- “You can use Claude to draft marketing copy, but don’t include actual revenue figures in your prompt.”
- “Before uploading any document to an AI tool, check: does this contain names, financial data, or information a client shared in confidence?”
Make it concrete. Make it about their actual workflow. Abstract policies get ignored. Specific examples change behaviour.
6. Audit and monitor
You can’t manage what you don’t measure. Establish a lightweight review process:
- Quarterly check-ins on which AI tools are being used across the organisation
- Review of AI-related spend (if people are expensing personal AI subscriptions, that’s a signal)
- Periodic spot-checks with team leads on how AI is being used in their departments
- An open channel for people to ask “can I use AI for this?” without fear of punishment
The conversation you need to have this week
Here’s my recommendation: before you do anything else, have one conversation. Gather your leadership team and ask a simple question: “What data do we think our team is sharing with AI tools?”
I guarantee the answer will be uncomfortable. And that discomfort is the starting point for building a responsible AI practice.
This isn’t about slowing down AI adoption. It’s about making sure your adoption doesn’t create a liability that overshadows the productivity gains. The governance framework doesn’t need to be perfect. It needs to exist.
Your staff aren’t the problem. The absence of guidance is the problem. Fix that, and you turn a risk into an advantage.
Frequently Asked Questions
Is data entered into ChatGPT used for training?
It depends on the plan. On free and Plus consumer plans, OpenAI may use your inputs to train future models unless you opt out in settings. On ChatGPT Team, Enterprise, and API plans, OpenAI does not train on your data by default. The same general pattern applies to most AI providers — consumer plans offer weaker data protections than enterprise plans. If your staff are using personal accounts or free tiers, assume your data is being used.
How do I know what data my employees are sharing with AI tools?
Most organisations have no visibility into this. Standard IT monitoring doesn’t typically track what text is pasted into a browser-based AI tool. The practical approach is threefold: first, establish a clear policy on what data can and cannot be shared with AI tools. Second, provide approved enterprise AI tools that offer proper data handling. Third, train your team with specific examples relevant to your business so they understand the boundaries.
Can I completely prevent data leakage to AI tools?
Not without severely restricting your team’s productivity. Blocking all AI tools is impractical and counterproductive — your competitors aren’t doing it, and your staff will find workarounds. The better approach is controlled enablement: provide approved tools with enterprise data agreements, create clear data classification policies, deploy local AI solutions for sensitive work, and build a culture where data security is understood rather than merely enforced.
Worried about what your team is sharing with AI? We can run a confidential AI security audit and help you build practical policies that protect your data without killing productivity. Talk to us.
If this sparked something, let's talk.
No pitch, no pressure — just a conversation about what you're working on.
Let's talkRelated posts
AI Implementation
Bolt-On AI vs DNA-Level AI: Why Surface-Level Adoption Fails
Most businesses are bolting AI onto broken workflows. Here's the difference between bolt-on AI and DNA-level AI — and the practical, low-risk way to move from one to the other without breaking the business.
AI Implementation
Data Model-Driven AI: Why Your Data Architecture Determines Your AI Ceiling
Your AI is only as good as your data model. Here's why data architecture is the overlooked foundation of every successful AI implementation — and how to fix yours.