Discovering Your Data: Speed vs. Defensibility

June 17, 2025
Reading time: 2 mins
Lane Sullivan
Chief Information Security and Strategy Officer

As unstructured data sprawls across today’s ecosystem (cloud, on-prem, and SaaS) at an unmanageable rate, the pressure to “find data faster” has never been higher.

In response, some vendors have turned to exploratory sampling: scanning only a subset of files and inferring the rest based on metadata — file type, size, storage location, or directory structure. Folder-level heuristics or clustering are then used to “fill in the gaps.”
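To make the mechanism concrete, here is a minimal sketch of folder-level inference. It is an illustration under stated assumptions, not any vendor's actual implementation: the file inventory, the `sample_and_infer` and `missed_sensitive` helpers, and the "first N files per folder" selection are all hypothetical (a real sampler would likely pick files at random).

```python
from collections import defaultdict

# Hypothetical inventory: (folder, filename, truly_sensitive).
# The flag stands in for what a full content scan would reveal.
FILES = [
    ("/finance", "q1.xlsx", True),
    ("/finance", "q2.xlsx", True),
    ("/marketing", "logo.png", False),
    ("/marketing", "brochure.pdf", False),
    ("/marketing", "customer_export.csv", True),  # outlier in a "clean" folder
]

def sample_and_infer(files, per_folder=1):
    """Inspect only `per_folder` files per folder and label the rest by inference."""
    by_folder = defaultdict(list)
    for folder, name, sensitive in files:
        by_folder[folder].append((name, sensitive))

    labels = {}
    for folder, entries in by_folder.items():
        # Deterministic stand-in for sampling: first files in name order.
        inspected = sorted(entries)[:per_folder]
        # Folder-level heuristic: the sample's verdict is applied to every file.
        folder_sensitive = any(sensitive for _, sensitive in inspected)
        for name, _ in entries:
            labels[(folder, name)] = folder_sensitive
    return labels

def missed_sensitive(files, labels):
    """Return sensitive files that the inferred labels marked as clean."""
    return [(folder, name) for folder, name, sensitive in files
            if sensitive and not labels[(folder, name)]]

if __name__ == "__main__":
    print(missed_sensitive(FILES, sample_and_infer(FILES)))
```

In this toy inventory, the one inspected marketing file is clean, so the whole folder is labeled clean and the customer export is never flagged.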

But here’s the problem: If you don’t inspect every file, any assurance you give is speculative — and speculation isn’t defensible.

In data security, identifying sensitive data is only the first step. You must also ensure that discovery was intentional, comprehensive, and governed by controls. That’s the essence of due care.

Put simply, exploratory sampling breaks the chain of due care — and with it, defensibility.

What are the technical risks with exploratory sampling?

· Threat actors will find any weakness, including the gaps that exploratory sampling leaves uninspected

· Exploratory sampling assumes uniformity, but unstructured data is inherently non-uniform

· You can’t de-weaponize data if you don’t know what or where it is
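The blind-spot risk can be put in rough numbers. The sketch below assumes a uniform random sample drawn without replacement and computes the chance that it contains none of the sensitive files (a standard hypergeometric calculation). The function name `miss_probability` and the figures in the example are illustrative assumptions, not data from any real environment.

```python
from math import comb

def miss_probability(total_files, sensitive_files, sampled_files):
    """Chance that a uniform random sample contains zero sensitive files."""
    clean_files = total_files - sensitive_files
    if sampled_files > clean_files:
        # The sample is larger than the clean population, so it must hit one.
        return 0.0
    # Ways to draw only clean files, divided by all possible samples.
    return comb(clean_files, sampled_files) / comb(total_files, sampled_files)

if __name__ == "__main__":
    # Illustrative numbers: 1,000 files, 5 sensitive, 10% sampled.
    p = miss_probability(1000, 5, 100)
    print(f"Chance the sample sees no sensitive file: {p:.2f}")
```

With these illustrative numbers, the sample misses every sensitive file roughly 59% of the time, and that is under the favorable assumption that the sample is truly random. Only when every file is inspected does the miss probability drop to zero.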

Exploratory sampling may improve performance — but it creates blind spots that aren’t defensible to an auditor, regulator, or board.

Final Thought

In any data security program, visibility is the foundation of defensibility. If your technology relies on exploratory sampling — even if it sounds efficient — ask yourself this:

What will you say after a breach? How will you explain what you didn’t see?

That’s a position most CISOs can’t afford to defend post-incident.

I encourage you to make your data protection strategy defensible by asking the right questions about exploratory sampling.
