Tampa, FL, USA
1 day ago
Data Engineer

Barbaricum is seeking a highly technical Data Engineer to drive the massive data discovery and classification effort for the Zero Trust initiative at U.S. Special Operations Command (USSOCOM). Before data can be protected, it must be found and understood. You will be responsible for illuminating "dark data" across the Command’s complex information environment, ranging from hyperscale cloud data lakes on NIPR to legacy file shares and isolated storage arrays on the SIPR and Top-Secret networks.

As a Data Engineer, you will architect and manage the deployment of advanced discovery platforms, specifically BigID and NetApp BlueXP. You will configure these tools to crawl petabytes of structured (SQL/Oracle), semi-structured (logs/NoSQL), and unstructured (SharePoint/File Shares) data. Your primary mission is to build the "Global Data Inventory"—a dynamic, real-time map of where sensitive CUI and classified intelligence resides—enabling the security teams to apply precision protection. You will use your knowledge of data pipelines and storage infrastructure to ensure that scanning operations provide 100% visibility without degrading network performance.

Responsibilities:

Data Discovery Architecture: Deploy and manage BigID and NetApp BlueXP scanners across hybrid environments, including configuring dockerized collectors for air-gapped discovery on the Top-Secret network. Structured Data Mapping: Connect discovery tools to enterprise databases (SQL Server, Oracle, PostgreSQL) to scan schemas and columns for PII, DoD ID numbers, and other sensitive indicators without impacting database performance. Unstructured Data Crawling: Configure scans for massive file repositories (NetApp NAS, QNAP, SharePoint On-Premises), optimizing scan windows and throttling to prevent latency for mission users. Cloud Data Integration: Utilize Microsoft Purview Data Map and custom connectors to inventory data residing in AWS S3 buckets, Azure Blobs, and Data Lakes. Classification Tuning: Collaborate with mission owners to train Machine Learning (ML) classifiers to recognize unique USSOCOM data types (e.g., mission names, operational codes) and reduce false positive rates in the data inventory.

Qualifications:

BA/BS or MA/MS (preferred) Years Exp: 3-10 CompTIA Security+ CE (or higher) to meet DoD 8570 IAT Level II requirements. Active Top-Secret clearance with SCI eligibility. Certified Kubernetes Administrator (CKA). Data Discovery Expertise: Proven experience deploying and managing enterprise data discovery and governance platforms such as BigID, Varonis, NetApp BlueXP (Data Sense), or Informatica. Storage & Database Knowledge: Strong understanding of storage protocols (NFS, SMB/CIFS, S3) and database structures (SQL, NoSQL) to troubleshoot connectivity and scanning access. Containerization: Proficiency with Kubernetes and Docker, as many modern discovery collectors are deployed as containerized microservices. Data Handling: Experience dealing with large-scale data inventories (Petabyte scale) and understanding of data lineage and provenance concepts.

Preferred:

Experience with Microsoft Purview Data Map (formerly Azure Purview). Knowledge of Regex for custom data pattern definition. Background in Data Engineering or ETL pipeline development. Familiarity with USSOCOM network architecture and storage standards. Azure Data Engineer Associate or BigID Certified Professional.
Confirm your E-mail: Send Email