Ark helps engineering and security teams share realistic test data without copying raw
production databases to the cloud. This page explains what runs where, what stays in your
network, and how to reach your first masked sandbox.
Your organization's isolated workspace — users, connections, jobs, and policies are scoped to one tenant.
Connection
A registered source or target database endpoint. Credentials are resolved inside your VPC by the agent, not stored as plaintext in SaaS when using env-ref mode.
Source profile
The governance object for a source DB: schema snapshot, classification report, and approved masking rules.
Config
A runnable recipe derived from a source profile — extraction limits, masking runtime, target connection, and test-environment settings.
Job
A unit of work dispatched to your agent: classification, subset export, synthetic data generation, or sandbox provisioning.
Test environment
An ephemeral MySQL or PostgreSQL database provisioned in your VPC from a config, pre-loaded with a masked subset.
05
First deployment
From tenant to first masked database
01
Ark provisions your tenant
Your Ark contact creates the tenant. An admin logs into the console and invites team members with the right roles.
02
Deploy the agent in your network
Create an agent in the console, copy the token, and run ark-agent in your VPC with outbound access to the control plane URL.
03
Register database connections
Add source (and optional target) connections. Test connectivity through the agent — the control plane never opens a direct SQL session.
04
Classify and approve masking
Run classification on a source profile. Security reviewers approve masking rules before data leaves production patterns.
05
Create a config and run your first job
Bind profile to target, set row limits, then trigger a subset or test-environment job. Monitor progress in the console or CLI.