Data Vault
The $3.5M Strategic Asset Foundation
CLONES launches with the largest repository of Computer Use Agent (CUA) training data in Web3, providing immediate market leadership and enterprise credibility through proven, validated assets.
Complete Asset Breakdown
User Action Sequences
Volume: 24,800,000 actions
Rate: $0.08 per action
Value: $1,984,000
Coverage: Complete interaction sequences
Mouse Event Data
Volume: 24,600,000 events
Rate: $0.05 per event
Value: $1,230,000
Coverage: Precision cursor tracking
Structured Text Prompts
Volume: 463,000 prompts
Rate: $0.50 per prompt
Value: $231,500
Coverage: Task instructions and contextual guidance
Keyboard Input Data
Volume: 294,000 events
Rate: $0.20 per event
Value: $58,800
Coverage: Text input and shortcuts
Video Demonstrations
Volume: 1,550,000 minutes
Rate: $0.0015 per minute
Value: $2,325
Coverage: Complete workflow screen recordings
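The per-category values above are simply volume multiplied by rate, and their sum gives the headline figure. The following is a minimal sketch for checking that arithmetic; the names VAULT_ASSETS, asset_value, and vault_total are illustrative only and are not part of any CLONES tooling.

```python
from decimal import Decimal

# (category, volume, rate in USD per unit), copied from the breakdown above.
# The structure and names here are hypothetical, used only for illustration.
VAULT_ASSETS = [
    ("User Action Sequences", 24_800_000, Decimal("0.08")),
    ("Mouse Event Data", 24_600_000, Decimal("0.05")),
    ("Structured Text Prompts", 463_000, Decimal("0.50")),
    ("Keyboard Input Data", 294_000, Decimal("0.20")),
    ("Video Demonstrations", 1_550_000, Decimal("0.0015")),
]


def asset_value(volume, rate):
    """Value of one category: volume times per-unit rate."""
    return volume * rate


def vault_total(assets):
    """Sum of all category values."""
    return sum((asset_value(v, r) for _, v, r in assets), Decimal("0"))


if __name__ == "__main__":
    for name, volume, rate in VAULT_ASSETS:
        print(f"{name}: ${asset_value(volume, rate):,.2f}")
    # Prints "Total: $3,506,625.00", which rounds to the $3.5M headline.
    print(f"Total: ${vault_total(VAULT_ASSETS):,.2f}")
```

Decimal is used instead of float so that fractional rates such as $0.0015 multiply out exactly.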
Valuation Methodology
Industry Rate Analysis
Enterprise AI datasets: $50,000-$200,000 per specialized collection
Manual annotation: $0.10-$5.00 per data point
Screen recording: $0.005-$0.015 per minute
Workflow documentation: $0.25-$2.00 per action sequence
Our Conservative Approach
Video data: $0.0015/min (70% below the $0.005 low end of the screen recording range, 85% below its $0.01 midpoint)
Action sequences: $0.08/action (below the $0.10 low end of manual annotation rates)
Mouse events: $0.05/event (below specialized tracking rates)
Text prompts: $0.50/prompt (standard for structured instructions)
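One way to see how far a vault rate sits from a quoted industry range is to compute its discount against the range's low end and midpoint. The sketch below does this for the video and action rates, using only the ranges listed under Industry Rate Analysis; the helper name discount_vs and the choice of low end and midpoint as benchmarks are assumptions made for illustration, not a stated CLONES methodology.

```python
from decimal import Decimal


def discount_vs(rate, benchmark):
    """Fraction by which `rate` sits below `benchmark` (negative if above)."""
    return (benchmark - rate) / benchmark


# Screen recording range from the Industry Rate Analysis list (USD per minute).
video_low, video_high = Decimal("0.005"), Decimal("0.015")
video_mid = (video_low + video_high) / 2
video_rate = Decimal("0.0015")

# Low end of the manual annotation range (USD per data point).
annotation_low = Decimal("0.10")
action_rate = Decimal("0.08")

if __name__ == "__main__":
    print(f"Video vs low end:  {discount_vs(video_rate, video_low):.0%}")   # 70%
    print(f"Video vs midpoint: {discount_vs(video_rate, video_mid):.0%}")   # 85%
    print(f"Action vs annotation low end: "
          f"{discount_vs(action_rate, annotation_low):.0%}")                # 20%
```

Which benchmark counts as the "market average" depends on the underlying rate survey, which is not given here, so the percentages should be read relative to the quoted ranges only.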
Quality Standards
Algorithmic scoring through validation systems
Human verification for critical workflows
Task completion tracking with success metrics
Enterprise-ready formatting for immediate deployment
How Did We Acquire This Edge?
We identified a team that had spent 12 months collecting Computer Use Agent training data but lacked the infrastructure to bring it to market.
The Acquisition: Where they stalled, we stepped in. We committed our own time, resources, and personal investment to the unfinished project so the work would not disappear. When the team disbanded, we secured the complete dataset.
The Integration: We took their raw data collection and integrated it with our tokenization framework and distribution model. This meant rebuilding the technical infrastructure and creating an entirely new business model around the assets.
The Result: CLONES now holds full ownership of this data repository. By securing 12 months of specialized CUA training data at a fraction of the typical build cost, we’ve established the largest Web3-native CUA training dataset.
The Position: While competitors begin from scratch, CLONES enters the market with independently valued assets worth $3.5M. Others face a year of groundwork just to reach our starting point, while we’ll already be accelerating through network effects, community contributions, and sustained compounding growth.
How Will We Use This Edge?
Immediate Dataset Tokenization
Transform vault assets into tradeable tokens with threshold-gated access:
Foundation Meta-Datasets using vault data as premium seed content
Quality Benchmarks established from vault data analysis
Instant Trading Volume through tokenized vault assets
Enterprise Positioning leveraging proven data quality
Community Growth Foundation
Use vault data to establish ecosystem standards:
Quality baselines for community contributions
Validation algorithms trained on vault data patterns
Meta-dataset curation prioritizing vault-level quality
Network effects where community data enhances vault value
Enterprise Market Entry
Leverage vault assets for immediate commercial relationships:
Proven capabilities demonstrating CUA training expertise
Ready-to-deploy datasets eliminating enterprise development time
Quality differentiation against competitors starting from zero
Revenue generation through direct licensing and tokenized access
Market Leadership Position
Largest Repository of CUA Training Data in Web3
Most comprehensive collection of Computer Use Agent training data in Web3
12-month head start while competitors try to capture data from scratch
Enterprise-ready assets enabling immediate commercial deployment
Quality leadership setting Web3 ecosystem standards from launch
Enhanced Collection Infrastructure
Our original data collection tool has been significantly improved post-acquisition:
CLONES Quality Agent with smarter AI scoring beyond rigid percentages
Better context awareness detecting creative problem-solving and strategic choices
Confidence levels providing reliability metrics (0-100%) for each assessment
Holistic evaluation capturing nuances missed by mechanical scoring systems
This enhanced infrastructure lets us capture even higher-quality data going forward while maintaining our lead over competitors starting from zero.
The Data Vault transforms CLONES from a startup concept into an established platform with validated assets, proven methodology, and immediate market leadership.