Data Vault

The $3.5M Strategic Asset Foundation

CLONES launches with the largest repository of Computer Use Agent training data in Web3 — providing immediate market leadership and enterprise credibility through proven, validated assets.


Complete Asset Breakdown

User Action Sequences

Volume: 24,800,000 actions
Rate: $0.08 per action
Value: $1,984,000
Coverage: Complete interaction sequences

Mouse Event Data

Volume: 24,600,000 events
Rate: $0.05 per event
Value: $1,230,000
Coverage: Precision cursor tracking

Structured Text Prompts

Volume: 463,000 prompts
Rate: $0.50 per prompt  
Value: $231,500
Coverage: Task instructions and contextual guidance

Keyboard Input Data

Volume: 294,000 events
Rate: $0.20 per event
Value: $58,800
Coverage: Text input and shortcuts

Video Demonstrations

Volume: 1,550,000 minutes
Rate: $0.0015 per minute
Value: $2,325
Coverage: Complete workflow screen recordings

Valuation Methodology

Industry Rate Analysis

  • Enterprise AI datasets: $50,000-$200,000 per specialized collection

  • Manual annotation: $0.10-$5.00 per data point

  • Screen recording: $0.005-$0.015 per minute

  • Workflow documentation: $0.25-$2.00 per action sequence

Our Conservative Approach

  • Video data: $0.0015/min (67% below market average)

  • Action sequences: $0.08/action (competitive with annotation services)

  • Mouse events: $0.05/event (below specialized tracking rates)

  • Text prompts: $0.50/prompt (standard for structured instructions)

Quality Standards

  • Algorithmic scoring through validation systems

  • Human verification for critical workflows

  • Task completion tracking with success metrics

  • Enterprise-ready formatting for immediate deployment


How Did We Acquire This Edge?

We identified an opportunity where a team had spent 12 months collecting Computer Use Agent training data but lacked the infrastructure for market deployment

The Acquisition: Where they stalled, we stepped in. We aligned our own time, resources, and personal investment with the unfinished project, ensuring the work didn’t disappear. When the team disbanded, we were able to secure the complete dataset.

The Integration: We took their raw data collection and integrated it with our tokenization framework and distribution model. This meant rebuilding the technical infrastructure and creating an entirely new business model around the assets.

The Result: CLONES now holds full ownership of this data repository. By securing 12 months of specialized CUA training data at a fraction of the typical build cost, we’ve established the largest Web3-native dataset.

The Position: While competitors begin from scratch, CLONES enters the market with independently valued assets worth $3.5M. Others face a year of groundwork just to reach our starting point, while we’ll already be accelerating through network effects, community contributions, and sustained compounding growth


How Will We Use This Edge?

Immediate Dataset Tokenization

Transform vault assets into tradeable tokens with threshold-gated access:

  • Foundation Meta-Datasets using vault data as premium seed content

  • Quality Benchmarks established from vault data analysis

  • Instant Trading Volume through tokenized vault assets

  • Enterprise Positioning leveraging proven data quality

Community Growth Foundation

Use vault data to establish ecosystem standards:

  • Quality baselines for community contributions

  • Validation algorithms trained on vault data patterns

  • Meta-dataset curation prioritizing vault-level quality

  • Network effects where community data enhances vault value

Enterprise Market Entry

Leverage vault assets for immediate commercial relationships:

  • Proven capabilities demonstrating CUA training expertise

  • Ready-to-deploy datasets eliminating enterprise development time

  • Quality differentiation against competitors starting from zero

  • Revenue generation through direct licensing and tokenized access


Market Leadership Position

Largest Repository of CUA Training Data in Web3

  • Most comprehensive collection of Computer Use Agent training data in Web3

  • 12-month head start while competitors try to capture data from scratch

  • Enterprise-ready assets enabling immediate commercial deployment

  • Quality leadership setting Web3 ecosystem standards from launch

Enhanced Collection Infrastructure

Our original data collection tool has been significantly improved post-acquisition:

  • CLONES Quality Agent with smarter AI scoring beyond rigid percentages

  • Better context awareness detecting creative problem-solving and strategic choices

  • Confidence levels providing reliability metrics (0-100%) for each assessment

  • Holistic evaluation capturing nuances missed by mechanical scoring systems

This enhanced infrastructure ensures we can capture even higher quality data moving forward while maintaining our lead over competitors starting from zero.

The Data Vault transforms CLONES from a startup concept into an established platform with validated assets, proven methodology, and immediate market leadership.

Last updated