💾 Data & Downloads

Survey Responses and Analysis Exports

📊 Two Dataset Approaches

This project uses two complementary datasets derived from the same 500 survey responses:

  • 🎭 Persona-Based Dataset: Full survey responses with rule-based persona classification (6 personas, 25 columns)
  • 📊 Segment-Based Dataset: Transformed data with statistical K-means clustering (5 segments, 11 selected columns)

Both datasets contain the same 500 respondents but are structured differently for their respective analysis methodologies.

🎉 NEW: Interactive Response Viewer

View and explore all 500 survey responses directly in your browser!

No download is needed: the interactive table supports sorting, filtering, searching, and exporting.

📊 Available Datasets

🎭 Persona-Based Dataset: Full Survey Responses with Rule-Based Classification

persona_web_hosting_data.csv

Primary Analysis Dataset

  • Records: 500 survey responses
  • Seed: 42 (for reproducibility)
  • Columns: 25 (Persona + Q1-Q24)
  • Classification: Rule-based personas (6 types)
  • WTP Format: Text ranges (e.g., "$51-100/month")
  • File Size: ~85KB
  • Status: Primary dataset

Personas (6 Types)

  • Small Business Owner (30.6%, Mean WTP: $60.15)
  • Marketing Professional (19.4%, Mean WTP: $79.27)
  • Agency/Freelance (19.2%, Mean WTP: $145.11)
  • Hobbyist (17.2%, Mean WTP: $45.75)
  • Enterprise (13.6%, Mean WTP: $368.36)

Contents: Complete survey responses with persona identification, all question responses (including conditional questions), and budget tiers. This is the primary dataset for persona-based analysis.

🎯 Strategic Segments Dataset: Jobs-to-be-Done Market Framework

segment-profiles.md

Strategic Segments Analysis

  • Records: 5 strategic segments (A-E)
  • Framework: Jobs-to-be-Done (JTBD)
  • Content: Segment profiles with JTBD, value drivers, pain points, triggers
  • Classification: Motivation-based segmentation
  • WTP Format: Range values (e.g., $8-25/month)
  • File Size: ~8KB
  • Status: Strategic analysis dataset

Strategic Segments (5 Groups)

  • Segment A: Ease-First Starters ($8-25/mo) - "Fast-Lane Launchers"
  • Segment B: Performance Maximizers ($40-120/mo) - "Speed & Stability Seekers"
  • Segment C: Agencies & Pro Creators ($80-200/mo) - "Multi-Site Professionals"
  • Segment D: Growth Marketers & Commerce Builders ($50-150/mo) - "ROI Optimizers"
  • Segment E: Technical Builders / DevOps-Lite ($100-300/mo) - "Full-Control Builders"

Contents: Complete strategic segment profiles including core identity, Jobs-to-be-Done statements, value drivers, pain points, marketing triggers, and DreamHost product recommendations. Optimized for product development and marketing strategy.

📈 Analysis Output Data

🎭 Persona-Based Analysis Outputs

persona_profiles.csv

Detailed Persona Statistics

  • Records: 6 personas
  • Columns: 9 metrics
  • Purpose: Marketing team persona targeting

Metrics Included: Sample size, percentage of total, mean budget, median budget, standard deviation, average feature count, value score, price sensitivity
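The per-persona statistics listed above could be computed along the following lines. This is a minimal sketch, not the project's analyze_wtp.py: the `profile` helper and the sample rows are hypothetical, and it assumes each response has already been reduced to a persona label and a numeric monthly budget.

```python
from collections import defaultdict
from statistics import mean, median, stdev


def profile(rows):
    """Summarise (persona, budget) pairs into per-persona statistics."""
    budgets = defaultdict(list)
    for persona, budget in rows:
        budgets[persona].append(budget)
    total = sum(len(values) for values in budgets.values())
    return {
        persona: {
            "n": len(values),
            "pct": round(100 * len(values) / total, 1),
            "mean": round(mean(values), 2),
            "median": median(values),
            # Sample standard deviation needs at least two observations.
            "std": round(stdev(values), 2) if len(values) > 1 else 0.0,
        }
        for persona, values in budgets.items()
    }


# Hypothetical rows for illustration only; not taken from the dataset.
rows = [("Hobbyist", 5), ("Hobbyist", 18), ("Enterprise", 350.5), ("Enterprise", 600)]
stats = profile(rows)
```

Value score and price sensitivity are left out because this page does not define how they are calculated.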

feature_correlations.csv

Feature-WTP Correlation Analysis

  • Records: 7 priority features
  • Columns: 8 statistical metrics
  • Purpose: Product team feature prioritization

Metrics Included: Feature name, selection rate, mean budget (selected/not selected), budget difference, t-statistic, p-value, Cohen's d, significance flag
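As an illustration of the t-statistic and Cohen's d columns, the sketch below compares budgets of respondents who selected a feature against those who did not. It assumes Welch's unequal-variance t-test and a pooled-standard-deviation Cohen's d; the page does not say which variants the analysis script uses, and the function names and sample budgets are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(selected, not_selected):
    """Welch's t-statistic for two independent samples with unequal variances."""
    n1, n2 = len(selected), len(not_selected)
    standard_error = sqrt(variance(selected) / n1 + variance(not_selected) / n2)
    return (mean(selected) - mean(not_selected)) / standard_error


def cohens_d(selected, not_selected):
    """Cohen's d using the pooled sample standard deviation."""
    n1, n2 = len(selected), len(not_selected)
    pooled_variance = (
        (n1 - 1) * variance(selected) + (n2 - 1) * variance(not_selected)
    ) / (n1 + n2 - 2)
    return (mean(selected) - mean(not_selected)) / sqrt(pooled_variance)


# Hypothetical monthly budgets, for illustration only.
selected = [38, 75.5, 150.5, 75.5]
not_selected = [5, 18, 38, 18]

budget_difference = mean(selected) - mean(not_selected)
t_stat = welch_t(selected, not_selected)
effect_size = cohens_d(selected, not_selected)
```

The p-value is omitted because it needs a t-distribution CDF, which the standard library does not provide; a statistics package such as SciPy would normally supply it.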

pricing_tiers.csv

Recommended Pricing Tier Structure

  • Records: 4 tiers (Basic, Professional, Business, Enterprise)
  • Columns: 5 attributes
  • Purpose: Finance team revenue modeling

Attributes: Tier name, price point, target percentile, market coverage, positioning statement

monetization_models.csv

Four Monetization Models Classification

  • Records: 4 models (Maximizer, Penetrator, Underdog, Champion)
  • Columns: 6 metrics
  • Purpose: Strategy team segment targeting

Metrics Included: Model name, customer count, percentage, mean budget, mean value score, revenue potential

Persona Analysis Visualizations (6 PNG files)

  • persona_comparison.png - Persona distribution and characteristics
  • wtp_distribution.png - Willingness-to-Pay distribution across personas
  • feature_correlations.png - Feature-WTP correlation matrix
  • price_sensitivity.png - Price sensitivity by persona
  • monetization_models.png - Four monetization models analysis
  • feature_classification_matrix.png - Feature importance classification

🎯 Strategic Segments Analysis Outputs

segment-profiles.md

Strategic Segment Profiles

  • Records: 5 strategic segments (A-E)
  • Content: JTBD framework, value drivers, pain points, triggers
  • Purpose: Product development and marketing strategy

Content Included: Core identity, Jobs-to-be-Done statements, value drivers, pain points, WTP ranges, marketing triggers, and DreamHost product recommendations for each segment

Strategic Segments on Website

  • segments.html - Complete strategic segment profiles with JTBD framework
  • Persona-to-segment mapping for coordinated strategy
  • Product recommendations for each segment
  • Marketing trigger messages for conversion optimization

📋 Sample Data Preview

Persona Distribution (from persona_web_hosting_data.csv)

Persona                | Count | Percentage | Mean WTP
-----------------------+-------+------------+---------
Small Business Owner   | 153   | 30.6%      | $60.15
Marketing Professional | 97    | 19.4%      | $79.27
Agency/Freelance       | 96    | 19.2%      | $145.11
Hobbyist               | 86    | 17.2%      | $45.75
Enterprise             | 68    | 13.6%      | $368.36

Budget Distribution (from persona_web_hosting_data.csv)

Budget Tier    | Count | Percentage | Midpoint Value
---------------+-------+------------+---------------
$0-10/month    | 98    | 19.6%      | $5
$11-25/month   | 127   | 25.4%      | $18
$26-50/month   | 138   | 27.6%      | $38
$51-100/month  | 43    | 8.6%       | $75.50
$101-200/month | 37    | 7.4%       | $150.50
$201-500/month | 19    | 3.8%       | $350.50
$501+/month    | 38    | 7.6%       | $600
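The midpoint values above are the arithmetic mean of each range's bounds, with $600 assigned to the open-ended top tier. A minimal sketch of that conversion follows; the `budget_midpoint` helper and the `OPEN_ENDED_MIDPOINT` constant are illustrative names, not part of the project scripts.

```python
import re

# Value the budget table assigns to the open-ended "$501+/month" tier.
OPEN_ENDED_MIDPOINT = 600.0


def budget_midpoint(label):
    """Convert a budget label such as "$51-100/month" to a numeric midpoint."""
    numbers = [int(n) for n in re.findall(r"\d+", label)]
    if len(numbers) == 2:
        low, high = numbers
        return (low + high) / 2
    if len(numbers) == 1 and "+" in label:
        return OPEN_ENDED_MIDPOINT
    raise ValueError(f"Unrecognised budget label: {label!r}")


midpoints = {
    label: budget_midpoint(label)
    for label in ("$0-10/month", "$51-100/month", "$501+/month")
}
```

Raising on unrecognised labels, rather than returning a default, keeps malformed Q24 values from silently skewing any mean computed from the midpoints.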

Recommended Pricing Tiers (from pricing_tiers.csv)

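Tier         | Price Point | Target Percentile | Market Coverage
-------------+-------------+-------------------+----------------
Basic        | $18/month   | 25th percentile   | 25%
Professional | $38/month   | 50th percentile   | 50%
Business     | $150/month  | 75th percentile   | 25%
Enterprise   | $350/month  | 90th percentile   | 10%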
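One way to relate the target percentiles to the budget distribution is to walk the budget tiers in order and pick the first one whose cumulative share of respondents reaches the percentile. This is a sketch under that assumption only; the page does not state how pricing_tiers.csv was derived, and `BUDGET_TIERS` and `percentile_midpoint` are illustrative names.

```python
# (tier midpoint, respondent count) pairs copied from the budget distribution table.
BUDGET_TIERS = [
    (5, 98),
    (18, 127),
    (38, 138),
    (75.5, 43),
    (150.5, 37),
    (350.5, 19),
    (600, 38),
]


def percentile_midpoint(percentile, tiers=BUDGET_TIERS):
    """Return the midpoint of the first tier whose cumulative share reaches `percentile`."""
    total = sum(count for _, count in tiers)
    cumulative = 0
    for midpoint, count in tiers:
        cumulative += count
        if 100 * cumulative / total >= percentile:
            return midpoint
    return tiers[-1][0]


price_points = {p: percentile_midpoint(p) for p in (25, 50, 75, 90)}
```

Under this reading the 25th, 50th, and 90th percentiles land on $18, $38, and $350.50, in line with the Basic, Professional, and Enterprise price points. The 75th percentile lands on $75.50, so the $150 Business price presumably comes from a different rule.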

🔄 Data Regeneration

How to Regenerate All Data

Both datasets can be fully reproduced using the provided Python scripts:

Persona-Based Dataset Generation

# Generate persona-based dataset (500 responses, seed=42)
python3 generate_survey_responses.py 42 500 persona_web_hosting_data.csv

Persona-Based Analysis

# Run full WTP analysis on persona dataset
python3 analyze_wtp.py

Result: Generates 6 analysis CSV files and 6 visualization PNG files in the analysis/ directory.

Segment-Based Analysis

# Analyze segment-based dataset (requires segmented_web_hosting_data.csv)
python3 analyze_segments.py

Result: Generates segment_profiles.csv and 4 segment visualization PNG files in the analysis/ directory.

Note: The segmented_web_hosting_data.csv is a derived dataset created through data transformation and K-means clustering. Both datasets use the same seed (42) for reproducibility.

📖 Data Dictionary

Survey Response Fields (persona_web_hosting_data.csv)

Field   | Type                    | Description
--------+-------------------------+------------
Persona | String                  | Persona classification (Small Business Owner, Marketing Professional, etc.)
Q1      | String                  | Role identification
Q2      | String                  | Decision-making involvement
Q3      | String                  | Currently evaluating providers?
Q4      | String                  | Website building method
Q5      | String                  | Number of websites managed
Q6      | String (pipe-separated) | Website purposes (multi-select)
Q7      | String (pipe-separated) | Top priorities (multi-select, max 3)
Q8-Q13  | String or NULL          | Conditional questions based on Q7 selections
Q14     | String (pipe-separated) | Extended features (multi-select, max 3)
Q15-Q21 | String or NULL          | Conditional questions based on Q14 selections
Q22     | String or NULL          | Agency/Marketing features (conditional)
Q23     | String (pipe-separated) | Discovery channels (multi-select, max 2)
Q24     | String                  | Monthly budget (WTP metric)
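When reading the file, the pipe-separated multi-select fields need splitting and the skipped conditional questions need mapping to a null value. The sketch below shows one way to do this with the standard library. It assumes skipped questions appear as either an empty cell or the literal text NULL, which the page does not specify; `clean_row` is a hypothetical helper and the sample answers are invented.

```python
import csv
import io

# Multi-select fields listed as pipe-separated in the data dictionary.
MULTI_SELECT = ("Q6", "Q7", "Q14", "Q23")


def clean_row(row):
    """Split pipe-separated fields into lists and map empty or NULL cells to None."""
    cleaned = {}
    for field, value in row.items():
        if value is None or value.strip() in ("", "NULL"):
            cleaned[field] = [] if field in MULTI_SELECT else None
        elif field in MULTI_SELECT:
            cleaned[field] = [part.strip() for part in value.split("|")]
        else:
            cleaned[field] = value.strip()
    return cleaned


# Hypothetical four-column excerpt; the answer texts are invented for illustration.
sample = io.StringIO(
    "Persona,Q7,Q8,Q24\n"
    "Hobbyist,Low price|Ease of use,NULL,$11-25/month\n"
)
rows = [clean_row(row) for row in csv.DictReader(sample)]
```

To process the real file, replace `sample` with a handle from `open("persona_web_hosting_data.csv", newline="")`.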