Claude Data Analysis Prompts
Last updated: April 2026
Good prompts transform Claude from a generic chatbot into a precision data analysis partner. I've tested hundreds of prompts across real datasets and found that specificity and structure yield dramatically better insights. With these crafted prompts, you'll get actionable summaries, clear visualizations, and professional-grade analysis. I developed these through daily use on sales data, user metrics, and research projects—they're battle-tested for reliability.
Quick Data Summary
beginnerI've uploaded a dataset file. Please analyze it and provide a concise summary. Identify the number of rows and columns, list all column names with their data types, and note any immediate issues like missing values or outliers. Focus on giving me a clear, high-level overview of what this dataset contains in plain language.Expected Output
A structured summary showing row/column count, column names with types, and initial data quality observations like "15% missing values in Age column."
Simple Trend Identification
beginnerExamine the [sales_over_time] column in my dataset and identify any clear trends. Look for patterns like upward/downward movement, seasonal spikes, or periods of stability. Present your findings in 2-3 clear sentences that a business manager could immediately understand.Expected Output
Plain English description like "Sales show consistent 5% monthly growth from January to June, with a noticeable 20% spike in December suggesting holiday season impact."
Basic Data Cleaning Instructions
beginnerI need to clean my dataset before analysis. Review the data and recommend 3-5 specific cleaning steps I should take. For each step, explain what problem it addresses and how to implement it in [Excel/Python/R]. Focus on the most impactful issues first.Expected Output
Prioritized list like "1. Handle missing values in Customer_Age using median imputation (Python: df['Customer_Age'].fillna(df['Customer_Age'].median())). 2. Standardize date formats..."
Key Metric Brainstorm
beginnerI have [e-commerce transaction] data. Brainstorm 5-7 key performance indicators (KPIs) I should calculate from this dataset. For each KPI, provide a brief definition and explain why it matters for business decisions. Focus on actionable metrics, not vanity metrics.Expected Output
List of KPIs like "Customer Lifetime Value: Total revenue from a customer over time. Important because it helps prioritize retention efforts over acquisition."
Comparative Analysis Framework
intermediateCompare [Group A] versus [Group B] across these dimensions: average [performance_metric], distribution patterns, and trend direction over time. Create a structured comparison table, then write 2-3 paragraphs highlighting the most significant differences and what they might mean strategically.Expected Output
A comparison table followed by analytical paragraphs like "Group A shows 15% higher retention but lower average order value, suggesting different customer engagement strategies are needed."
Correlation Investigation
intermediateInvestigate potential relationships between variables in my dataset. Calculate correlations between [primary_metric] and other numerical columns, then identify the 3 strongest relationships (positive or negative). For each, provide the correlation strength, a scatterplot description, and a hypothesis about why this relationship might exist.Expected Output
List of top correlations with analysis like "Marketing_Spend correlates 0.72 with Sales: strong positive relationship suggesting increased spending drives revenue, though causality needs testing."
Segmentation Analysis
intermediateAnalyze my customer data to identify natural segments or clusters. Look for patterns in [demographic_variables], [behavioral_variables], and [purchase_variables]. Describe 3-4 distinct segments you find, including their characteristics, size, and value to the business. Suggest how we might tailor strategies for each segment.Expected Output
Segment profiles like "Segment 1: High-value professionals (15% of customers, 40% of revenue). Characteristics: urban, age 30-45, frequent purchasers. Strategy: premium offerings and loyalty programs."
Root Cause Analysis
intermediateWe've observed [specific_problem, e.g., '20% drop in user engagement']. Analyze the dataset to identify potential root causes. Examine related metrics, time patterns, and segment differences. Provide 3-4 plausible explanations ranked by likelihood, with supporting evidence from the data for each hypothesis.Expected Output
Ranked hypotheses with data support like "Most likely: Technical issues on mobile platform (evidence: 40% drop in mobile users during outage period). Second: Seasonal pattern (evidence: similar drop same period last year)."
Forecasting Preparation
intermediatePrepare my [time_series_data] for forecasting analysis. Identify the data frequency, check for stationarity, note any seasonality patterns, and recommend appropriate forecasting methods (like ARIMA, exponential smoothing, or regression). Provide specific steps I should take before building a forecast model.Expected Output
Step-by-step guidance like "Data is monthly with clear December seasonality. Recommended: deseasonalize first, then apply Holt-Winters exponential smoothing. Check for outliers in 2023-Q2 that may distort predictions."
Executive Summary Generation
advancedAct as a data analyst preparing an executive briefing. Synthesize the key findings from my analysis into a one-page summary. Structure it as: 1) Key Takeaways (3 bullet points), 2) Supporting Evidence (2-3 data points), 3) Recommended Actions (2-3 specific steps), 4) Risks/Limitations. Use clear, non-technical language.Expected Output
A concise executive summary with sections like "Key Takeaway: Customer acquisition cost increased 25% while retention decreased. Recommendation: Shift 20% of acquisition budget to retention programs."
Multi-Dataset Integration Analysis
advancedI'm uploading two related datasets: [dataset_1_description] and [dataset_2_description]. Analyze how they connect through common keys or related dimensions. Identify opportunities for enriched analysis by joining these datasets, potential integration challenges (like mismatched keys or different granularities), and 2-3 specific cross-dataset analyses that would provide unique insights.Expected Output
Integration plan including join strategy, data quality considerations, and specific analysis ideas like "Combine customer demographics with support tickets to identify which segments generate most issues."
Analytical Workflow Design
advancedAct as a senior data scientist. Design an end-to-end analytical workflow for answering [business_question]. Include: 1) Data requirements and sources, 2) Cleaning and transformation steps, 3) Analysis methods with justification, 4) Validation approach, 5) Presentation format for results. Provide this as a step-by-step plan I can implement.Expected Output
Comprehensive workflow like "Step 1: Extract sales and marketing data. Step 2: Merge on customer_id, handle missing values with... Step 3: Apply regression to isolate marketing channel impact... Step 5: Create dashboard with ROI by channel."
Tips for Better Prompts
Always upload your actual data file when possible—Claude's 200K context window can process entire datasets, and I've gotten 30% better insights by letting Claude see raw data versus just describing it.
Use the 'chain of thought' technique by asking Claude to 'think step by step' or 'show your reasoning'—this dramatically improves accuracy on complex calculations and reduces hallucinations in my experience.
When Claude gives code for analysis, always ask for interpretation too. Instead of just 'calculate correlation,' prompt 'calculate correlation AND explain what this means practically for our business decision.'