← スキル一覧に戻る
rfm-customer-segmentation
liangdabiao / claude-data-analysis-ultra-main
⭐ 109🍴 20📅 2026年1月18日
Perform RFM (Recency, Frequency, Monetary) customer segmentation analysis on e-commerce data. Use when you need to analyze customer value, identify VIP customers, or create marketing segments. Automatically cleans data, calculates RFM metrics, applies K-means clustering, and generates visualization reports with Chinese language support.
read, write, bash, glob
SKILL.md
--- name: rfm-customer-segmentation description: Perform RFM (Recency, Frequency, Monetary) customer segmentation analysis on e-commerce data. Use when you need to analyze customer value, identify VIP customers, or create marketing segments. Automatically cleans data, calculates RFM metrics, applies K-means clustering, and generates visualization reports with Chinese language support. allowed-tools: Read, Write, Bash, Glob --- # RFM Customer Segmentation Analysis A comprehensive customer segmentation skill that automatically analyzes e-commerce transaction data to identify customer value segments using RFM (Recency, Frequency, Monetary) analysis with K-means clustering. ## Instructions ### 1. Data Analysis When users provide e-commerce data or ask about customer segmentation: - Load and validate the transaction data - Clean data by removing invalid orders (negative quantities, zero prices) - Calculate RFM metrics for each customer: - **Recency**: Days since last purchase - **Frequency**: Number of purchases - **Monetary**: Total purchase amount - Use K-means clustering on RFM dimensions - Automatically determine optimal number of clusters using elbow method ### 2. Customer Segmentation - Create customer value segments: High, Medium, Low value customers - Score each customer on RFM dimensions (1-3 scale) - Calculate overall customer value scores - Identify and rank VIP customers for marketing campaigns ### 3. Visualization and Reporting - Generate comprehensive customer segmentation dashboard - Create pie charts for segment distribution and revenue share - Build RFM scatter plots to visualize customer patterns - Generate box plots showing value distribution by segment - Export detailed CSV reports with VIP customer lists ### 4. Marketing Insights - Provide actionable marketing recommendations for each segment - Generate executive summary with key findings - Create customer activation strategies for different value tiers - Export VIP customer lists for targeted marketing campaigns ## Usage Examples ### Basic Customer Segmentation ``` Analyze these e-commerce orders and segment customers by value: [CSV data with order_id, user_id, purchase_date, quantity, unit_price] ``` ### VIP Customer Identification ``` Find the top 100 most valuable customers from our sales data for marketing campaign ``` ### Customer Value Analysis ``` Create a customer segmentation report showing revenue contribution by customer segment ``` ## Key Features - **Automatic Data Cleaning**: Handles Chinese e-commerce data formats, removes invalid orders - **Intelligent Clustering**: Uses elbow method to determine optimal cluster count - **Chinese Language Support**: Full support for Chinese field names and visualizations - **Comprehensive Reports**: Generates HTML reports, PNG dashboards, and CSV exports - **Marketing Ready**: Provides VIP customer lists and actionable insights ## File Requirements The skill works with e-commerce transaction data containing: - **user_id**: Customer identification code (用户码) - **order_date**: Purchase date (消费日期) - **quantity**: Order quantity (数量) - **unit_price**: Item unit price (单价) - **product_info**: Product details (optional) ## Output Files Generated - `customer_segments.csv`: Complete customer segmentation data - `vip_customers_list.csv`: Ranked VIP customer list for marketing - `segment_summary_statistics.csv`: Detailed statistics by segment - `customer_segmentation_dashboard.png`: Visual analytics dashboard - `data_validation_report.txt`: Data quality and analysis validation ## Dependencies - pandas, numpy for data processing - scikit-learn for K-means clustering - matplotlib, seaborn for visualization (with Chinese font support) - Standard Python libraries for file operations ## Best Practices - Ensure date fields are in consistent format (YYYY-MM-DD recommended) - Remove or handle missing values before analysis - Use sufficient data volume (1000+ orders recommended for reliable clustering) - Consider business context when interpreting segment results - Validate results with domain knowledge when possible