CSV Transformer
CautionTransforms, cleans, and converts data between CSV, JSON, Excel, and other tabular formats with column mapping, type casting, and validation.
Install
Claude Code
Copy the SKILL.md file to your project's .claude/skills/ directory About This Skill
CSV Transformer handles the messy reality of data formats. It converts between CSV, JSON, TSV, Excel, and other tabular formats while cleaning, normalizing, and validating data along the way. Whether you need a simple format conversion or a complex multi-file merge with column mapping, this skill gets it done.
How It Works
- Input analysis — Detects file encoding, delimiter, quoting style, and column types automatically
- Schema mapping — Maps source columns to target schema with rename, merge, split, and type cast operations
- Data cleaning — Normalizes dates, trims whitespace, handles nulls, deduplicates, and fixes encoding issues
- Transformation — Applies filters, aggregations, pivots, and computed columns
- Output generation — Writes clean data in the target format with proper escaping and encoding
Best For
- Converting between data formats for different tools and APIs
- Cleaning vendor-provided data dumps before database import
- Merging data exports from multiple sources into a unified format
- Quick data exploration and profiling of unfamiliar datasets
Encoding Support
Handles UTF-8, UTF-16, Latin-1, Shift-JIS, and GB2312 encodings. Automatically detects BOM markers and line ending styles (CRLF/LF).
Use Cases
- Convert CSV to JSON with nested object structure
- Clean and normalize messy data: dates, phone numbers, currencies
- Merge multiple CSV files with different column names into one schema
- Filter and aggregate rows based on complex conditions
Pros & Cons
Pros
- + Auto-detects encoding, delimiter, and column types
- + Handles multi-file merges with schema reconciliation
- + Supports complex transformations: pivot, aggregate, computed columns
Cons
- - Very large files (>1GB) may need streaming approach
- - Excel formatting (merged cells, formulas) may not fully preserve
Related Skills
Pandas Assistant
CautionOptimizes Python pandas workflows by writing efficient DataFrame operations, fixing common performance pitfalls, and converting between pandas, polars, and SQL.
Data Pipeline
CautionDesigns and implements ETL/ELT data pipelines using Python, SQL, and orchestration tools like Airflow, dbt, and Prefect for batch and streaming workflows.
Stay Updated on Agent Skills
Get weekly curated skills + safety alerts
每周精选 Skills + 安全预警