A VS Code extension for visually exploring, cleaning, and transforming tabular data with automatic Pandas code generation.
Data Wrangler is a Visual Studio Code extension that provides a graphical interface for exploring, cleaning, and transforming tabular data. It allows users to visually manipulate datasets and automatically generates the corresponding Pandas code, streamlining data preparation tasks directly within the IDE.
Data scientists, analysts, and developers working with tabular data in Python who use VS Code or Jupyter Notebooks and want to accelerate data cleaning and exploration with a visual tool.
It uniquely combines an interactive UI for data manipulation with automatic code generation, reducing the manual effort of writing Pandas transformations while ensuring reproducibility and seamless integration into existing Python workflows.
Data Wrangler extension for Visual Studio Code
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Generates Pandas code in real-time as you apply UI transformations, making data cleaning reproducible and reducing manual coding effort, as highlighted in the README's editing mode features.
Integrates GitHub Copilot and FlashFill to suggest and apply transformations based on examples, speeding up repetitive tasks, as mentioned in the key features.
Launches directly from CSV files or Jupyter notebooks within VS Code, providing a unified workflow without context switching, as shown in the README's setup and opening sections.
Operates in a sandboxed mode where changes are previewed and not applied to the original data until explicitly exported, ensuring safe exploration as described in the setup.
Requires VS Code, Python 3.8+, and additional extensions like Jupyter and Python, creating a barrier for users outside this ecosystem, as noted in the setup instructions.
While it covers common operations, complex or niche data manipulations may still require writing custom Pandas code, as the built-in set is finite and focused on basic cleans.
Heavily tied to Microsoft tools (VS Code, GitHub Copilot), which might not suit teams using diverse or open-source-centric toolchains, limiting flexibility.