A Python library that enables conversational data analysis on SQL, CSV, and parquet files using LLMs and RAG.
PandasAI is a Python library that enables users to chat with their databases and data files using natural language. It allows both technical and non-technical users to perform data analysis by asking questions in plain English, leveraging LLMs and RAG to generate insights, visualizations, and answers directly from SQL, CSV, or parquet data sources.
Data analysts, data scientists, and business users who need to query and analyze data without writing complex SQL or Python code, as well as developers building conversational data interfaces.
It dramatically reduces the time and effort required for data exploration by providing an intuitive, chat-based interface that works across multiple data formats and supports advanced features like visualization generation and secure sandboxed execution.
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Allows users to query data in plain English, making analysis accessible to non-technical users, as shown in examples like df.chat('What is the average revenue by region?') returning direct answers.
Supports analyzing relationships across multiple DataFrames with a single query, demonstrated in the code example combining employees_df and salaries_df to find who gets paid the most.
Generates charts and plots directly from natural language requests, such as plotting a histogram with specified colors, reducing manual coding effort.
Provides Docker sandbox for isolated code execution, mitigating security risks, as described in the Docker Sandbox usage section for safe data handling.
Relies on LLM APIs like OpenAI, which requires internet access, API keys, and can incur ongoing costs, limiting offline use and increasing operational overhead.
Installation involves multiple packages (pandasai, pandasai-litellm, pandasai-docker) and configuration steps, adding complexity compared to straightforward pandas usage.
Queries are interpreted by LLMs, which can lead to misinterpretations or incorrect code generation, especially with ambiguous natural language, requiring manual verification.
A VSCode extension that allows you to use ChatGPT
ChatGPT integration with Unity Editor
ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API
Seamlessly integrate LLMs into scikit-learn.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.