A Java utility for loading on-premises data into Salesforce Einstein Analytics datasets with autoloading, dataflow control, and dataset inspection.
DatasetUtils is a Java-based utility designed to load on-premises data into Salesforce Einstein Analytics datasets. It provides features like autoloading, dataflow control, and dataset inspection to streamline data integration processes. The tool supports both command-line and server modes, making it versatile for different use cases.
Salesforce administrators and developers who need to integrate on-premises or external data sources with Einstein Analytics datasets. It is also suitable for data engineers managing data pipelines into Salesforce.
DatasetUtils offers a free, community-driven alternative to proprietary data loading tools for Einstein Analytics, with flexible deployment options and support for various data operations. Its ability to handle both large and small datasets with features like encoding detection and error logging makes it a practical choice for data integration tasks.
Friendly utility to load your on-prem data, whether large or small, to Einstein Analytics Datasets, with useful features such as autoloading, dataflow control and dataset inspection.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Supports multiple load operations like Overwrite, Upsert, Append, and Delete for CSV files, as detailed in the console mode parameters, enabling versatile data integration scenarios.
Offers both server mode with a Web UI for easy interaction and console mode for scripting automation, as shown in the running instructions, catering to different user preferences.
Automatically detects file encoding to prevent data corruption and allows downloading error logs for troubleshooting, based on the features list and usage examples like detectEncoding and downloadErrorFile actions.
Enables download and upload of dataset metadata (XMD) JSON files for customization, as per the action options like downloadxmd and uploadxmd, facilitating advanced dataset management.
The README explicitly states that the project is at end-of-life with deprecated code and old dependencies like Log4j 1.2, indicating no future maintenance or improvements.
Requires specific Java JDK versions from Zulu Open JDK and manual installation steps, which can be error-prone and time-consuming compared to more modern, containerized tools.
As a community project not supported by Salesforce, users cannot rely on official help, documentation, or guaranteed updates, as emphasized in the disclaimer, increasing risk for production use.