About the Project

Go to Repo on Github

Dataset.Expert

The framework for data scientists to validate, correct, standardize, and control data flow effectively

As developers and data analysts, throughout our professional careers we encounter the responsibility of managing information in systems of all kinds. We well know that information is a highly valuable asset for organizations.

When planning to conduct research, it is common for data scientists and other professionals to share information and large datasets to carry out the analysis of vast amounts of information. Various tools and data repositories, such as Kaggle or Google Colab, facilitate this process.

However, due to the diversity in thinking and data structuring by individuals contributing information in systems and research, maintaining a standard in a straightforward manner becomes complex.

To address this challenge, we have decided to create dataset.expert. This framework is designed for data scientists, with the goal of facilitating the validation, correction, standardization, and control of data flow. Initially, the project will be available in TypeScript and Python, as the primary languages for data management. Nevertheless, the project is open source and available to the community, inviting everyone to contribute to foster and create a robust ecosystem. This will allow the establishment of techniques and standards from any language, improving the management and sharing of the enormous existing datasets.

About the leaders

We are blood brothers passionate about technology, data, and problem-solving. As brothers, we want to make a small contribution to the scientific and technological community with this project to make our lives much easier.

Angel Ercik Cruz Olivera

FullStack Developer

+10 years of experience on technology field, passionate for ML and AI

Diana Laura Cruz Olivera

Data scientists

Datascientist for proffesion and a lover of mathemathics to solve problems with data

Other contributors

Jhon Doe

FullStack Developer

Jane doe

Data scientists

Jhon Doe

FullStack Developer

Jane doe

Data scientists

The project

dataset.expert