As developers and data analysts, throughout our professional
careers
we encounter the responsibility of managing information in systems of all kinds. We
well
know that information is a highly valuable asset for organizations.
When planning to conduct research, it is common for data scientists and other
professionals to share information and large datasets to carry out the analysis of
vast
amounts of information. Various tools and data repositories, such as Kaggle or
Google
Colab, facilitate this process.
However, due to the diversity in thinking and data structuring by individuals
contributing information in systems and research, maintaining a standard in a
straightforward manner becomes complex.
To address this challenge, we have decided to create dataset.expert. This framework
is
designed for data scientists, with the goal of facilitating the validation,
correction,
standardization, and control of data flow. Initially, the project will be available
in
TypeScript and Python, as the primary languages for data management. Nevertheless,
the
project is open source and available to the community, inviting everyone to
contribute
to foster and create a robust ecosystem. This will allow the establishment of
techniques
and standards from any language, improving the management and sharing of the
enormous
existing datasets.
We are blood brothers passionate about technology, data, and problem-solving. As brothers, we want to make a small contribution to the scientific and technological community with this project to make our lives much easier.
+10 years of experience on technology field, passionate for ML and AI
Datascientist for proffesion and a lover of mathemathics to solve problems with data
We are blood brothers passionate about technology, data, and problem-solving. As brothers, we want to make a small contribution to the scientific and technological community with this project to make our lives much easier.