You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Revised the overview to reflect AI-assisted classification and harmonized datasets. Added support for dask, geospatial, and fixed-width files. Expanded the dependencies section to include Dask, Geopandas, Matplotlib, Torch, and several Python packages. Updated installation instructions and added a link to the documentation for more examples.
Package socio4health is an extraction, transformation, loading (ETL), and AI-assisted query and visualization (AI QV) tool designed to simplify the intricate process of collecting and merging data 📊 from multiple sources, focusing on sociodemographic and census datasets from Colombia, Brazil, and Peru, into a unified relational database structure.
25
+
Package socio4health is an extraction, transformation, loading (ETL), and AI-assisted classification tool designed to simplify the intricate process of collecting and merging data from multiple sources, focusing on sociodemographic and census datasets from Colombia, Brazil, and Peru, into a harmonized dataset.
26
26
</p>
27
27
28
28
- Seamlessly retrieve data from online data sources through web scraping, as well as from local files.
29
-
- Support for various data formats, including `.csv`, `.xlsx`, `.xls`, `.txt`, `.sav`, and compressed files, ensuring versatility in sourcing information.
30
-
- Consolidating extracted data into a pandas DataFrame.
31
-
- Consolidating transformed data into a cohesive relational database.
32
-
- Conduct precise queries and apply transformations to meet specific criteria.
29
+
- Support for various data formats, including `.csv`, `.xlsx`, `.xls`, `.txt`, `.sav`, fixed-width files and geospatial files, ensuring versatility in sourcing information.
30
+
- Consolidating extracted data into a pandas (or dask) DataFrame.
0 commit comments