tidyextractors

tidyextractors is a Python package that makes extracting data from supported sources (e.g. email mbox files, source code log files) as painless as possible, delivering you a populated Pandas DataFrame in just a few lines of code.

About tidyextractors

tidyextractors was inspired by Hadley Wickham’s (2014) paper, which introduces “tidy data” as a conceptual framework for data preparation.
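To make the "tidy" idea concrete, here is a minimal sketch of the kind of extraction the package aims to automate. It deliberately does not use the tidyextractors API (see the docs for that); it relies only on Python's standard-library `mailbox` module. The helper names `mbox_to_rows` and `build_sample_mbox`, the sample addresses, and the choice of columns are all hypothetical and exist only for illustration.

```python
import mailbox
import os
import tempfile
from email.message import EmailMessage


def build_sample_mbox(path):
    """Write a tiny two-message mbox file (hypothetical sample data)."""
    box = mailbox.mbox(path)  # creates the file if it does not exist
    try:
        samples = [
            ("ann@example.com", "Hello"),
            ("bob@example.com", "Re: Hello"),
        ]
        for sender, subject in samples:
            msg = EmailMessage()
            msg["From"] = sender
            msg["To"] = "list@example.com"
            msg["Subject"] = subject
            msg.set_content("Body text")
            box.add(msg)
        box.flush()
    finally:
        box.close()


def mbox_to_rows(path):
    """Return one dict per message: one row per email, one key per variable."""
    rows = []
    box = mailbox.mbox(path)
    try:
        for message in box:
            rows.append({
                "from": message["From"],
                "to": message["To"],
                "subject": message["Subject"],
            })
    finally:
        box.close()
    return rows


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        sample_path = os.path.join(tmp, "sample.mbox")
        build_sample_mbox(sample_path)
        for row in mbox_to_rows(sample_path):
            print(row)
```

Each dict is one observation (an email) and each key is one variable, so the list can be handed directly to `pandas.DataFrame(rows)` if pandas is installed.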

Features

  • Extracts data with minimal effort.
  • Creates readable code that requires minimal explanation.
  • Exports Pandas DataFrames to maximize compatibility with the Python data science ecosystem.

Currently implemented data sources

Install

pip3 install tidyextractors

Docs

See the tidyextractors docs for more information, including code examples, API reference, and general documentation.

Project members: 
Designer and Developer
Last updated: February 05, 2019
