In many cases, pandas is used within Python environments in an ad-hoc way: users submit pandas commands as needed. As these analyses mature, the ad-hoc programs make their way into scripts and processing pipelines. There is an opportunity to improve the performance of those pipelines if the processing 'plan' is known a priori. The goal of this project is to define a good format for such a processing plan and then to build an artifact that can extract processing plans from imperative-style Python programs.
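As a point of reference, one possible shape for such a plan is a small DAG of operation nodes, similar to a relational query plan. The sketch below is purely illustrative; the node class, field names, and operations are assumptions, not part of the project definition.

```python
# A minimal sketch of one possible processing-plan representation:
# each node records a pandas-level operation, its parameters, and its
# upstream inputs, so a whole pipeline becomes a DAG that an optimizer
# can inspect and reorder before execution. All names are hypothetical.
from dataclasses import dataclass, field
from typing import Any

@dataclass
class PlanNode:
    op: str                                                  # e.g. "read_csv", "merge", "groupby"
    args: dict[str, Any] = field(default_factory=dict)       # operation parameters
    inputs: list["PlanNode"] = field(default_factory=list)   # upstream plan nodes

# Example: roughly SELECT city, AVG(price) FROM listings GROUP BY city
scan = PlanNode("read_csv", {"path": "listings.csv"})
grouped = PlanNode("groupby", {"by": "city"}, inputs=[scan])
plan = PlanNode("agg", {"price": "mean"}, inputs=[grouped])
```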
Pandas pipelines, even within a single file or IPython notebook, can be rather complicated and intermixed with calls to other libraries that read data from and write data to series and dataframes. In this project, we wish to define a wrapper around the pandas library. It should capture accesses to data, both reads and writes, as well as operations on data such as group-by, join, merge, and selection. When the data are actually requested, the relevant operations should be identified, sent to an optimizer for sequencing/transformation, and the result then returned.
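The following is a minimal sketch of what such a wrapper could look like, assuming a lazy-evaluation design; LazyFrame, the collect() trigger, and the optimizer hook are hypothetical names, not an existing API. Method calls are recorded rather than executed, and only when the user asks for the data is the recorded plan handed to an optimizer and replayed against real pandas.

```python
# Hypothetical lazy wrapper around pandas (illustrative only): operations
# are captured as a plan and deferred until the data is requested.
import pandas as pd

class LazyFrame:
    def __init__(self, source, ops=None):
        self.source = source          # path of the input data
        self.ops = ops or []          # recorded (method, args, kwargs) tuples

    def __getattr__(self, name):
        # Record any pandas method call instead of running it immediately.
        def defer(*args, **kwargs):
            return LazyFrame(self.source, self.ops + [(name, args, kwargs)])
        return defer

    def collect(self, optimize=lambda ops: ops):
        # Hand the recorded plan to a pluggable optimizer, then execute it.
        df = pd.read_csv(self.source)
        for name, args, kwargs in optimize(self.ops):
            df = getattr(df, name)(*args, **kwargs)
        return df

# Usage: nothing touches pandas until collect() is called.
result = (LazyFrame("listings.csv")
          .query("price > 100")
          .sort_values("price")
          .collect())
```

A design along these lines keeps the user-facing code imperative in style while exposing the full sequence of operations to the optimizer before any data is materialized.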
https://pandas.pydata.org/pandas-docs/stable/getting_started/comparison/comparison_with_sql.html