Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: restructure existing modules, and add Column, Expression, and Functions modules #1

Merged
merged 3 commits into from
Feb 28, 2024

Conversation

sjrusso8
Copy link
Owner

@sjrusso8 sjrusso8 commented Feb 28, 2024

Description

feat: restructure existing modules, and add Column, Expression, and Functions modules

  • refactor existing modules to mirror the pyspark connector layout
  • move logic plan creation into the plan module and out of the dataframe module
  • create an expression module to handle Expression creation
  • create Column struct as a representation of a Column object
    • add traits for alias and name
  • create functions module containing the first workings of pyspark functions
    • add functions for col, pi, and coalesce
  • breaking change the select() trait now only excepts Vec<Column>
    • tests and examples are updated to match the change

Documentation

Pyspark Connector

 - move logic out of dataframe and into plan
 - move all nested modules into their own top level file
 - add logic to implement sort & orderby
- updated select to take a Vec<Column>
- updated select to handle * colmns
- created initial functions
@sjrusso8 sjrusso8 changed the title Feat/refactor feat: restructure modules, add Column, Expression, and Functions Feb 28, 2024
@sjrusso8 sjrusso8 changed the title feat: restructure modules, add Column, Expression, and Functions feat: restructure existing modules, and add Column, Expression, and Functions modules Feb 28, 2024
@sjrusso8 sjrusso8 marked this pull request as ready for review February 28, 2024 13:36
@sjrusso8 sjrusso8 self-assigned this Feb 28, 2024
@sjrusso8 sjrusso8 merged commit faf2d8a into main Feb 28, 2024
3 checks passed
@sjrusso8 sjrusso8 deleted the feat/refactor branch February 28, 2024 20:48
sjrusso8 added a commit that referenced this pull request Mar 17, 2024
…unctions modules (#1)

**feat**: restructure existing modules, and add Column, Expression, and Functions modules

* refactor existing modules to mirror the pyspark connector layout
* move logic plan creation into the `plan` module and out of the `dataframe` module
* create an `expression` module to handle `Expression` creation
* create `Column` struct as a representation of a `Column` object
     * add traits for `alias` and `name`
* create `functions` module containing the first workings of pyspark functions 
     * add functions for `col`, `pi`, and `coalesce` 
* **breaking change** the `select()` trait now only excepts `Vec<Column>`
     * tests and examples are updated to match the change
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant