Mitosheet default df renderer #1330

aarondr77 · 2024-09-11T21:43:50Z

Description

Makes Mito the default dataframe renderer in Jupyter. When a dataframe is left hanging at the end of a code cell (that is, it is the cell's last expression), it is automatically displayed in a mitosheet instead of the static pandas view, which only shows a subset of the data.

Testing

Use the Test Notebook.ipynb to try various approaches. Some things to look for:

This implementation should never overwrite existing code in the notebook.
If the mitosheet is unable to render, it should fall back to the default pandas dataframe viewer.
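
The fallback requirement can be sketched as a small wrapper that tries one renderer and uses another if the first throws. This is only an illustration: `Renderer`, `renderWithFallback`, `mitoRenderer`, and `pandasRenderer` are hypothetical names and do not come from this PR or from the Mito codebase.

```typescript
// Hypothetical renderer signature: takes a payload and returns markup, or throws.
type Renderer<T> = (data: T) => string;

// Try the primary renderer first; if it throws, use the fallback instead.
function renderWithFallback<T>(primary: Renderer<T>, fallback: Renderer<T>, data: T): string {
    try {
        return primary(data);
    } catch {
        // The user should still see the default view when the primary fails.
        return fallback(data);
    }
}

// Stand-ins for the Mito and pandas renderers (illustrative only).
const mitoRenderer: Renderer<string[]> = (rows) => {
    if (rows.length === 0) {
        throw new Error('nothing to render');
    }
    return `<mitosheet rows="${rows.length}">`;
};
const pandasRenderer: Renderer<string[]> = (rows) => `<table rows="${rows.length}">`;
```

In this sketch the failure is detected by an exception; a real renderer might instead need to detect a failed kernel call or a missing package.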

In addition, see the new frontend tests that ensure this behavior.

Documentation

Yes, we need to update the Mito docs.

aarondr77 · 2024-09-12T14:02:59Z

mitosheet/src/DataFrameMimeRenderer.tsx

+        let dataframeVariableName = undefined;
+        if (activeCellIndex) {
+            const previousCell = getCellAtIndex(cells, activeCellIndex - 1)
+            dataframeVariableName = getLastNonEmptyLine(getCellText(previousCell))


Test more cases, such as two dataframes as the output, non-dataframes as the output, etc.

Add UI tests for these.

Add tests for:

Tuple of dataframes
Series
Plotly graph
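
To make these cases concrete, here is a minimal sketch of reading a candidate variable name from a cell's last non-empty line. It is not the PR's `getLastNonEmptyLine`; `lastNonEmptyLine`, `candidateVariableName`, and the `IDENTIFIER` pattern are hypothetical and written only for this example.

```typescript
// Return the last line that contains non-whitespace text, if any.
function lastNonEmptyLine(cellText: string): string | undefined {
    const lines = cellText.split('\n').map((line) => line.trim());
    for (let i = lines.length - 1; i >= 0; i--) {
        if (lines[i] !== '') {
            return lines[i];
        }
    }
    return undefined;
}

// Accept only a bare ASCII identifier such as `df` or `sales_2024`.
// (Python also allows non-ASCII identifiers; this sketch ignores them.)
const IDENTIFIER = /^[A-Za-z_][A-Za-z0-9_]*$/;

// Return the last line only when it looks like a single variable name.
function candidateVariableName(cellText: string): string | undefined {
    const line = lastNonEmptyLine(cellText);
    return line !== undefined && IDENTIFIER.test(line) ? line : undefined;
}
```

A check like this rejects a tuple such as `df1, df2` or a call such as `fig.show()`, but a bare name can still refer to a Series or a Plotly figure, so the name alone does not tell us the output is a dataframe.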

aarondr77 · 2024-09-12T14:33:43Z

Make code gen work. Need to get the correct dataframe name.

aarondr77 · 2024-09-12T14:51:46Z

Using Enter to submit column renames does not work until you click somewhere else in the widget.

This might only happen on Chrome, but we need to fix it either way since Chrome is the most popular browser.

aarondr77 · 2024-09-12T14:54:27Z

Don't register analysis.

aarondr77 · 2024-09-12T14:54:37Z

Don't have email popup

aarondr77 · 2024-09-12T15:17:01Z

Question: Each time we add a mitosheet.sheet() call, the notebook size grows by about 2MB, but adding Mito as a default dataframe output only increases the size of the saved notebook by 2KB.

I think this is because the mime renderer's output doesn't actually get saved to the notebook (at least right now). But if I were to look at the size of the notebook while it is in use, it would still grow by about 2MB per Mito default dataframe output.

aarondr77 · 2024-10-01T16:11:26Z

Approaches tried using the mime renderer, documented for future reference

The challenging part is figuring out which dataframe to display in the mitosheet. To figure this out, we need to find the code cell that triggered this dataframe render and get the dataframe on its last line.

Finding the code cell is challenging, however. Below are a few options we tried and why they don't work.

Using the activeCellIndex: We cannot use the activeCellIndex to identify the cell because when running several cells in a row (for example, using Run All Cells) or when the code cell takes a few seconds to execute, the active cell in the notebook tracker updates before we're able to save it. As a result, we end up thinking the code cell that triggered the dataframe render is at the bottom of the notebook.
Using the execution count: We cannot use the execution count because it does not update until the mime renderer is created. When we run the code cell df, that cell is responsible for creating the mime renderer. As a result, when we search the cells for an execution count of, say, 3, the closest execution count we find is 2.
Using the DOM to find the corresponding code cell ID: This works as follows: 1) Render the default renderer so that we have a DOM element to start with. 2) Traverse up the DOM to find the code cell that triggered the dataframe render (the first code cell we find). 3) Get the code cell ID from the code cell's model. 4) Use the cell ID to find the input cell and read the dataframe name from it.
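
The DOM-based lookup in the last option can be sketched with a toy tree. This is a simplified model, not the PR's code: `NodeLike`, `findOwningCodeCellId`, and `findInputText` are hypothetical names, and the assumption that a node carries its cell ID directly is made only so the example runs without a browser.

```typescript
// Minimal stand-in for a DOM element: only the parent link matters here.
// `codeCellId` is set on nodes that represent a code cell (an assumption of
// this model; in JupyterLab the ID lives on the cell's model, not the DOM).
interface NodeLike {
    parent?: NodeLike;
    codeCellId?: string;
}

// Steps 2 and 3: walk up from the rendered output to the first enclosing code cell.
function findOwningCodeCellId(start: NodeLike): string | undefined {
    let node: NodeLike | undefined = start;
    while (node !== undefined) {
        if (node.codeCellId !== undefined) {
            return node.codeCellId;
        }
        node = node.parent;
    }
    return undefined;
}

// Step 4: use the ID to look up the cell's input text.
function findInputText(start: NodeLike, sourcesById: Map<string, string>): string | undefined {
    const cellId = findOwningCodeCellId(start);
    return cellId === undefined ? undefined : sourcesById.get(cellId);
}
```

Because the walk starts from the rendered output itself, it does not depend on which cell is active or on execution counts, which is what made this option look more promising than the first two.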

However, this didn't work for the reasons below.

Why using Mime Renderers did not work

There is still a race condition: if code cell 1 creates a dataframe renderer and code cell 2 edits the dataframe, the mitosheet output shows the dataframe's state after code cell 2 has run instead of its state at code cell 1.

This occurs because creating the mitosheet requires executing the mitosheet.sheet() function. To do so, we had to send a new kernel message from the mime renderer with the code mitosheet.sheet(df). However, because the kernel message queue might already have held additional messages that edited the df, by the time the mitosheet was rendered, it might have displayed a dataframe that reflected future code cell edits instead of the state of the dataframe at the time the code cell with the hanging df was executed. This is not what we want.
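
The ordering problem can be reproduced with a toy model of a kernel that processes messages strictly first-in, first-out. This is a simplified simulation, not Jupyter's messaging API: `ToyKernel`, `Message`, and `simulateRunAll` are hypothetical names, and the dataframe is modeled as a plain array.

```typescript
// A queued message mutates or reads the shared kernel state.
type Message = (state: Map<string, number[]>) => void;

// A toy kernel: executes queued messages strictly in FIFO order.
class ToyKernel {
    private queue: Message[] = [];
    readonly state = new Map<string, number[]>();

    send(message: Message): void {
        this.queue.push(message);
    }

    // Run one message; returns false when the queue is empty.
    step(): boolean {
        const next = this.queue.shift();
        if (next === undefined) {
            return false;
        }
        next(this.state);
        return true;
    }
}

function simulateRunAll(): number[] {
    const kernel = new ToyKernel();
    let rendered: number[] = [];

    // "Run all" queues both cells before either one executes.
    kernel.send((s) => { s.set('df', [1, 2, 3]); });    // cell 1: creates df, ends with `df`
    kernel.send((s) => { s.set('df', [1, 2, 3, 4]); }); // cell 2: edits df

    // Cell 1 executes. Only now does its renderer enqueue the sheet request,
    // which lands behind cell 2 in the queue.
    kernel.step();
    kernel.send((s) => { rendered = [...(s.get('df') ?? [])]; });

    // Drain the queue: cell 2 runs first, then the sheet request.
    while (kernel.step()) {
        // nothing else to do per step
    }
    return rendered;
}
```

In this model the sheet request observes `[1, 2, 3, 4]`, the state after cell 2, even though it was triggered by cell 1. Capturing the dataframe during cell 1's own execution avoids the extra queued message entirely.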

aarondr77

Another review

aarondr77 · 2024-10-01T18:11:22Z

mitosheet/src/plugin.tsx

+            let codeCell = getCellAtIndex(cells, mimeRenderInputCellIndex + 1)
+            const codeCellText = getCellText(codeCell);
+
+            if (codeCell === undefined) {


Turn this logic into a helper function and share it with the other code path that writes generated code.
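
One possible shape for such a helper is sketched below. It is an assumption about what the shared logic could look like, not the code in plugin.tsx: `CellLike` and `getCodeCellTextAtIndex` are hypothetical, and the sketch checks that the cell exists before reading its text.

```typescript
// Hypothetical minimal cell shape; the real notebook cell model is richer.
interface CellLike {
    type: 'code' | 'markdown';
    source: string;
}

// Return the source of the code cell at `index`, or undefined when the index
// is out of range or the cell at that position is not a code cell.
function getCodeCellTextAtIndex(cells: readonly CellLike[], index: number): string | undefined {
    if (index < 0 || index >= cells.length) {
        return undefined;
    }
    const cell = cells[index];
    if (cell === undefined || cell.type !== 'code') {
        return undefined;
    }
    return cell.source;
}
```

Callers can then treat `undefined` as "no usable code cell here" in one place instead of repeating the existence check at each call site.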

aarondr77 · 2024-10-03T17:52:26Z

Make sure print(df) still returns the correct result.

aarondr77 added 3 commits September 10, 2024 17:10

mitosheet: create a custom renderer

3e89bf9

mitosheet: render a mitosheet as dataframe output

ba8c0c7

mitosheet: render the dataframe in mito

37a0849

aarondr77 changed the base branch from dev to jupyterlab-4-manually September 11, 2024 21:44

vercel bot deployed to Preview September 11, 2024 21:44 View deployment

aarondr77 added 4 commits September 12, 2024 15:00

mitosheet: get code generation working from df mimerender sheet

16b762f

mitosheet: display default renderer if not dataframe

fbb3ac0

mitosheet: fix dev experience -- let watch commands run in parallel

6ec3f8a

mitosheet: don't overwrite code cell below mime render

e837fc9

vercel bot deployed to Preview September 12, 2024 19:40 View deployment

mitosheet: improve code gen stability for edge cases

0a837f2

vercel bot deployed to Preview September 12, 2024 20:44 View deployment

mitosheet: get args from code cell

f46604b

vercel bot deployed to Preview September 12, 2024 21:04 View deployment

mitosheet: remove uneeded code

95cce32

vercel bot deployed to Preview September 12, 2024 21:47 View deployment

mitosheet: cleanup

c6b5d04

vercel bot deployed to Preview September 13, 2024 13:57 View deployment

aarondr77 added 4 commits September 23, 2024 09:39

mitosheet: active cell and execution approach, not working

17e85dc

mitosheet: update approach to use DOM

450c3ff

mitosheet: document new approach

e8394e0

mitosheet: cleanup dataframe mime renderer

6b9d53f

vercel bot deployed to Preview September 23, 2024 15:31 View deployment

mitosheet: new approach -- overwriting dataframe ipython render

2ea1dfd

vercel bot deployed to Preview September 30, 2024 21:21 View deployment

mitosheet: automatically import mitosheet package

0e91ff1

vercel bot deployed to Preview October 1, 2024 15:52 View deployment

mitosheet: cleanup + document

8cfb536

vercel bot deployed to Preview October 1, 2024 16:22 View deployment

mitosheet: fix streamlit + cleanup

2280228

vercel bot deployed to Preview October 1, 2024 16:37 View deployment

mitosheet: address most of review

2142bf0

vercel bot deployed to Preview October 1, 2024 18:35 View deployment

mitosheet: remove unused imports

cdea1e8

vercel bot deployed to Preview October 2, 2024 14:57 View deployment

tests: add frontend test for df renderer

66d0171

vercel bot deployed to Preview October 2, 2024 21:17 View deployment

tests: more frontend tests

5688ee2

vercel bot deployed to Preview October 2, 2024 21:27 View deployment

mitosheet: fix linting errors

e539deb

vercel bot deployed to Preview October 2, 2024 21:31 View deployment

tests: improve flakyness

707d72e

vercel bot deployed to Preview October 3, 2024 19:33 View deployment
