Adding Directlake options for collecting metrics #118

dgosbell · 2024-03-20T23:32:34Z

Added 3 different modes for direct lake models so that we do not always force all columns to be transcoded into memory.

ResidentOnly - only run the distinctcount queries on columns that are current resident in memory
Referenced - run distinctcount queries on columns that are referenced by measures or relationships. This should provide the minimum information required for performance analysis of measures and for tools like DAX Optimizer.
Full - run distinctcount queries on all columns

I've added this option since the default behaviour at the moment is the same as Full and this can be detrimental to large models. It can take longer than necessary and potentially may still not produce the desired result since as we cycle through all the columns it could force some of the earlier columns to be paged out of memory.

I've set ResidentOnly as the default as it's the cheapest/fastest option.

Note that if an option other than ResidentOnly is chosen I am re-running the DMV collection after the stats collection has been run since the memory usage is usually changed as a result of running the distinctcount queries.

I did consider whether to add the selected Direct Lake analysis mode as a property to the vpax file, but I decided against his for the time being since it can be implied by checking the IsResident property on a column.

src/Directory.Packages.props

src/Dax.Model.Extractor/StatExtractor.cs

Co-authored-by: Alberto Spelta <[email protected]>

dgosbell added 2 commits March 21, 2024 09:28

Adding DirectLake analysis options

a911faf

Merge remote-tracking branch 'upstream/master' into directlake-updates

9f33efc

dgosbell requested a review from albertospelta as a code owner March 20, 2024 23:32

albertospelta linked an issue Mar 21, 2024 that may be closed by this pull request

direct lake - columns into memory #115

Open

albertospelta requested changes Mar 21, 2024

View reviewed changes

src/Directory.Packages.props Outdated Show resolved Hide resolved

src/Dax.Model.Extractor/StatExtractor.cs Outdated Show resolved Hide resolved

dgosbell and others added 2 commits March 22, 2024 08:55

Correcting DQ analysis comment

42facfe

Co-authored-by: Alberto Spelta <[email protected]>

Using MicrosoftAnalysisServicesVersion variable for package references

5c05000

Co-authored-by: Alberto Spelta <[email protected]>

albertospelta approved these changes Mar 22, 2024

View reviewed changes

albertospelta merged commit 1fa32d3 into sql-bi:master Mar 22, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding Directlake options for collecting metrics #118

Adding Directlake options for collecting metrics #118

dgosbell commented Mar 20, 2024

Adding Directlake options for collecting metrics #118

Adding Directlake options for collecting metrics #118

Conversation

dgosbell commented Mar 20, 2024