added ability to use config file to shard vicuna #1565
Conversation
43a6a4a to cd270d5
@dan-garvey / @PhaneeshB please review / merge
    config_json = json.load(config_file)
    config_file.close()
else:
    config_json = None
load config if given
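The conditional load above can be sketched as a small helper; a minimal sketch, assuming a `load_config` name that is not from the PR itself (the PR opens and closes the file explicitly rather than using a context manager):

```python
import json

def load_config(config_path):
    """Load the sharding config as JSON if a path was given, else return None.

    Hypothetical helper illustrating the pattern in the diff above;
    the argument is assumed to be a filesystem path or None.
    """
    if config_path is None:
        return None
    with open(config_path) as config_file:  # closed automatically on exit
        return json.load(config_file)
```

Using a `with` block avoids the explicit `config_file.close()` call while preserving the same behavior.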
) -> None:
    super().__init__(model_name, hf_model_path, max_num_tokens)
    self.max_sequence_length = 256
    self.device = device
    self.precision = precision
    self.tokenizer = self.get_tokenizer()
    self.config = config_json
Give the Vicuna class a config property so device indices can be accessed.
    idx_votes[int(self.config[key]["gpu"])] = 1
device_idx = max(idx_votes, key=idx_votes.get)
return device_idx
Define a function to extract the device index from the config file. The config stores the model in its most granular state, so this function looks up the device used for every layer in the shard and takes a majority vote if different devices are used throughout the shard.
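The majority-vote idea can be sketched with `collections.Counter`; the function name and config shape below are assumptions inferred from the `self.config[key]["gpu"]` access in the diff, not the PR's actual code. Note that accumulating counts (`+= 1` semantics, which `Counter` provides) rather than the plain `= 1` assignment shown in the diff is what makes the vote meaningful when layers disagree:

```python
from collections import Counter

def get_device_index(config):
    """Return the device index used by the majority of layers in the shard.

    Hypothetical sketch: `config` is assumed to map layer names to
    entries like {"gpu": 0}.
    """
    if config is None:
        return None
    # One vote per layer; Counter accumulates repeated device indices.
    votes = Counter(int(entry["gpu"]) for entry in config.values())
    # most_common(1) returns [(device_idx, count)] for the top vote-getter.
    return votes.most_common(1)[0][0]
```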
Elias, your comments are really good. But you should put them in the code instead of the review!
module = SharkInference(
    mlirs[idx],
    device=device,
-   device_idx=idx % 1,
+   device_idx=device_idx,
Change the device index to use the config file instead of defaulting to 0 (the old `idx % 1` expression is always 0).
Happy to give another review if you want to add some of your comments to the code =)
cd270d5 to cb96194
I added comments. Also, I realized that the embedding and decoding layers weren't configurable, so I added functionality for that.
You can now pass a config file generated with github.com/nod-ai/SHARK/blob/main/shark/shark_generate_model_config.py.
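For reference, a hedged guess at the shape such a config file might take, inferred only from the `self.config[key]["gpu"]` accesses in the diff; the layer names and exact structure are illustrative, not taken from the generator script:

```json
{
  "embedding": { "gpu": 0 },
  "layer0": { "gpu": 0 },
  "layer1": { "gpu": 1 },
  "decoding": { "gpu": 1 }
}
```

Each entry assigns one layer of the shard to a device index, which the majority-vote logic then collapses into a single `device_idx` per shard.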