forked from KhronosGroup/SPIRV-Tools
-
Notifications
You must be signed in to change notification settings - Fork 0
/
CHANGES
1331 lines (1289 loc) · 64.6 KB
/
CHANGES
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
Revision history for SPIRV-Tools
v2021.3-dev 2021-06-22
- Start v2021.3-dev
v2021.2 2021-06-18
- General
- Support SPV_KHR_subgroup_uniform_control_flow (#4318)
- Support Intel extensions for fixed point and hls-float (#4321)
- Fix crash when optimizing shaders with DebugPrintf (#4280)
- Validator
- Support Vulkan Storage Class for Execution Model (#4212)
- Optimizer
- Handle SPV_KHR_vulkan_memory_model in dead-code elimination (#4320)
- Support folding OpBitcast with numeric constants (#4247)
- Fuzz
- Add tests for MaybeGet* functions in fuzzerutil (#4284)
- Fix OutlineFunction in presence of unreachable blocks (#4308)
- Fix def-use update in PermutePhiOperands (#4309)
- Swap positions of two functions in a module (#4236)
v2021.1 2021-04-19
- General
- Support SPV_KHR_linkonce_odr, SPV_KHR_expect_assume (#4161)
- Fixes for the vscode language server extension (#4150)
- Validator
- Add validation for SPV_EXT_shader_atomic_float_min_max (#4105)
- Add Vulkan Execution Scope checks (#4183)
- Vulkan 64-bit OpAtomicStore check (#4163)
- Optimizer
- Add interpolate legalization pass (#4220)
- Fuzz
- Various performance optimizations
- Do not add too many dead blocks (#4217)
- Add WGSL compatibility flag to context (#4193)
- Add persistent state to the fuzzer (#4137)
v2020.7 2021-02-16
- General
- Support pending Intel extensions (#4116)
- Remove WebGPU support (#4108)
- Validator
- Vulkan image gather constant component (#4133)
- Add Vulkan PSB64 convert VUID (#4122)
- Validate SPV_KHR_workgroup_memory_explicit_layout (#4128)
- Validate VK_KHR_zero_initialize_workgroup_memory (#4124)
- Add Vulkan image gather offset VUID (#4118)
- Label Vulkan atomic semantics VUIDs (#4120)
- Label VUID 04662 (#4123)
- Label VUID 04683 (#4121)
- Add Vulkan EXT builtins (#4115)
- Validate Sampled=1 for Vulkan ImageQuerySizeLod, ImageQueryLevels, ImageQueryLod (#4103)
- Add Vulkan Memory Scope VUs (#4106)
- Add Vulkan Addressing Model check (#4107)
- Vulkan atomic storage class (#4079)
- Label standalone Vulkan VUID (#4091)
- Add Vulkan decroation VUID (#4090)
- Add Vulkan FP Mode VUID (#4088)
- Fix Vulkan image sampled check (#4085)
- Add Vulkan ForwardPointer VUID (#4089)
- Add Vulkan ImageTexelPointer format check (#4087)
- Add Vulkan Group Operation VUID (#4086)
- Add first StandAlone VUID 04633 (#4077)
- Add Subgroup VUIDs (#4074)
- validate return type of OpImageRead (#4072)
- tighter validation of multisampled images (#4059)
- validate OpTypeImage Sampled values for environemnts (#4064)
- validate StorageImageMultisampled capability (#4062)
- Add last TessLevelOuter and TessLevelInner VUID (#4055)
- Add last ClipDistance and CullDistance VUID (#4054)
- Add last ViewportIndex and Layer VUID (#4053)
- Add last Position VUID (#4052)
- Allow forward pointer to be used in types generally (#4044)
- Optimizer
- Mark module as modified if convert-to-half removes decorations (#4127)
- Fix binding number calculation in desc sroa (#4095)
- Run DCE when SPV_KHR_shader_clock is used (#4049)
- Debug Info
- Set correct scope and line info for DebugValue (#4125)
- Avoid integrity check failures caused by propagating line instructions (#4096)
- Linker
- Linker usability improvements (#4084)
- Instrumentation
- Generate differentiated error codes for buffer oob checking (#4097)
- Fuzz
- Fix OpPhi handling in DuplicateRegionWithSelection (#4065)
v2020.6 2020-12-07
- General
CMake: Add SPIRV_TOOLS_BUILD_STATIC flag (#3910)
- Disassembler
Add some context comments to disassembly. (#3847)
- Optimizer
- Take new (raytracing) termination instructions into account. (#4050)
- Do run DCE if SPV_KHR_ray_query is used. (#4047)
- Handle 8-bit index in elim dead member (#4043)
- Add texel buffer out-of-bounds checking instrumentation (#4038)
- Update MeshShadingNV dependencies (and land Ray tracing updates) (#4028)
- Fix buffer oob instrumentation for matrix refs (#4025)
- Fix SSA re-writing in the presence of variable pointers. (#4010)
- Add support to prevent functions from being inlined if they have
DontInline flag (#3858)
- Add SPV_EXT_shader_image_int64 (#3852)
- Support SPV_KHR_fragment_shading_rate (#3943)
- Fix use-after-move in val/validate.cpp (#3848)
- Debug Info
- properly preserve DebugValue indexes operand (#4022)
- Add DebugValue for invisible store in single_store_elim (#4002)
- Propagate OpLine to all applied instructions in spirv-opt (#3951)
- Add DebugValue for DebugDecl invisible to value assignment (#3973)
- Add DebugValue for function param regardless of scope (#3923)
- Debug info preservation in convert-local-access-chains pass (#3835)
- Debug info preservation in redundancy-elimination pass (#3839)
- Debug info preservation in if-conversion pass (#3861)
- Validator
- Add validation support for the ray tracing built-in variables (#4041)
- Use less stack space when validating Vulkan builtins (#4019)
- Fix SPV_KHR_fragment_shading_rate VUID label (#4014)
- Label Layer and ViewportIndex VUIDs (#4013)
- Allow the ViewportIndex and Layer built-ins on SPIR-V 1.5 (#3986)
- Fix validation of OpPhi instructions (#3919)
- Fuzz
- Fix facts arising from CompositeConstruct (#4034)
- Do not flatten conditionals that create synonyms (#4030)
- Add support for reining in rogue fuzzer passes (#3987)
- Fix assertion failure in FuzzerPassAddCompositeExtract (#3995)
- Fix invalid equation facts (#4009)
- Fix bugs in TransformationFlattenConditionalBranch (#4006)
- Fix bug related to transformation applicability (#3990)
- Add expand vector reduction transformation (#3869)
- Add FuzzerPassAddCompositeExtract (#3904)
- Fix mismatch with shrinker step limit (#3985)
- Fix off-by-one error in replayer (#3982)
- Get order right for OpSelect arguments (#3974)
- Do not add synonym-creating loops in dead blocks (#3975)
- Skip OpTypeSampledImage when propagating up (#3976)
- Pass OpUndef in function call if needed (#3978)
- Fix off-by-one in TransformationCompositeConstruct (#3979)
- Tolerate absent ids in data synonym fact management (#3966)
- Fix to id availability (#3971)
- Fix operand types (#3962)
- Don't flatten conditional if condition is irrelevant (#3944)
- Do not produce OpPhis of type OpTypeSampledImage (#3964)
- Restrict fuzzer pass to reachable blocks (#3970)
- Handle more types when extending OpPhi instructions (#3969)
- Skip early terminator wrappers when merging returns (#3968)
- Avoid irrelevant constants in synonym-creating loops (#3967)
- Skip dead blocks in FuzzerPassAddOpPhiSynonyms (#3965)
- Avoid the type manager when looking for struct types (#3963)
- Fix to TransformationDuplicateRegionWithSelection (#3941)
- Skip OpFunction when replacing irrelevant ids (#3932)
- Use component-wise selectors when flattening conditional branches (#3921)
- Avoid void struct member when outlining functions (#3936)
- Do not allow Block-decorated structs when adding parameters (#3931)
- Fix to operand id type (#3937)
- Handle dead blocks in TransformationEquationInstruction (#3933)
- Do not allow sampled image load when flattening conditionals (#3930)
- Take care of OpPhi instructions when inlining (#3939)
- Fix to TransformationInlineFunction (#3913)
- Wrap early terminators before merging returns (#3925)
- Lower probability of adding bit instruction synonyms (#3917)
- Fix handling of OpPhi in FlattenConditionalBranch (#3916)
- Avoid creating blocks without parents (#3908)
- Do not allow creation of constants of block-decorated structs (#3903)
- Fixes related to irrelevant ids (#3901)
- Fix to transformation that adds a synonym via a loop (#3898)
- Fix to duplicate region with selection (#3896)
- Do not expose synonym facts for non-existent ids (#3891)
- Do not add synonyms involving irrelevant ids (#3890)
- Do not replace irrelevant ids that are not in blocks (#3892)
- Wrap OpKill and similar in function calls (#3884)
- Integrate spirv-reduce with shrinker (#3849)
- Report fresh ids in transformations (#3856)
- Support OpNot bit instruction case (#3841)
- Return IR and transformation context after replay (#3846)
v2020.5 2020-09-22
- General
- Enable building with BUILD_SHARED_LIBS=1 (#3490)
- Avoid using /MP4 for clang on windows. (#3662)
- Fix compiler error on macOS with XCode12. (#3836)
- Optimizer
- Preserve OpenCL.DebugInfo.100 through private-to-local pass (#3571)
- Preserve debug info in scalar replacement pass (#3461)
- Debug info preservation in loop-unroll pass (#3548)
- Preserve debug info in dead-insert-elim pass (#3652)
- Improve non-semantic instruction handling in the optimizer (#3693)
- Let ADCE pass check DebugScope (#3703)
- Add undef for inlined void function (#3720)
- Fix SSA-rewrite to remove DebugDeclare for variables without loads (#3719)
- Handle DebugScope in compact-ids pass (#3724)
- Add buffer oob check to bindless instrumentation (#3800)
- Validator
- Update OpenCL capabilities validation (#3149)
- Validator support for non-semantic clspv reflection (#3618)
- OpenCL.DebugInfo.100 DebugTypeArray with variable size (#3549)
- Only validation locations for appropriate execution models (#3656)
- Validate more OpenCL.DebugInfo.100 instructions (#3684)
- Allow DebugTypeTemplate for Type operand (#3702)
- spirv-val: Add Vulkan VUID labels to BuiltIn (#3756)
- Allow SPV_KHR_8bit_storage extension. (#3780)
- Validate SPIRV Version number when parsing binary header (#3834)
- Reduce
- Support reducing a specific function (#3774)
- Fuzz
- adds TransformationReplaceCopyObjectWithStoreLoad (#3567)
- adds TransformationReplaceCopyMemoryWithLoadStore (#3575)
- adds TransformationReplaceLoadStoreWithCopyMemory (#3586)
- Implement the OpOuterProduct linear algebra case (#3617)
- Pass to replace int operands with ints of opposite signedness (#3612)
- TransformationMoveInstructionDown (#3477)
- Add TransformationMakeVectorOperationDynamic (#3597)
- TransformationReplaceAddSubMulWithCarryingExtended (#3598)
- FuzzerPassPropagateInstructionsUp (#3478)
- add FuzzerPassAddCompositeInserts (#3606)
- Add inline function transformation (#3517)
- Transformation to replace the use of an irrelevant id (#3697)
- Add SPIRV_FUZZ_PROTOC_COMMAND (#3789)
- Add TransformationDuplicateRegionWithSelection (#3773)
- Transformation to flatten conditional branch (#3667)
- Handle OpPhis in TransformationInlineFunction (#3833)
- Create synonym of int constant using a loop (#3790)
- Support dead blocks in TransformationAddSynonym (#3832)
- Linker
v2020.4 2020-07-22
- General
- Changed variable names to be more descriptive (#3433)
- Add support to GPU-AV instrumentation for Task and Mesh shaders (#3512)
- Permit Simple and GLSL450 memory model in WEBGPU_0 (#3463)
- Support SPV_KHR_terminate_invocation (#3568)
- Optimizer
- Preserving debug information in optimizations
(#3389,#3420,#3425,#3356,#3459,#3444,#3492,#3451,#3497i,#3498,#3542)
- Eliminate branches with condition of OpConstantNull (#3438)
- Use structured order to unroll loops. (#3443)
- Updated desc_sroa to support flattening structures (#3448)
- Support OpCompositeExtract pattern in desc_sroa (#3456)
- Fix ADCE pass bug for mulitple entries (#3470)
- Sink pointer instructions in merge return (#3569)
- Validator
- Validate location assignments (#3308)
- Fix reachability in the validator (#3541)
- Reduce
- Fuzz
- Add support for OpSpecConstant* (#3373)
- Add replace linear algebra instruction transformation (#3402)
- Implement vector shuffle fuzzer pass (#3412)
- Swap operands in OpBranchConditional (#3423)
- Permute OpPhi instruction operands (#3421)
- Add FuzzerPassAddCopyMemoryInstructions (#3391)
- TransformationInvertComparisonOperator (#3475)
- Add variables with workgroup storage class (#3485)
- Add image sample unused components transformation (#3439)
- TransformationReplaceParameterWithGlobal (#3434)
- Support adding dead break from back-edge block (#3519)
- Fuzzer pass to interchange zero-like constants (#3524)
- Linker
v2020.3 2020-05-27
- General
- Prevent Effcee from installing things when building spirv-tools with testing enabled (#3256)
- Update acorn version (#3294)
- If SPIRV-Headers is in our tree, include it as subproject (#3299)
- allow cross compiling for Windows Store, UWP, etc. (#3330)
- Optimizer
- Remove deprecated interfaces from instrument passes (#3361)
- Preserve debug info in inline pass (#3349)
- Handle more cases in dead member elim (#3289)
- Preserve debug info in eliminate-dead-functions (#3251)
- Fix Struct CFG analysis for single block loop (#3293)
- Add tests for recently added command line option (#3297)
- Consider sampled images as read-only storage (#3295)
- Allow various validation options to be passed to spirv-opt (#3314)
- Add debug information analysis (#3305)
- Preserve debug info for wrap-opkill (#3331)
- refactor inlining pass (#3328)
- Add unrolling to performance passes (#3082)
- Validator
- Add validation support for ImageGatherBiasLodAMD (#3363)
- Validate ShaderCallKHR memory scope (#3332)
- Validate Buffer and BufferBlock apply only to struct types (#3259)
- Reduce
- increase default step limit (#3327)
- Remove unused uniforms and similar (#3321)
- Fuzz
- Add support for StorageBuffer (#3348)
- Add validator options (#3254)
- Limit adding of new variables to 'basic' types (#3257)
- Transformation to add OpConstantNull (#3273)
- Handling of more fuzzing opportunities (#3277, #3280, #3281, #3290, #3292)
- Respect rules for OpSampledImage (#3287)
- Do not outline regions that produce pointer outputs (#3291)
- Linker
v2020.2 2020-03-26
- General:
- Support extended instructions in the vscode language server
- Make spvOpcodeString part of the public API (#3174)
- Added guide to writing a spirv-fuzz fuzzer pass (#3190)
- Add support for KHR_ray_{query,tracing} extensions (#3235)
- Optimizer
- Debug Printf support (#3215)
- Add data structure for DebugScope, DebugDeclare in spirv-opt (#3183)
- Fix identification of Vulkan images and buffers (#3253)
- Validator
- Add support for SPV_AMD_shader_image_load_store_lod (#3186)
- Add validation rules for OpenCL.DebugInfo.100 extension (#3133)
- Adding WebGPU specific Workgroup scope rule (#3204)
- Disallow phis of images, samplers and sampled images (#3246)
- Reduce
- Fuzz
- Fuzzer passes to add local and global variables (#3175)
- Add fuzzer passes to add loads/stores (#3176)
- Fuzzer pass to add function calls (#3178)
- Fuzzer pass that adds access chains (#3182)
- Fuzzer pass to add equation instructions (#3202)
- Add swap commutable operands transformation (#3205)
- Add fuzzer pass to permute function parameters (#3212)
- Allow OpPhi operand to be replaced with a composite synonym (#3221)
- Linker
v2020.1 2020-02-03
- General:
- Add support for SPV_KHR_non_semantic_info (#3110)
- Support OpenCL.DebugInfo.100 extended instruction set (#3080)
- Added support for Vulkan 1.2
- Add API function to better handle getting the necessary environment (#3142)
- Clarify mapping of target env to SPIR-V version (#3150)
- Implement constant folding for many transcendentals (#3166)
- Optimizer
- Change default version for CreatInstBindlessCheckPass to 2 (#3096, #3119)
- Better handling of OpLine on merge blocks (#3130)
- Use placeholder switch instead of placeholder loop in MergeReturn pass. (#3151)
- Handle TimeAMD in AmdExtensionToKhrPass. (#3168)
- Validator
- Fix structured exit validation (#3141)
- Reduce
- Fuzz
- Fuzzer pass to merge blocks (#3097)
- Transformation to add a new function to a module (#3114)
- Add fuzzer pass to perform module donation (#3117)
- Fuzzer passes to create and branch to new dead blocks (#3135)
- Fuzzer pass to add composite types (#3171)
- Linker:
- Remove names and decorations of imported symbols (#3081)
v2019.5 2019-12-11
- General:
- Export SPIRV-Tools targets on installation
- SPIRV-Tools support for SPIR-V 1.5 (#2865)
- Add WebGPU SPIR-V Assembler in JavaScript. (#2876)
- Add Bazel build configuration. (#2891)
- Add support for building with emscripten (#2948)
- Update SPIR-V binary header test for SPIR-V 1.5 (#2967)
- Add fuzzer for spirv-as call path (#2976)
- Improved CMake install step. (#2963)
- Add fuzzer for spirv-dis call path (#2977)
- Ensure timestamp does not vary with timezone. (#2982)
- Add a vscode extension for SPIR-V disassembly files (#2987)
- Add iOS as a supported platform (#3001)
- utils/vscode: Add SPIR-V language server support
- Respect CMAKE_INSTALL_LIBDIR in installed CMake files (#3054)
- Permit the debug instructions in WebGPU SPIR-V (#3063)
- Add support for Fuchsia. (#3062)
- Optimizer
- Add descriptor array scalar replacement (#2742)
- Add pass to wrap OpKill in a function call (#2790)
- Fold FMix during constant folding. (#2818)
- Add pass to replace AMD shader ballot extension (#2811)
- Add pass to make Float32 operation relax precision (#2808)
- Add pass to make relax precision operation Float16 (#2808)
- Add pass to replace uses of 3 AMD extensions (#2814)
- Fold Min, Max, and Clamp instructions. (#2836)
- Better handling of OpKill in continues (#2842,#2922,#2933)
- Enable OpTypeCooperativeMatrix specialization (#2927)
- Support constant-folding UConvert and SConvert (#2960)
- Update Offset to ConstOffset bitmask if operand is constant. (#3024)
- Improve RegisterSizePasses (#3059)
- Folding: perform add and sub on mismatched integer types (#3084)
- Graphics robust access: use signed clamp (#3073)
Fixes:
- Instrument: Fix version 2 output record write for tess eval shaders. (#2782)
- Instrument: Add support for Buffer Device Address extension (#2792)
- Fix check for changed binary in API call. (#2798)
- For WebGPU<->Vulkan optimization, set correct execution environment (#2834)
- Handle OpConstantNull in copy-prop-arrays. (#2870)
- Use OpReturn* in wrap-opkill (#2886)
- Validator
- Add generic builtin validation of target (#2843)
- Extra resource interface validation (#2864)
- Adding valilidation checks for OpEntryPoint duplicate names and execution mode (#2862)
- Relaxed bitcast with pointers (#2878)
- Validate physical storage buffer restrictions (#2930)
- Add SPV_KHR_shader_clock validation (#2879, #3013)
- Validate that selections are structured (#2962)
- Disallow use of OpCompositeExtract/OpCompositeInsert with no indices (#2980)
- Check that derivatives operate on 32-bit values (#2983)
- Validate array stride does not cause overlap (#3028)
- Validate nested constructs (#3068)
Fixes:
- Fix validation of constant matrices (#2794)
- Update "remquor" validation
- Only allow previously declared forward refs in structs (#2920)
- Reduce
- Remove relaxed precision decorations (#2797)
- Reduce/fuzz: improve command line args (#2932)
- Improve remove unref instr pass (#2945)
Fixes:
- Fuzz
- Fix add-dead-break and add-dead-continue passes to respect dominance (#2838)
- Add fuzzer pass to copy objects (#2853)
- Add fuzzer pass to replace ids with synonyms (#2857)
- Allow validation during spirv-fuzz replay (#2873)
- Employ the "swarm testing" idea in spirv-fuzz (#2890)
- reduce/fuzz: improve command line args (#2932)
- option to convert shader into a form that renders red (#2934)
- Add fuzzer pass to change selection controls (#2944)
- add transformation and pass to construct composites (#2941)
- Add fuzzer pass to change loop controls (#2949)
- Add fuzzer pass to change function controls (#2951)
- Add fuzzer pass to add NoContraction decorations (#2950)
- Add missing functionality for matrix composites (#2974)
- Fuzzer pass to adjust memory access operands (#2968)
- Transformation to extract from a composite object (#2991)
- Vector shuffle transformation (#3015)
- Improve debugging facilities (#3074)
- Function outlining fuzzer pass (#3078)
v2019.4 2019-08-08
- General:
- Memory model support for SPIR-V 1.4
- Add new spirv-fuzz tool
- Add option for base branch in check_code_format.sh
- Removed MarkV and Stats code. (#2576)
- Instrument: Add version 2 of record formats (#2630)
- Linker: Better type comparison for OpTypeArray and OpTypeForwardPointer (#2580)
- Optimizer
- Bindless Validation: Instrument descriptor-based loads and stores (#2583)
- Better folding for OpSpecConstantOp (#2585, #2614)
- Add in individual flags for Vulkan <-> WebGPU passes (#2615)
- Handle nested breaks from switches. (#2624)
- Optimizer: Handle array type with OpSpecConstantOp length (#2652)
- Perform merge return with single return in loop. (#2714)
- Add --preserve-bindings and --preserve-spec-constants (#2693)
- Remove Common Uniform Elimination Pass (#2731)
- Allow ray tracing shaders in inst bindle check pass. (#2733)
- Add pass to inject code for robust-buffer-access semantics (#2771)
- Treat access chain indexes as signed in SROA (#2776)
- Handle RelaxedPrecision in SROA (#2788)
- Add descriptor array scalar replacement (#2742)
Fixes:
- Handle decorations better in some optimizations (#2716)
- Change the order branches are simplified in dead branch elim (#2728)
- Fix bug in merge return (#2734)
- SSA rewriter: Don't use trivial phis (#2757)
- Record correct dominators in merge return (#2760)
- Process OpDecorateId in ADCE (#2761)
- Fix check for unreachable blocks in merge-return (#2762)
- Handle out-of-bounds scalar replacements. (#2767)
- Don't move debug or decorations when folding (#2772)
- Protect against out-of-bounds references when folding OpCompositeExtract (#2774)
- Validator
- Validate loop merge (#2579)
- Validate construct exits (#2459)
- Validate OpenCL memory and addressing model environment rules (#2589)
- Validate OpenCL environment rules for OpTypeImage (#2606)
- Allow breaks to switch merge from nested construct (#2604)
- Validate OpenCL environment rules for OpImageWrite (#2619)
- Allow arrays of out per-primitive builtins for mesh shaders (#2617)
- Validate OpenCL rules for ImageRead and OpImageSampleExplicitLod (#2643)
- Add validation for SPV_EXT_fragment_shader_interlock (#2650)
- Add builtin validation for SPV_NV_shader_sm_builtins (#2656)
- Add validation for Subgroup builtins (#2637)
- Validate variable initializer type (#2668)
- Disallow stores to UBOs (#2651)A
- Validate Volatile memory semantics bit (#2672)
- Basic validation for Component decorations (#2679)
- Validate that in OpenGL env block variables have Binding (#2685)
- Validate usage of 8- and 16-bit types with only storage capabilities (#2704)
- Add validation for SPV_EXT_demote_to_helper_invocation (#2707)
- Extra small storage validation (#2732)
- For Vulkan, disallow structures containing opaque types (#2546)
- Validate storage class OpenCL environment rules for atomics (#2750)
- Update OpControlBarriers rules for WebGPU (#2769)
- Update OpMemoryBarriers rules for WebGPU (#2775)
- Update WebGPU validation rules of OpAtomic*s (#2777)
Fixes:
- Disallow merge targeting block with OpLoopMerge (#2610)
- Update vloadn and vstoren validation to match the OpenCL Extended
Instruction Set Specification (#2599)
- Update memory scope rules for WebGPU (#2725)
- Allow LOD ops in compute shaders with derivative group execution modes (#2752)
- Reduce
Fixes:
v2019.3 2019-05-14
- General:
- Require Python 3 since Python 2 will out of service soon.
- Add a continuous test that does memory checks using the address sanitizer.
- Fix the build files so the SPIRV_USE_SANITIZER=address build works.
- Packaging top of tree build artifacts again.
- Added support for SPIR-V 1.4. (#2550)
- Optimizer
- Remove duplicates from list of interface IDs in OpEntryPoint instruction (#2449)
- Bindless Validation: Descriptor Initialization Check (#2419)
- Add option to validate after each pass (#2462)
- Add legalization pass to fix mismatched pointer (#2430, #2535)
- Add error messages when the input contains unknown instructions. (#2487)
- Add pass to convert from WebGPU Spir-V to Vulkan Spir-V and back. (#2495)
Fixes:
- #2412: Dead memeber elimination should not change input and output variables.
- #2405: Fix OpDot folding of half float vectors.
- #2391: Dead branch elim should not fold away back edges.
- #2441: Removing decorations when doing constant propagation.
- #2455: Maintain inst to block mapping in merge return.
- #2453: Fix merge return in the face of breaks.
- #2456: Handle dead infinite loops in DCE.
- #2458: Handle variable pointer in some optimizations.
- #2452: Fix dead branch elimination to handle unreachable blocks better.
- #2528: Fix undefined bit shift in sroa.
- #2539: Change implementation of post order CFG traversal.
- Validator
- Add validation of storage classes for WebGPU (#2446)
- Add validation for ExecutionMode in WebGPU (#2443)
- Implement WebGPU specific CFG validation (#2386)
- Allow NonWritable to target struct members. (#2420)
- Allow storage type mismatch for parameter in relaxed addressing mode.
- Allow non memory objects as parameter in relaxed addressing mode.
- Disallow nested Blocks and buffer blocks (#2410).
- Add validation for SPV_NV_cooperative_matrix (#2404)
- Add --strip-atomic-counter-memory (#2413)
- Check OpSampledImage is only passed into valid instructions (#2467)
- Handle function decls in Structured CFG analysis (#2474)
- Validate that OpUnreacahble is not statically reachable (#2473)
- Add pass to generate needed initializers for WebGPU (#2481)
- Allow images without format for OpenCL. (#2470)
- Remove unreachable block validation (#2525)
- Reduce runtime of array layout checks (#2534)
- Add validation specific to OpExecutionModeId (#2536)
- Validate sign of int types. (#2549)
- VK_KHR_uniform_buffer_standard_layout validation (#2562)
Fixes:
- #2439: Add missing DepthGreater case to Fragment only check.
- #2168: Disallow BufferBlock on StorageBuffer variables for Vulkan.
- #2408: Restrict and Aliased decorations cannot be applied to the same id.
- #2447: Improve function call parameter check.
- Reduce
- Add Pass to remove unreferenced blocks. (#2398)
- Allows passing options to the validator. (#2401)
- Improve reducer algorithm and other changes (#2472)
- Add Pass to remove selections (#2485)
- Add passes to simplify branches (#2507)
Fixes:
- #2478: fix loop to selection pass for loops with combined header/continue block
v2019.2 2019-02-20
- General:
- Support SPV_EXT_physical_storage_buffer
- A number of memory leak have been fixed.
- Removed use of deprecated Google test macro:
- Changed the BUILD.gn to only build tests in Chromium.
- Optimizer
- Upgrade memory model improvments for modf and frexp.
- Add a new pass to move loads closer to their uses: code sinking.
- Invalidating the type manager now invalidates the constnat manager.
- Expand instrumentation pass for bindless bounds checking to runtime-sized descriptor arrays.
- Add a new pass that removes members from structs that are not used: dead member elimination.
Fixes:
- #2292: Remove undefined behaviour when folding bit shifts.
- #2294: Fixes for instrumentation code.
- #2293: Fix overflow when folding -INT_MIN.
- #2374: Don't merge unreachable blocks when merging blocks.
- Validator
- Support SPV_KHR_no_integer_wrap and related decorations.
- Validate Vulkan rules for OpTypeRuntimeArray.
- Validate NonWritable decoration.
- Many WebGPU specific validation rules were added.
- Validate variable pointer related function call rules.
- Better error messages.
Fixes:
- #2307: Check forwards references in OpTypeArray.
- #2315, #2303: Fixed the layout check for relaxed layout.
- #1628: Emit an error when an OpSwitch target is not an OpLabel.
- Reduce
- Added more documentation for spirv-reduce.
- Add ability to remove OpPhi instructions.
- Add ability to merge two basic blocks.
- Add ability to remove unused functions and unused basic blocks.
Fixes:
v2019.1 2019-01-07
- General:
- Created a new tool called spirv-reduce.
- Add cmake option to turn off SPIRV_TIMER_ENABLED (#2103)
- New optimization pass to update the memory model from GLSL450 to VulkanKHR.
- Recognize OpTypeAccelerationStructureNV as a type instruction and ray tracing storage classes.
- Fix GCC8 build.
- Add --target-env flag to spirv-opt.
- Add --webgpu-mode flag to run optimizations for webgpu.
- The output disassembled line number stead of byte offset in validation errors. (#2091)
- Optimizer
- Added the instrumentation passes for bindless validation.
- Added passes to help preserve OpLine information (#2027)
- Add basic support for EXT_fragment_invocation_density (#2100)
- Fix invalid OpPhi generated by merge-return. (#2172)
- Constant and type manager have been turned into analysies. (#2251)
Fixes:
- #2018: Don't inline functions with a return in a structured CFG contstruct.
- #2047: Fix bug in folding when volatile stores are present.
- #2053: Fix check for when folding floating pointer values is allowed.
- #2130: Don't inline recursive functions.
- #2202: Handle multiple edges between two basic blocks in SSA-rewriter.
- #2205: Don't unswitch a latch condition during loop unswitch.
- #2245: Don't fold branch in loop unswitch. Run dead branch elimination to fold them.
- #2204: Fix eliminate common uniform to place OpPhi instructions correctly.
- #2247: Fix type mismatches caused by scalar replacement.
- #2248: Fix missing OpPhi after merge return.
- #2211: After merge return, fix invalid continue target.
- #2210: Fix loop invariant code motion to not place code between merge instruction and branch.
- #2258: Handle CompositeInsert with no indices in VDCE.
- #2261: Have replace load size handle extact with no index.
- Validator
- Changed the naming convention of outputing ids with names in diagnostic messages.
- Added validation rules for UniformConstant variables in Vulkan.
- #1949: Validate uniform variable type in Vulkan
- Ensure for OpVariable that result type and storage class operand agree (#2052)
- Validator: Support VK_EXT_scalar_block_layout
- Added Vulkan memory model semantics validation
- Added validation checkes spefic to WebGPU environment.
- Add support for VK_EXT_Transform_feedback capabilities (#2088)
- Add validation for OpArrayLength. (#2117)
- Ensure that function parameter's type is not void (#2118)
- Validate pointer variables (#2111)
- Add check for QueueFamilyKHMR memory scope (#2144)
- Validate PushConstants annotation and type (#2140)
- Allow Float16/Int8 for Vulkan 1.0 (#2153)
- Check binding annotations in resource variables (#2151, #2167)
- Validate OpForwardPointer (#2156)
- Validate operation for OpSpecConstantOp (#2260)
Fixes:
- #2049: Allow InstanceId for NV ray tracing
- Reduce
- Initial commit wit a few passes to reduce test cases.
- Validation is run after each reduction step.
Fixes:
v2018.6 2018-11-07
- General:
- Added support for the Nvidia Turing and ray tracing extensions.
- Make C++11 the CXX standard in CMakeLists.txt.
- Enabled a parallel build for MSVC.
- Enable pre-compiled headers for MSVC.
- Added a code of conduct.
- EFFCEE and RE2 are now required when build the tests.
- Optimizer
- Unrolling loops marked for unrolling in the legalization passes.
- Improved the compile time of loop unrolling.
- Changee merge-return to create a placeholder loop around the function.
- Small improvement to merge-blocks to allow it to merge more often.
- Enforce an upper bound for the ids, and add option to set it.
- #1966: Report error if there are unreachable block before running merge return
Fixes:
- #1917: Allow 0 (meaning unlimited) as a parameter to --scalar-replacement
- #1915: Improve handling of group decorations.
- #1942: Fix incorrect uses of the constant manager. Avoids type mismatches in generated code.
- #1997: Fix dead branch elimination when there is a loop in folded selection.
- #1991: Fixes legality check in if-conversion.
- #1987: Add nullptr check to array copy propagation.
- #1984: Better handling of OpUnreachable in ADCE.
- #1983: Run merge return on reachable functions only.
- #1956: Handled atomic operations in ADCE.
- #1963: Fold integer divisions by 0 to 0.
- #2019: Handle MemberDecorateStringGOOGLE in ADCE and strip reflect.
- Validator
- Added validation for OpGroupNonUniformBallotBitCount.
- Added validation for the Vulkan memory model.
- Added support for VK_KHR_shader_atddomic_int64.
- Added validation for execution modes.
- Added validation for runtime array layouts.
- Added validation for 8-bit storage.
- Added validation of OpPhi instructions with pointer result type.
- Added checks for the Vulkan memory model.
- Validate MakeTexelAvailableKHR and MakeTexelVisibleKHR
- Allow atomic function pointer for OpenCL.
- FPRounding mode checks were implemented.
- Added validation for the id bound with an option to set the max id bound.
Fixes:
- #1882: Improve the validation of decorations to reduce memory usage.
- #1891: Fix an potential infinite loop in dead-branch-elimination.
- #1405: Validate the storage class of boolean objects.
- #1880: Identify arrays of type void as invalid.
- #487: Validate OpImageTexelPointer.
- #1922: Validate OpPhi instructions are at the start of a block correctly.
- #1923: Validate function scope variable are at the start of the entry block.
v2018.5 2018-09-07
- General:
- Support SPV_KHR_vulkan_memory_model
- Update Dim capabilities, to match SPIR-V 1.3 Rev 4
- Automated build bots no run tests for the VS2013 case
- Support Chromium GN build
- Use Kokoro bots:
- Disable Travis-CI bots
- Disable AppVeyor VisualStudio Release builds. Keep VS 2017 Debug build
- Don't check export symbols on OSX (Darwin): some installations don't have 'objdump'
- Reorganize source files and namespaces
- Fixes for ClangTidy, and whitespace (passes 'git cl presumit --all -uf')
- Fix unused param compile warnings/errors when Effcee not present
- Avoid including time headers when timer functionality is disabled
- Avoid too-stringent warnings flags for Clang on Windows
- Internal refactoring
- Add hooks for automated fuzzing
- Add testing of command line executables
- #1688: Use binary mode on stdin; fixes "spirv-dis <foo.spv" on Windows
- Optimizer
- The optimizer validates the module before it begins
- Add API to register passes by string name
- Fold a vector shuffle feeding a vector shuffle
- Add -combine-access-chains transform
- Refactor how IRContext is handled by passes
- Improve bookkeeping for instruction result type and result id
- Fix over-duplication of decorations
- Fix handling of exits from selections in dead-branch elimination, and dead code
elimination.
- Fix handling of certain kinds of flow control in merge-return
Fixes:
- #1721: Fix size bug when folding vector shuffles
- #1722: Fix size infinite loop when folding vector shuffles
- #1724: Fix finding a constant of a specific type
- #1727: Dead branch elim: Reorder blocks if needed to satisfy dominance rule
- #1729: Handle VariablePointers cases in various optimizations
- #1731: Fix vector shuffle with literal id indicating undef value
- #1736: Fix handling of decorations and phis in merge-return
- #1787: Fix handling of decorations related to access chains
- #1865: Avoid leaking memory for SPIR-V constant values
- Validator
- Improve error messages
- Avoid platform-dependent traversal ordering, to ensure consistent messages
- Use libspirv::Instruction where possible
- Add option to skip all block layout checks
- Validate all type IDs
- Validate uses of OpFunction
- Validate uses of OpTypeFunction
- Disallow a struct containing its own type https://crbug.com/874372
- #1685: Vulkan permits non-monotonic offsets for block members
- #1697: Enforce block layout rules even when using relaxed block layout option
- #1719: Fix line number for vector shuffle valiation error
- #1789: Avoid assertion failure when validating some functions
- #1800: Fix validation of OpCopyMemorySized
- #1822: Stop enforcing struct member offset montonicity
- #1831: Disallow void members in structs
v2018.4 2018-07-08
- General:
- Support SPV_KHR_8bit_storage
- Add gclient and presubmit configurations
- Enable Kokoro build bots (#1625)
- Group tests into fewer executables, reduces load on CI
- Port test script to Python 3
- Symbol export tests respect SPIRV_SKIP_TESTS
- #1596: Operand lookup succeeds if enabled by a capability
- #1624: Instruction lookup succeeds if enabled by a capability
- Refactoring namespaces:
- #1678: Change libspirv to spvtools
- Code in source/utils moved into spvtools::utils
- Code in source/comp moved into spvtools::comp
- Optimizer:
- Remove insert-extract-elim pass. Use simplification pass instead.
- Preserve instruction-to-block mapping in most passes, to reduce runtime.
- Small vector optimization for operands
- Add pass to move Private variables to Function. Increase opportunity to optimize.
- Fixes:
#1120: Check static uses of entry point interfaces
#1372: Avoid merging some structs, to preserve names for reflection
#1577: Scalar replacement uses only undecorated types.
#1578: Fix handling of forward-pointer types, and types embedding pointers
to themselves.
#1591: Inliner: Callee variable with initializer should have a store at the call site.
#1634: Fix crash: Use type id in vector type lookup
#1649: Fix assert in compact-ids pass
Fix constant folder: ensure it uses the right type
#1659: Folding rules added to IRContext. Avoids leak.
- Validator
- Add work-in-progress WebGPU environment. Disallows OpUndef
- #670, #1581: Improve error messages; disassemble instruction
- #491: Check structured switches
- #937: Check layout rules for Block and BufferBlock in Uniform, StorageBuffer, PushConstant
- #1281: Check invalid branches into structured constructs
- #1522: Disallow array-of-arrays with DescriptorSets
- #1577: Allow duplicate pointer types.
- #1581: Better messages: output ID names along with numbers in more cases.
- #1597: Check Vulkan 1.1 capabilities
- #1618: Check invalid exit from structured case construct
- #1622: Run IdPass before DataRulesPass
- #1632: Reduce test time by artificially lowering limits in limit test
- #1638: Block-decorated structs member order must respect offset order
- #1657: Improve CFG validation diagnostics
- Khronos SPIR-V #337: GLSL.std.450 Refract instruction Eta param can be any float scalar.
- #1606: PushConstant Blocks follow storage-buffer layout rules
- #1664: Check layout of StorageBuffer variables with Block decoration, using storage buffer
rules
- #1666: Layout validation should permit {vec3; float} packing
- #1637, #1668: Layout validation uses RowMajor, ArrayStride, MatrixStride properly
- Linker
- Avoid buffer overrun when creating OpModuleProcessed
v2018.3 2018-05-25
- General:
- Support SPV_EXT_descriptor_indexing
- Support SPV_GOOGLE_decorate_string
- Support SPV_GOOGLE_hlsl_functionality1
- Support SPV_NV_shader_subgroup_partitioned
- Use "unified1" grammar from SPIRV-Headers
- Simplify support for new extensions. Assembler, disassembler, and simple validation
support is automatic if new tokens are introduced with appropriate extension
attributes in the "unified1" SPIR-V core grammar.
- Disassembler: Emit more digits on floating point, to reliably reproduce all
significand bits. (Use std::max_digits10 instead of std::digits10)
- Fix compilation for old XCode versions: Explicit construction required for std::set.
- Optimizer:
- Add --strip-reflect
- Add --time-report
- Add --loop-fission
- Add lop fusion.
- Add loop peeling pass and internal utility.
- Improve optimizer runtime.
- Merge-return now works with structured control flow.
- New (faster) SSA rewriter to convert local loads and stores to SSA IDs and phis.
Can replace load/store elimination passes.
- Fix instruction folding case: insertion that feeds and extract, when the extract
remains.
- Fold OpDot.
- Fold OpFNegate.
- Fold multply and divide of same value.
- Fold FClamp feeding a compare.
- Fold OpLoad feeding an extract, to reduce excessive copying. (#1547)
- Fold Fmix feeding an extract.
- Use simplification pass instead of insert-extract elimination.
- Constant fold OpVectorTimesScalar.
- Copy propagate arrays, in simple cases.
- Aggressive dead code elimination: Can remove more instructions, e.g. derivatives.
- Aggressive dead code elimination: Remove Workgroup variables that are written but not read.
- Better handling of OpImageTexelPointer
- Initial utilities for scalar evolution.
- Add Vector dead code elimination.
- Each pass can only run once.
- Allow code hosting in if-conversion.
- Add external interface for adding a PassToken, so external code can make their own
passes.
- Fixes:
#1404: Don't optimize away the compute compute workgroup size constant.
#1407: Remove a bad assertion
#1456: Fix bug in SSA rewriter related to variables updated in loops.
#1487: Fix long runtime in Dead insertion elimination: Don't revist select phi nodes.
#1492: Aggressive dead code elimination can remove OpDecorateStringGOOGLE.
#1527: Fix inlining of functions having OpKill and OpUnreachable.
#1559: Fix assert failure in reduce-load-size pass.
#1556: Aggressive dead code elimination: Fix handling of OpCopyMemory.
- Validator:
- Check Vulkan built-in variables
- Check Vulkan-specific atomic result type rule.
- Relax control barrier check for SPIR-V 1.3. Fixes #1427
- Check OpPhi.
- Check OpMemoryModel.
- Stop checking sizes derived from spec-constants.
- Re-enable checks for OpUConvert.
- Vulkan: Fix check for PrimitiveId: Permit as Input in fragment shader.
- Validate binary version for the given target environment.
- Add tests for OpBranch checks.
- Vulkan 1.1: Check scope for non-uniform subgroup operations.
- Fix checks for SPV_AMD_gpu_shader_int16.
- Fix logical layout check for OpDecorateId.
- Fix checks for ViewportIndex & Layer for Vulkan and SPV_EXT_shader_viewport_index_layer.
- Fixes:
#1470: Vulkan: Don't restrict WorkgroupSize to Input storage class.
#1469: Vulkan: Permit Subgroup memory scope for Vulkan 1.1.
#1472: Per-vertex variable validation fixes.
#1483: Valdiate barrier execution scopes for Vulkan 1.1.
- Fixes:
#898: Linker properly removes FuncParamAttr from imported symbols.
#924, #1174: Fix handling of decoration groups in optimizer, linker.
v2018.2 2018-03-07
- General:
- Support SPIR-V 1.3 and Vulkan 1.1.
- Default target environment is now SPIR-V 1.3. For command-line tools,
use the --target-env option to override the default. Examples:
# Generate a SPIR-V 1.0 binary instead of SPIR-V 1.3
spirv-as --target-env spv1.0 a.spvasm -o a.spv
spirv-as --target-env vulkan1.0 a.spvasm -o a.spv
# Validate as Vulkan 1.0
spirv-val --target-env vulkan1.0 a.spv
- Support SPV_GOOGLE_decorate_string and SPV_GOOGLE_hlsl_functionality1
- Fixes:
- Fix Android.mk build. Compilation was failing due to missing definitions of
SpvCapabilityFloat16ImageAMD and other enumerated values.
- Optimizer: Avoid generating duplicate names when merging types.
- #1375: Validator: SPV_AMD_gpu_shaer_half_float implicitly allows declaration
of the 16-bit floating point type.
- #1376: Optimizer: Avoid folding half-precision float.
v2018.1 2018-03-02
- General:
- Support Visual Studio 2013 again. (Continue support for VS 2015 and VS 2017.)
- Support building SPIRV-Tools as a shared library.
- Improve the HLSL legalization optimization recipe. #1311
- Optimizer:
- General speedups.
- Remove generic dead code elimination functionality from transforms:
--eliminate-local-single-block
--eliminate-local-single-store
--eliminate-local-multi-store
To recover the previous behaviour, a recipe using those transforms should now
also invoke the --eliminate-dead-code-aggressive transform.
- Improve folding, including coverage for floating point, OpSelect, and arithmetic
with non-trivial constant operands.
- Add loop-invariant code motion pass.
- Add loop-unrolling pass, for honouring unroll hits.
- Add loop-unswitch pass.
- Add instruction simplification pass.
- Aggressive dead code elimination: Understands capability hierarchy when finding
instructions it can eliminate (combinators). (PR #1268)
- CCP can now fold floating point arithmetic. #1311
- Validator:
- Validate barrier instructions.
- Check Vulkan-specific rules for atomics.
- Check Vulkan prohibition of Location or Component decorations on BuiltIn variables.
- Linker:
- Add --verify-ids option
- Add option to allow a resulting module to be partially linked.
- Handle OpModuleProcessed (instructions in SPIR-V layout section 7c)
- Fixes:
- #1265: Optimizer: Fix use-after free bug in if-conversion. (Fix object lifecycle bug
in type manager.)
- #1282: Fix new warnings found by GCC 8.0.1.
- #1285: Optimizer: Fix random failures during inlining. (Dangling references in DefUseManager)
- #1295: Optimizer: Fix incorrect handling of Phi nodes in CCP.
- #1300: Fix CCP: avoid bad CCP transitions and unsettled values.
- #1304: Avoid static-duration variables of class type (with constructors).
- #1323: Fix folding of an insert composite feeding a composite extract.
- #1339: Fix CCP: Handle OpConstantNull boolean values as conditions.
- #1341: DCEInst: Keep atomic instructions (and some others with side effects).
- #1354: Don't fold integer division.
- #1357: Support OpConstantNull in folding.
- #1361: CCP: Fix handling of non-constant module-scope values
v2018.0 2018-02-02
- General
- VisualStudio 2013 is no longer supported. VisualStudio 2015 is supported.
- Use "include/unified1" directory from SPIRV-Headers. Requires recent SPIRV-Headers source.
- Disassembler: spirv-dis adds --color option to force color disassembly.
- Optimizer:
- Add pass to eliminate dead insertions.
- Aggressive dead code elimination now removes OpSwitch constructs.
- Block merging occurs in more cases.
- Add driver workaround transform: replace OpUnreachable with harmless branch to merge.
- Improve instruction folding framework.
- Add loop analysis.
- Add scalar replacement of aggregates to size-optimization recipe.
- Add pass to replace instructions invalid for a shader stage, with a harmless value.
This changes the semantics of the program! Not for general use!
- Rearragne and add passes to performance-optimization recipe, to produce better results.
- Validator:
- Validate OpenCL extended instructions.
- Shaders can't perform atomics on floats.
- Validate memory semantics values in atomics.
- Validate instruction-adjacency constraints, e.g. OpPhi predecessors, merge instructions
immediately precede branches.
- Fixes:
- PR 1198: Optimizer: Fix CCP in presence of matrix constants.
- #1199: Optimizer: Fix CCP: don't propagate spec constants.
- #1203: Optimizer: Fix common uniform elim bug introduced by refactoring.
- #1210: Optimizer: Aggressive dead code elimination: Fix 'break' identification.
- #1212: Optimizer: Aggressive dead code elimination: Was skipping too many instructions.
- #1214: Optimizer: Aggressive dead code elimination: Fix infinite loop.
- #1228: Optimizer: Fix CCP: Handling of varying Phi nodes; was resulting in infinite loop.
- #1245: Optimizer: Dead branch elimination: Avoid a null pointer dereference.
- #1250: Optimizer: Dead branch elimination: Avoid spuriously reporting a change.
v2017.3 2018-01-12
- General:
- Support DebugInfo extended instruction set, targeted at OpenCL environments.
See the SPIR-V Registry.
- Generate a SPIRV-Tools.pc file for pkg-config.
- Optimizer:
- Progress for legalization of code generated from HLSL (issue #1118):
- Add --legalize-hlsl option to run transforms used to transform intermediate
code generated by HLSL to SPIR-V for Vulkan compilers. Those compilers
normally run these transforms automatically. This option is used for developing
those transforms.
- Add Private-to-Function variable conversion for modules with logical
addressing.
- Add --ccp: SSA Conditional Constant Propagation (CCP)
- Add --print-all to show disassembly for each optimization pass.