Modified Experiment Scripts Added #4

Open
wants to merge 9 commits into
base: aaai
2,289 changes: 0 additions & 2,289 deletions agent/133.policy

This file was deleted.

674 changes: 0 additions & 674 deletions agent/133.pomdp

This file was deleted.

4,087 changes: 2,376 additions & 1,711 deletions agent/144.policy

Large diffs are not rendered by default.

864 changes: 432 additions & 432 deletions agent/144.pomdp

Large diffs are not rendered by default.

4,244 changes: 2,073 additions & 2,171 deletions agent/155.policy

Large diffs are not rendered by default.

650 changes: 325 additions & 325 deletions agent/155.pomdp

Large diffs are not rendered by default.

3,831 changes: 1,984 additions & 1,847 deletions agent/166.policy

Large diffs are not rendered by default.

1,080 changes: 540 additions & 540 deletions agent/166.pomdp

Large diffs are not rendered by default.

3,053 changes: 1,758 additions & 1,295 deletions agent/177.policy

Large diffs are not rendered by default.

6,468 changes: 3,234 additions & 3,234 deletions agent/177.pomdp

Large diffs are not rendered by default.

Binary file modified agent/Plots/all_5_trials_entropy_experiment_model_133.png
50 changes: 50 additions & 0 deletions agent/Plots_data/50Trial_Belief_Entr_133.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
,Belief,Entropy,Overall Cost,Overall Success,Overall Reward,Precision,Recall
1,0.3,2.0,15.48,0.58,84.52,0.5434782608695652,1.0
2,0.3,3.0,16.43,0.58,71.57,0.5777777777777777,1.0
3,0.3,4.0,17.67,0.58,78.33,0.5652173913043478,1.0
4,0.3,5.0,16.78,0.6,79.22,0.5476190476190477,1.0
5,0.3,6.0,17.73,0.48,78.27,0.40476190476190477,0.9444444444444444
6,0.3,7.0,15.38,0.64,80.62,0.5714285714285714,1.0
7,0.3,8.0,16.22,0.68,71.78,0.6744186046511628,1.0
8,0.4,2.0,17.08,0.5,78.92,0.45652173913043476,1.0
9,0.4,3.0,19.15,0.56,68.85,0.5581395348837209,0.96
10,0.4,4.0,21.0,0.62,71.0,0.5813953488372093,1.0
11,0.4,5.0,18.53,0.76,69.47,0.7352941176470589,1.0
12,0.4,6.0,19.01,0.8,76.99,0.7428571428571429,0.9629629629629629
13,0.4,7.0,20.0,0.84,80.0,0.7647058823529411,1.0
14,0.4,8.0,19.88,0.78,72.12,0.7222222222222222,1.0
15,0.5,2.0,17.36,0.52,82.64,0.45454545454545453,1.0
16,0.5,3.0,19.61,0.58,68.39,0.5681818181818182,1.0
17,0.5,4.0,21.01,0.6,78.99,0.5348837209302325,1.0
18,0.5,5.0,19.57,0.64,72.43,0.5277777777777778,1.0
19,0.5,6.0,19.73,0.78,64.27,0.7777777777777778,0.9655172413793104
20,0.5,7.0,21.7,0.86,74.3,0.85,1.0
21,0.5,8.0,18.41,0.88,81.59,0.7777777777777778,1.0
22,0.6,2.0,17.7,0.6,82.3,0.574468085106383,1.0
23,0.6,3.0,17.5,0.7,78.5,0.6410256410256411,1.0
24,0.6,4.0,21.22,0.54,70.78,0.4888888888888889,1.0
25,0.6,5.0,17.83,0.86,70.17,0.8620689655172413,1.0
26,0.6,6.0,20.02,0.76,79.98,0.6571428571428571,1.0
27,0.6,7.0,19.95,0.8,80.05,0.696969696969697,1.0
28,0.6,8.0,22.26,0.86,69.74,0.84375,0.9642857142857143
29,0.7,2.0,18.79,0.54,73.21,0.5111111111111111,1.0
30,0.7,3.0,18.83,0.7,77.17,0.6585365853658537,1.0
31,0.7,4.0,19.14,0.7,80.86,0.6341463414634146,1.0
32,0.7,5.0,20.03,0.72,71.97,0.6388888888888888,1.0
33,0.7,6.0,20.98,0.78,79.02,0.7105263157894737,1.0
34,0.7,7.0,19.46,0.78,68.54,0.7333333333333333,0.88
35,0.7,8.0,20.42,0.8,71.58,0.75,0.96
36,0.8,2.0,17.5,0.52,78.5,0.4888888888888889,1.0
37,0.8,3.0,19.88,0.74,76.12,0.7142857142857143,1.0
38,0.8,4.0,18.98,0.7,77.02,0.6585365853658537,1.0
39,0.8,5.0,19.05,0.7,72.95,0.6486486486486487,0.96
40,0.8,6.0,19.09,0.86,76.91,0.8064516129032258,0.9615384615384616
41,0.8,7.0,19.35,0.86,72.65,0.8214285714285714,0.9583333333333334
42,0.8,8.0,20.39,0.82,59.61,0.8076923076923077,0.84
43,0.9,2.0,18.1,0.5,69.9,0.45,0.9473684210526315
44,0.9,3.0,18.64,0.6,77.36,0.47368421052631576,1.0
45,0.9,4.0,18.79,0.62,73.21,0.5952380952380952,1.0
46,0.9,5.0,21.04,0.7,74.96,0.6666666666666666,1.0
47,0.9,6.0,18.66,0.7,73.34,0.5806451612903226,0.9473684210526315
48,0.9,7.0,19.17,0.88,76.83,0.84375,0.9642857142857143
49,0.9,8.0,22.17,0.82,73.83,0.7647058823529411,1.0
@@ -0,0 +1,8 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score
2,13.6162,0.3686,6.6704,0.5002,1.0,0.6668444207439008
3,14.1266,0.39,5.6624,0.5119072708113804,0.9700479233226837,0.6701614015726307
4,14.5574,0.4142,5.3706,0.5292812777284827,0.9509764846552411,0.6800627048596266
5,15.0809,0.4432,3.9007,0.5493300852618758,0.9081755940394683,0.6845780206435944
6,15.1797,0.4756,3.4961,0.5763870628451223,0.8778534241089307,0.6958730158730159
7,15.3858,0.4948,1.603,0.5995267672286306,0.8140562248995984,0.6905126894907171
8,15.5766,0.515,-0.0274,0.6275794300687848,0.7763371150729336,0.6940771599347945
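The `F1 Score` column in these result files appears to be the harmonic mean of the `Precision` and `Recall` columns. The sketch below is only a consistency check under that assumption; the helper name `f1_score` is hypothetical and is not taken from the experiment scripts in this PR. It returns 0.0 when both inputs are zero, which matches the rows elsewhere in this diff where all three columns are 0.0.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of the F1 Score column)."""
    if precision + recall == 0.0:
        # Assumed convention: report 0.0 rather than dividing by zero.
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


# Row with index 2 of the hunk above: Precision 0.5002, Recall 1.0.
row_f1 = f1_score(0.5002, 1.0)
print(row_f1)
```

For that row the result agrees with the stored value 0.6668444207439008 to within floating-point rounding.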
@@ -0,0 +1,8 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score
2,16.678,0.472,72.922,0.498,1.0,0.664886515353805
3,17.642,0.47,75.558,0.488,1.0,0.6559139784946236
4,18.365,0.516,74.435,0.536,1.0,0.6979166666666667
5,18.547,0.448,74.253,0.464,1.0,0.6338797814207651
6,19.974,0.484,75.226,0.502,1.0,0.6684420772303595
7,19.87,0.522,73.73,0.5210970464135021,0.9959677419354839,0.6842105263157894
8,20.258,0.524,73.342,0.5157894736842106,1.0,0.6805555555555556
@@ -0,0 +1,6 @@
,Overall Cost,Overall Success,Overall Reward,Precision,Recall
0.3,16.302,0.64,77.698,0.605080831408776,0.9961977186311787
0.4,18.938,0.85,73.862,0.8092105263157895,0.9647058823529412
0.5,20.569,0.86,70.631,0.8213058419243986,0.956
0.6,20.794,0.864,68.806,0.852112676056338,0.937984496124031
0.7,21.253,0.864,68.047,0.8458904109589042,0.9573643410852714
@@ -0,0 +1,8 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score
2,16.61,0.5,83.39,0.5,1.0,0.6666666666666666
3,17.1,0.52,74.9,0.54,1.0,0.7012987012987013
4,18.1,0.54,77.9,0.54,1.0,0.7012987012987013
5,19.8,0.38,76.2,0.38,1.0,0.5507246376811594
6,19.09,0.54,72.91,0.56,1.0,0.717948717948718
7,18.51,0.42,65.49,0.41304347826086957,1.0,0.5846153846153846
8,20.42,0.5,71.58,0.4888888888888889,1.0,0.6567164179104478
@@ -0,0 +1,8 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score
2,15.0,0.4,85.0,0.4,1.0,0.5714285714285715
3,14.9,0.4,85.1,0.4,1.0,0.5714285714285715
4,17.3,0.2,82.7,0.2,1.0,0.33333333333333337
5,14.8,0.6,85.2,0.6,1.0,0.7499999999999999
6,18.9,0.4,41.1,0.6,1.0,0.7499999999999999
7,21.8,0.6,78.2,0.6,1.0,0.7499999999999999
8,17.0,0.2,83.0,0.2,1.0,0.33333333333333337
14 changes: 7 additions & 7 deletions agent/Plots_data/all5_trials_entropy_experiment_model_133.csv
@@ -1,8 +1,8 @@
 ,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score
-2,16.8,0.6,83.2,0.5,1.0,0.6666666666666666
-3,18.0,0.8,82.0,0.6666666666666666,1.0,0.8
-4,18.1,0.8,81.9,0.75,1.0,0.8571428571428571
-5,19.2,1.0,80.8,1.0,1.0,1.0
-6,17.7,0.8,82.3,0.75,1.0,0.8571428571428571
-7,9.6,1.0,90.4,0.0,0.0,0.0
-8,23.4,0.8,76.6,0.75,1.0,0.8571428571428571
+2,15.8,0.6,24.0,0.5,1.0,0.6666666666666666
+3,11.4,0.6,8.0,1.0,0.75,0.8571428571428571
+4,18.7,0.8,40.0,0.75,1.0,0.8571428571428571
+5,24.8,0.6,24.0,0.8,1.0,0.888888888888889
+6,14.1,0.8,40.0,0.0,0.0,0.0
+7,16.8,1.0,40.0,1.0,1.0,1.0
+8,24.0,0.8,24.0,1.0,1.0,1.0
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,15.97805,0.4692,-14.06285,0.7054961545676234,0.7525010004001601,0.7282408752057314,6.305852912553872,0.7193
155,12.38865,0.3562,-21.71415,0.6791595684270301,0.4785914365746298,0.5615023474178403,5.841315453384419,0.6264
166,10.3644,0.2726,-26.5271,0.636406396989652,0.2707082833133253,0.37984278495227397,5.798280186313149,0.5582
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,20.3862,0.4254,-19.56105,0.7239673509769973,0.5856342537014806,0.6474947461563987,8.771613092617056,0.6813
155,14.6078,0.3278,-24.86845,0.7080523601745339,0.35714285714285715,0.4747971804761271,7.504544703354817,0.6051
166,11.8439,0.2548,-28.3301,0.6686701728024043,0.17807122849139656,0.2812450624111234,7.279398275545772,0.5451
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,16.29535,0.46,-15.53245,0.7007843137254902,0.7150860344137655,0.7078629431570609,6.854468085106383,0.705
155,13.3477,0.3502,-23.067,0.6733847065797274,0.4545818327330932,0.5427615862398472,6.563350615683733,0.6172
166,10.5638,0.2591,-28.06905,0.6211180124223602,0.2000800320128051,0.3026634382566586,6.479413946587537,0.5392
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,Efficiency,Accuracy
144,16.46,0.462,-12.952,0.677758318739,0.784989858012,0.727443609023,14.7441988592,5.92535211268,0.71
155,12.6355,0.376,-19.797,0.688172043011,0.519269776876,0.591907514451,11.2645412579,5.6398763524,0.647
166,10.1925,0.272,-26.4915,0.611940298507,0.249492900609,0.35446685879,8.76368608235,6.09782608696,0.552
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,Efficiency,Accuracy
144,20.263,0.43,-18.8875,0.731070496084,0.567951318458,0.639269406393,19.2364843722,8.53801169591,0.684
155,15.387,0.351,-23.189,0.741035856574,0.377281947262,0.5,16.3466580988,7.89012738854,0.628
166,11.417,0.243,-29.246,0.583333333333,0.0993914807302,0.169844020797,12.6515655553,7.443378119,0.521
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,Efficiency,Accuracy
144,16.431,0.459,-13.4805,0.675,0.766734279919,0.717948717949,14.2723942981,6.15931721195,0.703
155,13.0655,0.368,-20.9105,0.684357541899,0.496957403651,0.575793184489,11.4761909948,6.11267605634,0.639
166,10.486,0.272,-26.7115,0.607329842932,0.235294117647,0.33918128655,9.4250890712,6.30109489051,0.548
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,21.06,0.42,-9.27,0.654545454545,0.72,0.685714285714,19.2108406896,94.0074310892,6.58208955224,0.67
155,17.065,0.33,-33.2,0.632653061224,0.62,0.626262626263,15.099280612,95.1339319065,6.68253968254,0.63
166,13.73,0.32,-42.785,0.59375,0.38,0.463414634146,11.6147363293,94.0269975858,6.33928571429,0.56
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,22.75,0.41,-15.005,0.725,0.58,0.644444444444,20.7442160614,91.9892791308,9.25,0.68
155,19.32,0.28,-43.5,0.606060606061,0.4,0.481927710843,17.7986684895,91.4711976526,8.56140350877,0.57
166,16.435,0.28,-49.57,0.769230769231,0.2,0.31746031746,17.1893360837,89.0961003636,10.3684210526,0.57
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,21.255,0.43,-9.48,0.666666666667,0.72,0.692307692308,19.1300150287,93.8854067467,6.83823529412,0.68
155,18.28,0.35,-32.425,0.659090909091,0.58,0.617021276596,16.5041388748,94.239757401,7.515625,0.64
166,14.225,0.3,-47.275,0.58064516129,0.36,0.444444444444,11.8176509933,92.2634373682,7.70909090909,0.55
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,13.8,0.9,18.2,1.0,1.0,1.0,9.25,1.0
155,12.85,0.6,3.15,0.6666666666666666,0.5,0.5714285714285715,10.333333333333334,0.7
166,17.7,0.3,-25.7,0.75,0.5,0.6,14.0,0.6
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,23.8,0.8,16.2,0.6666666666666666,1.0,0.8,15.666666666666666,0.8
155,16.7,0.4,-24.7,0.0,0.0,0.0,50.0,0.4
166,12.4,0.3,-24.5,0.0,0.0,0.0,50.0,0.5
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,19.85,0.6,-3.85,0.8888888888888888,1.0,0.9411764705882353,8.11111111111111,0.9
155,13.45,0.6,2.55,0.3333333333333333,0.3333333333333333,0.3333333333333333,10.666666666666666,0.6
166,15.95,0.4,-15.95,0.0,0.0,0.0,10.5,0.5
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,18.5,0.0,-58.5,0.0,0.0,0.0,50.0,0.0
155,23.0,0.0,17.0,0.0,0.0,0.0,50.0,0.0
166,7.5,0.0,32.5,0.0,0.0,0.0,50.0,0.0
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,22.0,1.0,18.0,0.0,0.0,0.0,18.0,1.0
155,8.5,0.0,-48.5,0.0,0.0,0.0,6.0,1.0
166,9.0,0.0,31.0,0.0,0.0,0.0,50.0,0.0
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,20.0,0.0,20.0,0.0,0.0,0.0,50.0,0.0
155,11.5,0.0,-51.5,0.0,0.0,0.0,10.0,1.0
166,13.5,1.0,26.5,0.0,0.0,0.0,11.0,1.0
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,15.27825,0.457,-9.8705,0.714151827554,0.75,0.73163706193,14.5289780073,42.1814471036,5.88341429563,0.7205
155,12.44925,0.367,-16.75525,0.679851668727,0.541338582677,0.602739726027,11.0339385279,40.7515824532,5.80784313725,0.6375
166,10.187,0.281,-23.85675,0.653846153846,0.301181102362,0.412398921833,8.73244129668,37.6400598357,6.45124113475,0.564
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,17.799,0.4345,-14.30475,0.754057428215,0.594488188976,0.664832140892,17.7697945683,42.9184954587,7.93026599569,0.6955
155,14.59825,0.348,-21.084,0.71,0.419291338583,0.527227722772,14.7581916893,41.1778179849,7.54207119741,0.618
166,12.1155,0.263,-27.16875,0.692307692308,0.186023622047,0.293250581846,13.5844915161,37.3605727798,7.97612488522,0.5445
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,15.623,0.462,-10.09175,0.71619047619,0.740157480315,0.727976766699,14.3707818507,42.1586433242,6.16828929068,0.719
155,12.4555,0.3565,-18.05625,0.665306122449,0.481299212598,0.558537978298,10.5652032517,40.4389504183,6.26894865526,0.6135
166,10.3215,0.279,-24.3095,0.644230769231,0.263779527559,0.374301675978,8.66161865646,37.4910217219,6.77173913043,0.552
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,17.25,0.0,-17.25,0.0,0.0,0.0,50.0,0.0
155,14.75,0.5,25.25,0.0,0.0,0.0,7.0,0.5
166,15.5,0.0,-15.5,0.0,0.0,0.0,50.0,0.0
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,18.0,1.0,22.0,0.0,0.0,0.0,15.0,1.0
155,12.75,1.0,27.25,0.0,0.0,0.0,10.0,1.0
166,7.5,0.0,-7.5,0.0,0.0,0.0,50.0,0.0
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,11.25,0.5,-11.25,0.0,0.0,0.0,9.0,1.0
155,16.5,0.5,-16.5,0.0,0.0,0.0,7.0,0.5
166,7.25,0.0,-47.25,0.0,0.0,0.0,5.0,1.0
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,15.6056666667,0.419333333333,-13.1348333333,0.719274680994,0.698174706649,0.708567648032,14.2659969118,42.0948005891,6.60122699387,0.706333333333
155,12.8431666667,0.349333333333,-19.8668333333,0.687613843352,0.492177314211,0.573708206687,10.733762309,40.3476147577,6.6214057508,0.626
166,10.0588333333,0.258,-26.4326666667,0.643435980551,0.258800521512,0.369130636913,8.40869323809,36.5831673436,6.55021302495,0.547666666667
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,17.802,0.401666666667,-16.0643333333,0.731942215088,0.594524119948,0.656115107914,17.4070520958,42.6809816494,8.09589041096,0.681333333333
155,15.0426666667,0.332,-22.9166666667,0.692216981132,0.382659713168,0.492863140218,14.8941379818,40.581229925,7.88113839286,0.597333333333
166,12.0938333333,0.241666666667,-28.7416666667,0.613466334165,0.16036505867,0.254263565891,13.6943277298,36.493598944,7.6859344894,0.519
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,16.6138333333,0.401333333333,-16.3673333333,0.705972434916,0.601043024772,0.649295774648,14.3092321704,41.7073362798,8.00349301397,0.668
155,13.2951666667,0.342333333333,-21.1301666667,0.669696969697,0.432203389831,0.525356576862,10.7855826595,40.1617206633,7.37291897891,0.600666666667
166,10.1811666667,0.252,-27.579,0.646341463415,0.207301173403,0.31391905232,8.38181137775,36.1965549788,6.93850931677,0.536666666667
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,16.075875,0.67475,11.064125,0.6988727858293076,0.875882946518668,0.7774294670846394,7.8576180971390555,0.7515
155,15.334375,0.58375,0.599125,0.6940524193548387,0.6968623481781376,0.6954545454545455,7.968146027201145,0.6985
166,11.829125,0.424,-4.397625,0.6069672131147541,0.7249143416544298,0.6607182690162837,4.265026220250101,0.61975
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,19.006125,0.67475,-4.011375,0.9551954242135366,0.5002496255616575,0.6566186107470511,12.71239837398374,0.738
155,15.70175,0.53625,-11.5835,0.8753709198813057,0.2981303688731683,0.44477949491142105,9.808468539770479,0.63175
166,11.69525,0.42725,-3.78525,0.5805776475512767,0.7120123203285421,0.6396126354623012,4.295855560114895,0.60925
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,16.31725,0.6635,9.96275,0.6945773524720893,0.8688279301745636,0.771992023044538,7.913833726018176,0.74275
155,15.340375,0.57425,-0.594625,0.7075705096486887,0.7016683022571149,0.7046070460704607,8.099607283113174,0.70025
166,12.540125,0.43875,-14.406125,0.7527593818984547,0.34305835010060365,0.47131997235659995,7.850202429149798,0.6175
4 changes: 4 additions & 0 deletions agent/Plots_data/all_5000_baseline1.csv
@@ -0,0 +1,4 @@
QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,20.1735,0.413,-20.4117,0.7140680548501778,0.5594906486271389,0.6273984828201696,8.606606606606606,0.666
155,14.9643,0.3256,-25.2868,0.7158555729984302,0.36291285316354954,0.4816477422762081,7.7691801119525845,0.6074
166,11.7094,0.25,-28.7488,0.6578538102643857,0.16832471150019895,0.2680608365019011,7.288104089219331,0.538
4 changes: 4 additions & 0 deletions agent/Plots_data/all_5000_baseline2.csv
@@ -0,0 +1,4 @@
QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,16.215,0.4468,-16.4847,0.6962226640159046,0.6967767608436132,0.6964996022275258,6.861830742659758,0.6948
155,13.4027,0.3498,-23.0227,0.6729559748427673,0.46836450457620377,0.5523228531206006,6.603816300129366,0.6184
166,10.4842,0.2516,-28.7078,0.6189873417721519,0.1945881416633506,0.296094459582198,6.564485981308411,0.535
4 changes: 4 additions & 0 deletions agent/Plots_data/all_5000_pomdpdual.csv
@@ -0,0 +1,4 @@
QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,15.8931,0.4554,-15.1249,0.7012590614269363,0.7313967369677676,0.716010907674328,6.315923207227555,0.7084
155,12.6773,0.355,-21.875,0.6797133406835723,0.49064862713887786,0.5699098682690086,5.8834023574386745,0.6278
166,10.2841,0.2674,-27.0446,0.6303088803088803,0.25984878631118186,0.3679909833755988,5.98657961552412,0.5514
4 changes: 4 additions & 0 deletions agent/Plots_data/all_5000_sigm_baseline1.csv
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,20.6136,0.4116,-20.6444,0.719493670886076,0.5654596100278552,0.6332442067736186,8.555754323196183,0.6708
155,14.8927,0.3244,-25.2576,0.7059301380991064,0.3458018304814962,0.4642094017094017,7.666666666666667,0.5988
166,11.6582,0.24,-29.7819,0.6548856548856549,0.12534818941504178,0.21042084168336672,7.59597875569044,0.5272
4 changes: 4 additions & 0 deletions agent/Plots_data/all_5000_sigm_baseline2.csv
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,16.232,0.4432,-16.7668,0.6968253968253968,0.6987664146438519,0.6977945559308563,6.845357861454441,0.6958
155,13.3665,0.348,-23.0846,0.6653013458162669,0.45244727417429365,0.538607295120796,6.565203145478375,0.6104
166,10.4973,0.253,-28.57,0.6150895140664961,0.19140469558296858,0.2919575113808801,6.523434570678665,0.5334
4 changes: 4 additions & 0 deletions agent/Plots_data/all_5000_sigm_pomdpdual.csv
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,Efficiency,Accuracy
144,16.1003,0.4548,-15.2179,0.7036336109008327,0.7397532829287704,0.7212415130940835,6.294134156609599,0.7126
155,12.5152,0.3528,-21.99,0.6721218961625283,0.47393553521687226,0.5558926487747958,5.861155957378108,0.6194
166,10.3518,0.2598,-27.8714,0.628476084538376,0.2248308794269797,0.3311840562719813,6.184326710816777,0.5436
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,16.4198,0.4484,-14.0268,0.674768914755,0.784321528054,0.725432462275,14.5053392914,43.5593845429,5.88483466363,0.7016
155,12.469,0.3558,-21.5471,0.682232957595,0.505769996021,0.580895795247,10.7900713158,40.396404934,5.70972836387,0.6332
166,10.4704,0.2702,-26.7797,0.641476274165,0.290489454835,0.399890440975,9.62791378441,36.9414931738,5.86685653257,0.5618
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,20.455,0.4174,-20.1714,0.719919110212,0.566653402308,0.634157203295,19.2803520455,44.1542208859,8.64313375037,0.6714
155,15.0266,0.3212,-25.6395,0.717628705148,0.366096299244,0.484848484848,15.6340427414,40.0431565907,7.73596059113,0.609
166,11.8826,0.2442,-29.2162,0.646869983949,0.160366096299,0.257015306122,13.4107948027,36.0553097,7.26891385768,0.534
@@ -0,0 +1,4 @@
,QA Cost,Overall Success,Dialog Reward,Precision,Recall,F1 Score,QACostStd,RewardStd,Efficiency,Accuracy
144,16.4261,0.4444,-14.7749,0.673076923077,0.766016713092,0.71654569142,14.1246695108,43.3340637373,6.13028472821,0.6954
155,12.9793,0.35,-22.5093,0.675931072818,0.483883804218,0.56400742115,11.1444121204,40.3490924744,6.16282051282,0.624
166,10.6361,0.265,-27.5439,0.637104994903,0.24870672503,0.357756153406,9.70487129178,36.7312458649,6.32148040639,0.5512