ModelZoo / ResNet50_tensorflow
Commit 25a4d743, authored Jul 23, 2018 by Carlos Riquelme
Readme changes.
Parent: fdcd0168
Showing 1 changed file with 2 additions and 2 deletions.
research/deep_contextual_bandits/README.md (+2, -2), viewed @ 25a4d743
...
...
@@ -101,7 +101,7 @@ The Deep Bayesian Bandits library includes the following algorithms (see the
 action, and ---simultaneously--- it updates a Bayesian linear regression in
 the last layer (i.e., the one that maps the final representation **z** to
 the rewards **r**). Thompson Sampling samples the linear parameters
-![equation](https://latex.codecogs.com/gif.download?%5Cinline%20%5Cbeta_i) for each action *i*, but keeps the network that computes the
+β<sub>i</sub> for each action *i*, but keeps the network that computes the
 representation. Then, both parts (network and Bayesian linear regression)
 are updated, possibly at different frequencies. The algorithm is implemented
 in *neural_linear_sampling.py*, and we create an algorithm instance like
...
...
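To illustrate the Neural Linear description in the hunk above, here is a minimal NumPy sketch of Thompson Sampling over a per-action Bayesian linear regression on a fixed representation **z**. It is not the code in *neural_linear_sampling.py*: the names `LinearPosterior` and `choose_action`, and the simplified Gaussian prior with a known noise variance, are assumptions made for illustration only.

```python
import numpy as np


class LinearPosterior:
    """Gaussian posterior over beta_i for one action (hypothetical helper).

    Assumes a N(0, I / prior_precision) prior and a known noise variance,
    which is a simplification chosen for this sketch.
    """

    def __init__(self, dim, prior_precision=1.0, noise_var=1.0):
        self.precision = prior_precision * np.eye(dim)  # posterior precision
        self.b = np.zeros(dim)                          # sum of z * r / noise_var
        self.noise_var = noise_var

    def update(self, z, r):
        # Standard conjugate update for Bayesian linear regression.
        self.precision += np.outer(z, z) / self.noise_var
        self.b += z * r / self.noise_var

    def sample(self, rng):
        # Draw beta_i from the current Gaussian posterior.
        cov = np.linalg.inv(self.precision)
        mean = cov @ self.b
        return rng.multivariate_normal(mean, cov)


def choose_action(posteriors, z, rng):
    """Thompson Sampling: sample beta_i per action, pick the best sampled reward."""
    sampled_rewards = [p.sample(rng) @ z for p in posteriors]
    return int(np.argmax(sampled_rewards))
```

In this sketch the representation **z** would come from the network's last hidden layer; retraining the network and refreshing the posteriors can then happen at different frequencies, as the README text describes.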
@@ -117,7 +117,7 @@ The Deep Bayesian Bandits library includes the following algorithms (see the
 explicitly use Thompson Sampling. However, due to stochastic gradient
 descent, there is still some randomness in its output. It is
 straight-forward to add epsilon-greedy exploration to choose random
-actions with probability ![equation](https://latex.codecogs.com/gif.download?%5Cinline%20%5Cepsilon%20%5Cin%20%5B0%2C%201%5D). The algorithm is
+actions with probability ε ∈ (0, 1). The algorithm is
 implemented in *neural_bandit_model.py*, and it is used together with
 *PosteriorBNNSampling* (defined in *posterior_bnn_sampling.py*) by calling:
...
...
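The epsilon-greedy exploration mentioned in the hunk above can be sketched as follows. This is an illustrative stand-alone function, not the logic in *neural_bandit_model.py* or *PosteriorBNNSampling*; the name `epsilon_greedy` and its arguments are hypothetical, and the range check simply mirrors the ε ∈ (0, 1) wording of the updated README line.

```python
import numpy as np


def epsilon_greedy(predicted_rewards, epsilon, rng):
    """Pick a uniformly random action with probability epsilon, else the greedy one.

    `predicted_rewards` holds one predicted reward per action (hypothetical input).
    """
    if not 0.0 < epsilon < 1.0:
        raise ValueError("epsilon must lie in the open interval (0, 1)")
    if rng.random() < epsilon:
        # Explore: uniform over all actions.
        return int(rng.integers(len(predicted_rewards)))
    # Exploit: action with the highest predicted reward.
    return int(np.argmax(predicted_rewards))
```

A caller would pass the network's per-action reward predictions for the current context and a NumPy `Generator`, for example `epsilon_greedy(preds, 0.05, np.random.default_rng())`.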