ThompsonSampling
Extends:
Implementation of the Thompson Sampling algorithm
Constructor Summary
Public Constructor | ||
public |
constructor(options: Object): this creates a new instance of the Thompson Sampling(TS) algorithm. |
Member Summary
Public Members | ||
public |
|
|
public |
|
|
public |
|
Method Summary
Public Methods | ||
public |
learn(tsRow: Object, getBound: Function): this single step trainning method |
|
public |
predict(): number returns next action based off of the thompson sampling |
|
public |
train(tsRow: Object | Object[], getBound: Function): this training method for thompson sampling calculations |
Inherited Summary
From class ReinforcedLearningBase | ||
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
|
|
public |
learn() interface instance method for reinforced learning step |
|
public |
predict() interface instance method for reinforced prediction step |
|
public |
train() interface instance method for reinforced training step |
Public Constructors
public constructor(options: Object): this source
creates a new instance of the Thompson Sampling(TS) algorithm. TS a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists in choosing the action that maximizes the expected reward with respect to a randomly drawn belief
Override:
ReinforcedLearningBase#constructorParams:
Name | Type | Attribute | Description |
options | Object |
|
Return:
this |
Example:
const dataset = new ms.ml.ThompsonSampling({bounds:10});
See:
- https://en.wikipedia.org/wiki/Thompson_sampling
Public Methods
public learn(tsRow: Object, getBound: Function): this source
single step trainning method
Override:
ReinforcedLearningBase#learnParams:
Name | Type | Attribute | Description |
tsRow | Object | row of bound selections |
|
getBound | Function |
|
select value of tsRow by selection value |
Return:
this |
public predict(): number source
returns next action based off of the thompson sampling
Override:
ReinforcedLearningBase#predictReturn:
number | returns thompson sample |
public train(tsRow: Object | Object[], getBound: Function): this source
training method for thompson sampling calculations
Override:
ReinforcedLearningBase#trainParams:
Name | Type | Attribute | Description |
tsRow | Object | Object[] | row of bound selections |
|
getBound | Function |
|
select value of tsRow by selection value |
Return:
this |