Dev #26

Merged
merged 55 commits into from
Oct 31, 2019

Conversation

dityas
Owner

@dityas dityas commented Oct 31, 2019

Working POMDP solver and IPOMDP solver.
POMDP policies and solutions can be exported to JSON and dot files.
The same functionality for IPOMDPs is just another commit away.

To be fixed:

thinclab.belief
  • Belief and InteractiveBelief should have uniform API for belief updates and other belief ops
  • SSGAExpansion should start from alpha vector policy consisting of reward functions instead of performing a full belief expansion first
  • All expansion strategies should be compatible with both POMDPs and IPOMDPs
thinclab.decisionprocesses
  • Unify POMDP and IPOMDP DD vars and the way dynamics are stored
  • Maintain a static belief update method which calls thinclab.belief functions
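
As a rough illustration of what a uniform belief-update API with a static update method could look like, here is a minimal sketch over flat probability maps; the class name `BeliefOps` and the flat representation are assumptions for illustration (thinclab operates on decision diagrams), not the project's actual code:

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of a uniform, static belief-update API.
public class BeliefOps {

    // b'(s') ∝ O(o | s') * sum_s T(s' | s, a) * b(s), then normalize.
    public static Map<String, Double> beliefUpdate(
            Map<String, Double> belief,
            Map<String, Map<String, Double>> transition,   // T[s][s'] for the taken action
            Map<String, Double> obsLikelihood) {           // O[s'] for the seen observation

        Map<String, Double> next = new HashMap<>();
        double norm = 0.0;

        for (String sPrime : obsLikelihood.keySet()) {
            double mass = 0.0;
            for (Map.Entry<String, Double> e : belief.entrySet())
                mass += transition.get(e.getKey()).getOrDefault(sPrime, 0.0) * e.getValue();
            double val = obsLikelihood.get(sPrime) * mass;
            next.put(sPrime, val);
            norm += val;
        }

        // Normalize (no guard here for impossible observations where norm == 0).
        for (String sPrime : next.keySet())
            next.put(sPrime, next.get(sPrime) / norm);
        return next;
    }
}
```

The point of the sketch is the to-fix item above: both Belief and InteractiveBelief could route their updates through one static entry point like this.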

dityas added 30 commits October 11, 2019 20:03
Bring the thesis branch up to date with new changes in master
The camper attacker domain for L0 is built
A minimal attacker domain has been started in l0. The idea is to have
ATT&CK techniques as actions taken by the attacker. Need to fix
CREDS_DISCOVERY. It should succeed only when HAS_CREDS has been
confirmed through observations.
The minimal persistence domain is solvable. The policy is not
particularly sophisticated, but it looks like it will work for data
exfils.
The IPOMDP solver oscillates even for fixed belief tree depths. The
POMDP solver was checked to see if it behaved similarly. It converges.
This implies a probable bug in the IPOMDP solver code and not in the
algorithm itself.
A lot of the attributes in the POMDP class are not used once the policy
is obtained. These are set to null to save memory.
The L1 domain file for the defender is created. But testing it would
require a verified L1 solver. Also, the Bellman error was being printed
to up to 9 decimal places; this is cut down to 3 to make space for
other important metrics.
The value iteration solver is now compatible with the general
OnlineSolver API. Some tests still need to be added to account for
unforeseen args and edge cases
The L1 solvers had to implement oneStepNZPrimeBelStates and dpBackup
every time. Since these functions could also be useful outside the
solver itself, they have been refactored into other classes as static
methods.
The OnlineSolver keeps track of the Bellman error values for the few
most recent iterations and declares approximate convergence when the
variance of the error is low.
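
The variance-based convergence test described in this commit can be sketched as follows; the `ConvergenceCheck` class, window size, and threshold are illustrative assumptions, not the actual OnlineSolver internals:

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical sketch: declare approximate convergence when the variance
// of the most recent Bellman errors falls below a threshold.
public class ConvergenceCheck {

    private final Deque<Double> recentErrors = new ArrayDeque<>();
    private final int window;
    private final double varianceThreshold;

    public ConvergenceCheck(int window, double varianceThreshold) {
        this.window = window;
        this.varianceThreshold = varianceThreshold;
    }

    // Record the Bellman error of the latest iteration, keeping only
    // the most recent `window` values.
    public void record(double bellmanError) {
        recentErrors.addLast(bellmanError);
        if (recentErrors.size() > window) recentErrors.removeFirst();
    }

    // Converged once the window is full and the error variance is small.
    public boolean hasConverged() {
        if (recentErrors.size() < window) return false;
        double mean = recentErrors.stream()
                .mapToDouble(Double::doubleValue).average().orElse(0.0);
        double var = recentErrors.stream()
                .mapToDouble(e -> (e - mean) * (e - mean)).average().orElse(0.0);
        return var < varianceThreshold;
    }
}
```

Using the variance rather than the error magnitude alone lets the solver stop when the error plateaus, even if it oscillates around a nonzero value.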
IPOMDP objects are now serializable. All the enclosed member objects
also implement serialization. The parser objects for both POMDPs and
IPOMDPs are nulled out after parsing is done.
The OnlinePolicyTree object computes a static policy tree for the IPOMDP
for a given horizon. The implementation has been tested on
OnlineValueIteration and OnlineIPBVI solvers
The OnlinePolicyTree represents the beliefs at the same horizon as hash
sets to avoid making extra nodes for repeated beliefs.
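
The deduplication idea relies on equal beliefs hashing identically, so repeated beliefs at the same horizon collapse to one node. A minimal sketch, assuming beliefs are represented as probability maps (the real implementation works over DDs):

```java
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch of per-horizon belief deduplication via a hash set.
public class BeliefDedup {
    public static void main(String[] args) {
        // Two expansions that happen to reach the same belief...
        Map<String, Double> b1 = Map.of("s0", 0.25, "s1", 0.75);
        Map<String, Double> b2 = Map.of("s1", 0.75, "s0", 0.25);
        Map<String, Double> b3 = Map.of("s0", 0.5, "s1", 0.5);

        // ...collapse to one node, because equal maps compare and hash equal.
        Set<Map<String, Double>> horizonBeliefs = new HashSet<>();
        horizonBeliefs.add(b1);
        horizonBeliefs.add(b2);
        horizonBeliefs.add(b3);
        System.out.println(horizonBeliefs.size()); // prints 2
    }
}
```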
The OnlinePolicyTree object can now export the policy to a JSON string.
The getDotString and getJSONString methods give the dot format and JSON
formatted tree.
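
A minimal sketch of what a getDotString-style export can look like; the edge-map representation and the labels here are hypothetical, not the actual OnlinePolicyTree internals:

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch of dot-format export for a policy tree, where nodes
// are actions and edges are labeled with observations.
public class DotExport {

    // Edge list: parent action -> (observation label -> child action)
    public static String getDotString(Map<String, Map<String, String>> edges) {
        StringBuilder dot = new StringBuilder("digraph PolicyTree {\n");
        for (Map.Entry<String, Map<String, String>> node : edges.entrySet())
            for (Map.Entry<String, String> edge : node.getValue().entrySet())
                dot.append(String.format("  \"%s\" -> \"%s\" [label=\"%s\"];%n",
                        node.getKey(), edge.getValue(), edge.getKey()));
        dot.append("}\n");
        return dot.toString();
    }

    public static void main(String[] args) {
        Map<String, Map<String, String>> edges = new LinkedHashMap<>();
        edges.put("LISTEN",
                Map.of("growl-left", "OPEN_RIGHT", "growl-right", "OPEN_LEFT"));
        System.out.println(getDotString(edges));
    }
}
```

The resulting string can be rendered directly with Graphviz (`dot -Tpng`), which is the usual payoff of exporting policies in dot format.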
Fix the test domain file paths in the OpponentModel tests. Also log at
debug level when context is changed
dityas added 25 commits October 22, 2019 13:51
The POMDP solvers previously implemented in the POMDP class itself are
now separate classes compatible with the OfflineSolver API.
The policy can be better represented as trees rather than graphs. Trees
also allow for each node to maintain belief states. The StructuredTree
class provides a base for implementing PolicyTrees and BeliefTrees
The LookAheadTree is no longer used for belief expansion. Instead, the
FullInteractiveBeliefExpansion does the same and is compatible with the
general BeliefExpansion API.
The OpponentModel API built the belief tree offline. This was
inefficient in terms of memory and computation since it included zero
probability nodes in the belief tree. This has been fixed with online
belief tree creation in MJ.
The newly implemented solver objects and belief search classes are made
serializable. Old tests are removed
Most of the implementations in the legacy code are no longer used. These
are commented out to see if the solvers still work
The older solvers implemented in the POMDP class are replaced with the
newer solver implementations
The factored interactive belief built using sumouts might be wrong.
Unfactoring cannot be done by simply multiplying back the factors.
The older unit tests do not assert any values from the output; they
simply print the objects to the terminal. A few broad but strict
assertions have been added to ensure correctness.
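
As an illustration of a broad but strict assertion, here is a hypothetical check that a belief-update result is a valid probability distribution, rather than just printing it; the helper and values are not the project's actual tests:

```java
// Hypothetical sketch: assert a property of the output instead of printing it.
public class BeliefSumTest {

    // A belief must be a valid probability distribution.
    static boolean isNormalized(double[] belief) {
        double sum = 0.0;
        for (double p : belief) {
            if (p < 0.0 || p > 1.0) return false;
            sum += p;
        }
        return Math.abs(sum - 1.0) < 1e-9;
    }

    public static void main(String[] args) {
        double[] updated = {0.25, 0.75};  // stand-in for a belief-update result
        if (!isNormalized(updated))
            throw new AssertionError("belief update returned an invalid distribution");
        System.out.println("ok");
    }
}
```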
The older DDTree functionality was mainly aimed at creating SPUDD
files from the UI. But more recently, it is being used to generate and
persist actual DDs from symbolic perseus.
Remove old API files and refactor relevant parts into other packages.
PolicyNode has been moved into the thinclab.policy package.
@dityas dityas merged commit c602008 into master Oct 31, 2019