Inverse Optimal Control with Suboptimality Loss

The project is an implementation of the paper Learning for Control: An Inverse Optimization Approach, co-authored by Syed Adnan Akhtar, Arman Sharifi Kolarijani, and Peyman Mohajerin Esfahani at TU Delft, Netherlands. Please refer to the paper for more details

Learning for Control: An Inverse Optimization Approach [Paper]
Demonstration Video

The repo has been tested to be working on MATLAB 2018-2020 (Ubuntu 16.04 & Windows 10).

The python code is currently under construction.

The code requires installation of the following

MOSEK - A toolbox to solve LPs, QPs, SOCPs, SDPs and MIPs. Please find more information here.
YALMIP - A toolbox to provide an interfact to the solver. Please find more information here.
MATLAB.

How-to

This section explains how to use the code to learn a quadratic cost function from a demonstration data and its constraints. The code can be readilly executed due to the presence of an example data.

Clone the package locally on your computer

git clone https://github.com/syedadnanakhtar/inverseLearning_subOpt
Create a data object

d = data(fullfile(path,filename));

where path and filename are character array of the path and filename respectively.
Assign the state data x and the action data u

d.x = <State data>;

d.u = <Action data>;

Note that the data must be in a row format, where each row corresponds to the data pointing to a single time instance. The dimensions of the state and action data can be conveniently checked by d.nStates and d.nActions.
Define a feature list. For eg, if you want the features to be constructed as [x, x-a], where a is some constant vector, then use the following command

d.featureList = {'obj.x' 'obj.x - a'};

Note that x is prefixed with an obj. If you need more than one constant vectors in the feature list, use a{1}, a{2]...
Construct features
d.constructFeatures();
Define constraints The constraints are encoded in the structure const which has three fields: const.M,const.W, and const.L. The constraints take the form Mu <= Wx + L. Please refer to the paper for more info.
Learn the cost function theta by running the command

[obj, theta] = learnCostSubOpt(d,const);

The function accepts the data object d and constraints const, and returns the total objective function value (cost), as well as the cost function theta.
Forward simulate with the learned cost function

[xSeq, objective] = simulate(x0,theta,d.featureList,const,simLength,a);

where x0 is the initial state, simLength is the simulated horizon. The rest of the variables have their usual meanings.

Example Dataset

The project contains an example data of a human that is reaching for a goal object. The data has 8 joint angles of human upper body including lumbar-extension, lumbar bending, lumbar-rotation, shoulder-adduction, shoulder-rotation, shoulder-flexion, elbow-flexion, and pronation-supination.

A simple kinematical state space is considered for learning : x(t+1) = Ax(t) + Bu(t)

The dataset containts the states x (joint angles), as well as u (change in joint angles).

Contact and Citation

Should you come across any bug in my code or have any question, please feel free to send me an email at syed.akhtar[at]tum[dot]de

If you find this project helpful, please consider citing our paper.

@ARTICLE{9336679,
  author={Akhtar, Syed Adnan and Kolarijani, Arman Sharifi and {Mohajerin Esfahani}, Peyman},
  journal={IEEE Control Systems Letters}, 
  title={Learning for Control: An Inverse Optimization Approach}, 
  year={2022},
  volume={6},
  number={},
  pages={187-192},
  doi={10.1109/LCSYS.2021.3050305}}

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
python		python
.gitignore		.gitignore
README.md		README.md
RM1_Healthy_0001.csv		RM1_Healthy_0001.csv
data.m		data.m
learnCostSubOpt.asv		learnCostSubOpt.asv
learnCostSubOpt.m		learnCostSubOpt.m
main.asv		main.asv
main.m		main.m
simulate.asv		simulate.asv
simulate.m		simulate.m
valueOfTraj.m		valueOfTraj.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Inverse Optimal Control with Suboptimality Loss

How-to

Example Dataset

Contact and Citation

About

Uh oh!

Releases

Packages

Languages

syedadnanakhtar/inverseLearning-subOpt

Folders and files

Latest commit

History

Repository files navigation

Inverse Optimal Control with Suboptimality Loss

How-to

Example Dataset

Contact and Citation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages