University of Southern California
Research Group

Algorithmic Experimental Game Theory
Dealing with Human Uncertainty in Critical Adversarial Domains

Maintained by James Pita and Rong Yang

 

Current Team

Milind Tambe

Richard John

Fernando Ordonez

Rong Yang

James Pita

Chris Kiekintveld

Mohit Goenka

 

Motivation

Security at major locations of economic or political importance is a key concern around the world, particularly given the threat of terrorism. Unfortunately, in many applications, agents face human adversaries who --- because of their cognitive limitations --- may deviate from expected rational behavior. Not considering these likely deviations when dealing with human adversaries may cause an unacceptable degredation in security measures designed to defend against serious attacks. We work towards understanding these cognitive limitations through experimentation and develop new algorithms to help handle the deviations these limitations create.

Description

Security, commonly defined as the ability to deal with intentional threats from other agents, is a major challenge for agents or agent-teams deployed in adversarial domains. Such adversarial scenarios arise in a wide variety of situations that are becoming increasingly important. Some example cases are agents patrolling to provide perimeter security around critical infrastructure or performing routine security checks.

Such security domains are often modeled as a Stackelberg game, where the defender (leader) commits to a strategy first and the adversary (follower) observers such strategy and response to it. Classical game theory provides a nice framework to solve such game for the defender based on a set of assumptions of adversaries' decision-making, such as the adversaries are perfect rational and they have complete information of the game. However, in many real security domains, defender is facing real human adversaries and most of these assumptions may not hold. Therefore, it is very important to generate more robust strategy for the defender by integrating more realistic model of human decision-making.

Anchoring Bias is when, given no information about the occurrence of a discrete set of events, humans will tend to assign an equal weight to the occurrence of each event (a uniform distribution).

Bounded Rationality: the follower is boundedly rational and may not strictly maximize utility. As a result, the follower may select an epslion-optimal response strategy, i.e. the follower may choose any of the responses within epsilon-reward of her optimal strategy.

Prospect Theory is a noble-prize-winning theory, which describes human decision making as a process of maximizing `prospect'. 

Quantal Response: Quantal Response Equilibrium suggests that instead of strictly maximizing utility, individuals respond stochastically in games: the chance of selecting a non-optimal strategy increases as the cost of such an error decreases.

We have developed COBRA which accounts for the anchoring bias human adversaries may experience when interpreting the probability of defender's mixed-strategy and their bounded rationality in computing the best response. In the experiment with human subjects, COBRA outperforms the perfect rational model (DOBSS) with statistical significance.

We also work toward developing new algorithms to account for the cognitive limitations of human adversaries. We have applied two fundamental theories of human behavior to predict an attacker's decision: Prospect Theory and Quantal Response Equilibrium. In the experiments with human subjects, BRQR (i.e. algorithm to compute best response of defender against quantal response of adversary) outperforms COBRA with statistical significance.

The Guards and The Treasure

This game is developed to simulate the real-world security. In this game, participants are asked to play as attacker. There are several doors that could choosen to attack, behind each of which there would be treasure or a guard. The probability for each door to have a guard behind it as well as the value of the treasure behind the door and the penalty of being caught by the guard are all known. The mission is to choose one door to attack to win as high payoff as possible.

Please follow the link below to start playing the game:
http://www.teamcoreusc.com/Rong/

The Game:

We developed the online game [The Guards and The Treasure] to simulate security senario at the LAX airport. Play the game, help us understand how human behavior in the security game. We are trying to design better strategy to protect the facilities against real attackers. Recently, we have developed three new algorithms which are based on two important models in human behavior literature. We tested our new algorithms using the game with human subjects. Our new algorithms outperformed the current leading contender with statistical sigfinicance.

In total, we we tested 7 different payoff structures. For each payoff structures, we generated defender strategies for the following five different algorithms to play against human subjects. The score (average expected utility for the defender) of each stragies are displayed in the following graph for each payoff structure.

  • BRPT: Best Response to Prospect Theory [.pdf]

  • RPT: Robust Prospect Theory [.pdf]

  • BRQR: Best Response to Quantal Response [.pdf]

  • COBRA: Combined Observability and Bounded Rationality Assumption [.pdf]

  • DOBSS: Decomposed Optimal Bayesian Stackelberg Solver [.pdf]

Overall, BRQR showed best performance; DOBSS and BRPT had the worst performances.

Relevant Papers and Presentations

Document

Download


Effective Solutions for Real-World Stackelberg Games: When Agents Must Deal with Human Uncertainties (AAMAS-2009)
PDF

Using Game Theory for Los Angeles Airport Security
PDF

ARMOR Security for Los Angeles International Airport(AAAI-2008 Intelligent Systems Demonstration)
PDF

Bayesian Stackelberg Games and their Application for Security at Los Angeles International Airport (SIGecom Exchanges-2008)
PDF

Robust Solutions in Stackelberg Games: Addressing Boundedly Rational Human Preference Models
PDF

Robust Solutions to Stackelberg Games: Addressing Bounded Rationality and Limited Observations in Human Cognition (AIJ 2010)
PDF

Improved Computational Models of Human Behavior in Security Games (AAMAS-2011)
PDF

Improving Resource Allocation Strategy Against Human Adversaries in Security Games (IJCAI-2011)
PDF

If you have any questions about the contents of this page please contact James Pita ( jpita@usc.edu )