About Projects CV


photo

cuijiaxun AT utexas DOT edu

Jiaxun Cui (崔佳勋)

cuijiaxun AT utexas DOT edu / GitHub / CV

I'm a third-year Electrical and Computer Engineering PhD student at the University of Texas at Austin, with my ambitious goal of building a general AI for human benefits. I completed my undergraduate study at Shanghai Jiao Tong University with a major in Mechanical Engineering(Robotics). Currently, I am working on an on-going project at the Learning Agent Research Group(LARG), which is led by professor Peter Stone. Besides research, I like playing Tennis and Rap Music.

Research

My current research topics include:

  • Multi-agent Reinforcement Learning
  • Game Theory
  • Robotics

Projects

Cooperative Perception for Decision Making in Autonomous Driving [paper] [code]

There are dangerous scenarios where single optical based sensor are occluded. We develop a series of benchmark scenarios in the CARLA simulator where cooperative perception can make big difference in the decisions made by the autonomous vehicles.

Meta-Learning for Multiround Chinese Standard Mahjong Game [paper] [code]

Chinese Standard Mahjong Game is a 4-player zero-sum imperfect information game. While in the formal competitions of the mahjong game, a player will compete with the other opponents for a fixed number of rounds, and then the relative cumulative reward/points are compared. It is no longer a zero-sum game and the agents only care about a relative better performance than other players. This competition mechanism induces multi-round tactics. For example, an agent may play conservatively to secure its ranking or play riskly when it is left behind.

Scalable Multiagent Driving Policies For Reducing Traffic Congestion, AAMAS 2021 [paper] [code]

In the mordern highway system, the issue of conjestions brings inefficiency in traffic throughput and also cause waste of energy consumption. In this project, we employed deep Reinforcement Learning method to alleviate stop-and-go waves arosed from merging. We explored two aspects of multi-agent learning: Centralized and Distributed. Evaluting only average speed and throughput efficiency could be toublesome because the two metrics may not be improved at the same time. Thus, we developed a new metric: time delay for a certain number of vehicles to enter and exit the networks.