Natural Language Object Retrieval
Ground the object via Natural Language (referring expression).
Video Object Detection using Faster R-CNN
Training faster rcnn on other dataset, such as ImageNet, and use tracker to track frame-by-frame.
Human-level control through deep reinforcement learning
Create an agent that is capable of controlling or playing different kind of games.