Long Ji Lin: Self-improvement Based on Reinforcement Learning, Planning and Teaching. ML 1991: 323-327