I have programmed an OpenAI Gym environment that models a simple single-player game. I need someone to choose the right ML algorithm to solve it, set it up, tune it, and train it on my environment until it consistently achieves a high score.
The job includes reviewing the specs of my environment and modifying them if necessary (mainly the observation/action spaces and the reward system). I can assist with modifying the environment, so you can focus on the agent and training instead of spending your time on that part.
Please carefully review the attached details and scope of work for my project and give me your best quote for the job.