It's UWAweek 19

help3001

This forum is provided to promote discussion amongst students enrolled in CITS3001 Algorithms, Agents and Artificial Intelligence.

Please consider offering answers and suggestions to help other students! And if you fix a problem by following a suggestion here, it would be great if other interested students could see a short "Great, fixed it!"  followup message.

How do I ask a good question?
Displaying selected article
Showing 1 of 292 articles.
Currently 6 other people reading this forum.


SVG not supported

Login to reply

👍?
helpful
2:19pm Thu 24th Aug, Hai L.

A few questions on the project: 1. Do we get more marks for more complex agents? Or does it not make a difference as long as we implement and analyse them properly? It seems to me that a reinforcement learning agent would take a lot more in terms of code and understanding compared to a rule-based agent. 2. Is it actually feasible to make a normal Q-learning agent that uses a Q-table for this? The state space of the display is massive. I am attempting to simplify the contents of the display instead of using all the pixels but working based on the full dimensions of the screen still looks to be too big. I was thinking of limiting Mario's vision further, but I just want to know if this has been implemented successfully before, since everything I've read online just tells me Q-tables are no good for mario and to use DQN instead. 3. How much of the implementation can we leave to outside libraries? If, say, I end up using neural networks, the Keras library can do a lot of the heavy-lifting while you mainly just feed it data and run training (I think?). It seems to simplify things a bit too much. Tell me if I have any misconceptions about the agents, I know we have a reinforcement learning lecture further down the line but we have a basic rule-based agent going so I want to start planning for the second.

The University of Western Australia

Computer Science and Software Engineering

CRICOS Code: 00126G
Written by [email protected]
Powered by history
Feedback always welcome - it makes our software better!
Last modified  8:08AM Aug 25 2024
Privacy policy