Find Jobs
Hire Freelancers

Implementation of Q-learning using n*n matrix

$10-30 USD

Opravljeno
Objavljeno pred približno 4 leti

$10-30 USD

Plačilo ob dostavi
Here is the portal for uploading your HW: Implementation of Q-learning. A short rubric: Total points : 20 - Correct initialization ( proper n*n Q-matrix, R matrix or vector, etc. according to your implementation ): 3 points - Correct transition function or matrix to get the next state given the current state and the action: 3 points - Correct function or code block for choosing a random and valid action, or similar. 3 points - Implement episode iterations, calculate q value and update q matrix correctly : 6 points - Return the correct path of reaching the goal state given Q matrix : 5 points (this means you need to create a concrete gridworld using your implementation and find the solution) Ps: you can set learning rate alpha equal to 1 so as to use the simplest form of q equation : Q(s,a) <-- R(s,a) + gamma* max ( Q'(s',a')) in the homework. Extra Credit: - Show the update of q matrix every N episodes ( You choose N ) : 1 points - Set alpha between (0,1) : 2 points - Implement a simple GUI which shows the movement of agent or the change of policy: 2 points
ID projekta: 23934428

Več o projektu

1 ponudba
Projekt na daljavo
Aktivno pred 4 leti

Želite zaslužiti?

Prednosti oddajanja ponudb na Freelancerju

Nastavite svoj proračun in časovni okvir
Prejmite plačilo za svoje delo
Povzetek predloga
Registracija in oddajanje ponudb sta brezplačna
Dodeljeno:
Avatar uporabnika
Hello , I am Ibad and i my self a software engineer and completed my Mphil in software engg. So my core is Mathematics and Artificial intelligence. I will completely do this, just let me know when you want it. I am waiting
$94 USD v 3 dneh
5,0 (1 ocena)
1,7
1,7

O stranki

Zastava UNITED STATES
San Bernardino, United States
5,0
1
Plačilna metoda je verificirana
Član(ica) od nov. 3, 2019

Verifikacija stranke

Hvala! Po e-pošti smo vam poslali povezavo za prevzem brezplačnega dobropisa.
Pri pošiljanju vašega e-sporočila je šlo nekaj narobe. Poskusite znova.
Registrirani uporabniki Skupaj objavljenih del
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Nalaganje predogleda
Geolociranje je bilo dovoljeno.
Vaša prijavna seja je potekla, zato ste bili odjavljeni. Prosimo, da se znova prijavite.