Find Jobs
Hire Freelancers

Deep Q network

$30-250 USD

I përfunduar
Postuar over 1 year ago

$30-250 USD

Paguhet në dorëzim
The aim of this project is to develop a reinforcement learning agent that aims at the target container and pours all the objects into it without spillage. You are provided a scene with two cups and two cubes. If you run the code, the cup should start rotating at a velocity randomly chosen from a predefined set (thee velocities are more diverse than in part two). Use the provided functions to move the pouring cup horizontally while rotating it. Action Space: The source cup can be moved along the X-axis by selecting an action from a predefined set [-2, -1,0, 1, 2]. You don’t have to move the cup in any other axis. State Space: The state space involves: The position of two cubes The position of source cup Velocities of the two cubes(optional) Task: the Q-Learning algorithm to develop the Q-table. For most problems, it is impractical to represent the Q-function as a table containing values for each combination of state and action. Because of that, in this part, you have to train a Deep Q-Network to estimate the Q-values. Submission: Submit your training and testing code Submit a video that shows cubes are falling to the target after you run your code Submit a .txt file that contain the log of the training, including the accumulated rewards for each episode, TD errors for each DQN update Submit a .txt file with the following result: How many times all the cubes fell successfully to the target in 100 trials if your state space consists only cup position? How many times all the cubes fell successfully to the target in 100 trials if your state space consists both cup position and two cube’s position?
ID e Projektit: 35343335

Rreth projektit

4 propozime
Projekt në distancë
Aktive 1 yr ago

Po kërkoni të fitoni para?

Përfitimet e ofertës për Freelancer

Vendosni buxhetin dhe afatin tuaj
Paguhuni për punën tuaj
Përshkruani propozimin tuaj
Është falas të regjistrohesh dhe të bësh oferta për punë
I dhënë për:
Avatari i Përdoruesit
Hi, How are you? Very happy to bid your project because my skills are fitted in your project. I have 8 years experience in Machine learning ,Deep learning ,NLP and AI. I am very familiar deep Q learning. If you send the message , we can discuss about the project more. Thanks. Loyid.
$100 USD në 2 ditë
4,7 (53 përshtypje)
6,9
6,9
4 freelancers are bidding on average $175 USD for this job
Avatari i Përdoruesit
Greetings I saw your project and as an expert in Python/ML/AI/DL I am sure I can do your task. I have previously worked on various state of the art deep learning projects which includes making models for text, music instruments, videos, stock market which makes me perfect person for your task. Feel free to contact me so we can discuss in detail about your project. Best Regards, Felberg B.
$200 USD në 7 ditë
4,9 (16 përshtypje)
3,9
3,9
Avatari i Përdoruesit
Hello, I have rich experience in Python coding for Q-learning. I have read all your explanations carefully and fully understand your requirements. So I am sure I can give you correct and good results. I would appreciate it if you could contact me soon and share your project details. Thank you.
$300 USD në 3 ditë
0,0 (0 përshtypje)
0,0
0,0
Avatari i Përdoruesit
Hi, I have master's level qualification in AI/ML/DL and 5 years of industry experience. I have read your description. You need to build a Q-learning NN for your pouring agent. I can help you with this project. Let's discuss deadlines over chat. Thanks,
$100 USD në 7 ditë
0,0 (0 përshtypje)
0,0
0,0

Rreth klientit

Flamuri i UNITED STATES
Tampa, United States
0,0
0
Mënyra e pagesës u verifikua
Anëtar që nga sht 18, 2022

Verifikimi i klientit

Faleminderit! Ne ju kemi dërguar me email një lidhje për të kërkuar kredinë tuaj falas.
Ndodhi një gabim gjatë dërgimit të email-it tuaj. Ju lutemi provoni përsëri.
Përdorues të regjistruar Punë të postuara
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Po ngarkohet shikimi paraprak
Leja u dha për Geolocation.
Seanca e hyrjes ka skaduar dhe ke dalë. Hyr sërish.