AntMuJoCoEnv-v0 contains many unnecessary states. #66

seolhokim · 2021-03-27T12:39:58Z

env = gym.make("AntMuJoCoEnv-v0")
env.reset()
for i in range(10):
    state,_,_,_ = env.step(env.action_space.sample())
print(state)

array([ 0.48600267, -0.02747473, -0.0488695 ,  0.55320414,  0.76180714,
       -0.46062074,  0.7321935 ,  0.43551934, -0.59654198,  0.01406484,
       -0.9421499 ,  0.35980088,  0.96942428,  0.34243858,  0.07390555,
       -0.01478774, -0.01189533,  0.00519861,  0.10619358, -0.02251298,
        0.04225551,  0.03821672, -0.03983905,  0.0030749 , -0.06877575,
        0.03366648,  0.06564061,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ,  0.        ,  0.        ,  0.        ,  0.        ,
        0.        ])

I think we can remove many states after a certain index, what do you think?

The text was updated successfully, but these errors were encountered:

sash-a · 2021-03-28T20:28:46Z

I was just about to ask about the root cause of this issue. This is the line:

pybullet-gym/pybulletgym/envs/mujoco/robots/locomotors/ant.py

Line 21 in ec9e874

cfrc_ext = np.zeros((14, 6)) # shape (14, 6) # TODO: FIND cfrc_ext

I'm reasonably familiar with pybullet, but not at all with mujoco so if someone could tell me what cfrc_ext is actually referring to I wouldn't mind having a dig round to see if I could find it (no promises that I will find it though).

sash-a · 2021-03-28T20:44:06Z

Ok a quick update cfrc_ext refers to the contact points, I would imagine that it is in there for ground contact points, but the (14, 6) shape doesn't really make sense then as you would expect (x, 4) or (4, x) shape for the 4 feet. According to this thread those values are often, but not always 0 in mujoco also. So @seolhokim I wouldn't worry too much about it, it should definitely be left in so that the shape matches the mujoco envs, if you dig through the code you will find quite a lot of areas where the states values are simply set to 0 to maintain the same shape as the mujoco envs.

According to this thread it does seem possible to get the forces, at least in C, hopefully these methods are callable in python also. @benelot are you accepting pull requests? If so I will try and find something that will work. Also any idea as to why the shape is (14, 6)? My best guess is that the ant has 14 body parts and the contact force vector is of length 6, but I'm really not sure.

benelot · 2021-03-28T20:44:51Z

When building all the envs, I firstly wanted them to comply with the observation sizes of the original mujoco envs. This is why I added 0s to the observation values I know to make it the same length. Then I intended to find all the missing observation values in mujoco to find the corresponding ones in pybullet, but I could not find any description of those at all!

Check it out here: https://github.com/openai/gym/blob/c8a659369d98706b3c98b84b80a34a832bbdc6c0/gym/envs/mujoco/ant.py#L35

If anybody finds out what crfc_ext is in mujoco and what it corresponds to in pybullet, I am happy to help you work all this out.

benelot · 2021-03-28T20:45:26Z

I absolutely accept pull requests.

benelot · 2021-03-28T20:47:27Z

Contact points? Maybe then it is similar to the ant in roboschool that I ported here as well. There is also some contact point related stuff there if I remember well.

benelot · 2021-03-28T20:48:58Z

Btw, these missing observations are all over the place in the mujoco code.If you are eager, talented and interested to figure out what they are, I am really happy to help if I can, as I somehow was not able to figure it out at the time.

sash-a · 2021-03-28T21:10:34Z

Sure I'll definitely have a dig around for these contact points, can't commit to all the other missing values, but if I have some extra time I will look for them.

I do see foot contact points here

pybullet-gym/pybulletgym/envs/mujoco/envs/locomotion/walker_base_env.py

Lines 70 to 78 in ec9e874

    
           for i,f in enumerate(self.robot.feet):  # TODO: Maybe calculating feet contacts could be done within the robot code 
        
               contact_ids = set((x[2], x[4]) for x in f.contact_list()) 
        
               # print("CONTACT OF '%d' WITH %d" % (contact_ids, ",".join(contact_names)) ) 
        
               if self.ground_ids & contact_ids: 
        
                   # see Issue 63: https://github.com/openai/roboschool/issues/63 
        
                   # feet_collision_cost += self.foot_collision_cost 
        
                   self.robot.feet_contact[i] = 1.0 
        
               else: 
        
                   self.robot.feet_contact[i] = 0.0

But as this is only for 4 feet I don't think it would give us the correct shape. I think I would need to have a proper dig around the mujoco code to see why the array is the shape that it is, which I will do tomorrow or the next day :)

seolhokim · 2021-03-30T07:07:26Z

@benelot

111-dim observation space

z (height) of the Torso -> 1

orientation (quarternion x,y,z,w) of the Torso -> 4

8 Joiint angles -> 8

3-dim directional velocity and 3-dim angular velocity -> 3+3=6

8 Joint velocity -> 8

External forces (force x,y,z + torque x,y,z) applied to the CoM of each link (Ant has 14 links: ground+torso+12(3links for 4legs) for legs -> (3+3)*(14)=84

I found this in https://enginius.tistory.com/734

Is it helpful to fill in the exact state?

sash-a · 2021-03-30T08:10:06Z

External forces (force x,y,z + torque x,y,z) applied to the CoM of each link (Ant has 14 links: ground+torso+12(3links for 4legs) for legs -> (3+3)*(14)=84

@seolhokim thanks, this is super helpful! I should have some time later today to see if I can work out how to get those external forces in pybullet :)

seolhokim · 2021-03-30T10:04:58Z

@sash-a Oh I forgot to tag you. I believe you can do that! Thank you! :)

benelot · 2021-03-30T12:12:52Z

Sounds great. In a future refactoring, these foot contact calculations could then go into a "mujoco layer" that is able to generate appropriate observations for all types of robots in environments such that replacing mujoco with open source is more achievable in the future.

@seolhokim The initial intent of this project was to replace Mujoco implemented openai gym envs with open source software. Therefore I try to reproduce everything this entails, thus also the exact/approximate observational state. We will not get to the level where we get the exact same responses to the same actions taken in both mujoco or pybullet, but I hope we get as close as to let mujoco trained agents run in pybullet and they achieve similar performance.

sash-a · 2021-04-01T17:04:54Z

I'm pretty sure I've found what we're looking for although it doesn't look like it's going to work. According to the docs we would need a torque sensor on the joints, which I think means we would have to modify the xml assets and I'm not too sure if that's a good idea.

The method is getJointState and one of it's outputs is jointReactionForces which according to the docs is "list of 6 floats | There are the joint reaction forces, if a torque sensor is enabled for this joint. Without torque sensor, it is [0,0,0,0,0,0]". I'm pretty sure this is what we are looking for, however I never found this to produce a value other than 0 which means that the joints don't have torque sensors. I think given this thread we can leave them as is, mujoco seems to have a similar problem.

If you want to have a look for yourself I put up a quick way to use this method in this commit ca7ab78

GPaolo · 2021-05-18T14:57:26Z

The crfc_ext are the contact forces. But Mujoco 2.0 has issues with those and is just returning zeros (openai/gym#1541)

This was referenced Apr 1, 2021

Fixed Ant never sending the done signal sash-a/pybullet-gym#1

Open

Fixed Ant never sending the done signal #67

Open

Rohan138 mentioned this issue Oct 24, 2021

Update on Plans for the MuJoCo, Robotics and Box2d Environments and the Status of Brax and Hardware Accelerated Environments in Gym openai/gym#2456

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AntMuJoCoEnv-v0 contains many unnecessary states. #66

AntMuJoCoEnv-v0 contains many unnecessary states. #66

seolhokim commented Mar 27, 2021 •

edited

Loading

sash-a commented Mar 28, 2021

sash-a commented Mar 28, 2021

benelot commented Mar 28, 2021

benelot commented Mar 28, 2021

benelot commented Mar 28, 2021

benelot commented Mar 28, 2021

sash-a commented Mar 28, 2021

seolhokim commented Mar 30, 2021

sash-a commented Mar 30, 2021

seolhokim commented Mar 30, 2021

benelot commented Mar 30, 2021

sash-a commented Apr 1, 2021

GPaolo commented May 18, 2021

AntMuJoCoEnv-v0 contains many unnecessary states. #66

AntMuJoCoEnv-v0 contains many unnecessary states. #66

Comments

seolhokim commented Mar 27, 2021 • edited Loading

sash-a commented Mar 28, 2021

sash-a commented Mar 28, 2021

benelot commented Mar 28, 2021

benelot commented Mar 28, 2021

benelot commented Mar 28, 2021

benelot commented Mar 28, 2021

sash-a commented Mar 28, 2021

seolhokim commented Mar 30, 2021

sash-a commented Mar 30, 2021

seolhokim commented Mar 30, 2021

benelot commented Mar 30, 2021

sash-a commented Apr 1, 2021

GPaolo commented May 18, 2021

seolhokim commented Mar 27, 2021 •

edited

Loading