-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reproducing BC baseline results on soft body envs #5
Comments
I recommend modify the demo replay buffer config file in this case:
i.e. thru For BC, there is only 1 replay buffer, so replace the above Note that for non-BC algorithms, |
I'm running on a machine with a 3090 and 64GB RAM, so I lowered to python maniskill2_learn/apis/run_rl.py configs/brl/bc/pointnet_soft_body.py \
--work-dir workdir/ --gpu-ids 0 \
--cfg-options "env_cfg.env_name=Pinch-v0" "env_cfg.obs_mode=pointcloud" "env_cfg.n_points=1200" "env_cfg.obs_frame=ee" \
"env_cfg.reward_mode=dense" \
"env_cfg.control_mode=pd_ee_delta_pose" \
"replay_cfg.buffer_filenames=../ManiSkill2/demos/soft_body_envs/Pinch-v0/trajectory.none.pd_ee_delta_pose_pointcloud.h5" \
"replay_cfg.capacity=5000" "replay_cfg.num_samples=-1" "replay_cfg.cache_size=5000" \
"replay_cfg.dynamic_loading=True" "replay_cfg.synchronized=False" \
"eval_cfg.num=100" "eval_cfg.save_traj=False" "eval_cfg.save_video=True" \
"train_cfg.n_eval=50000" "train_cfg.total_steps=50000" "train_cfg.n_checkpoint=50000" "train_cfg.n_updates=500" Still unable to train pointcloud BC baseline. GPU utilization shows 0%, occasionally increasing to 3-8%. Attached the log. |
Does it report anything if you set If it reports, then it means it's training, it's just really slow due to file io. BTW Is the demo stored on ssd? |
Also you can do some custom processing in env wrappers and implement new architectures if you implement your own approach, since |
Yes, the demos are on root ssd. Seems to start training, but grad_norm becomes 0 pretty quickly. True for
|
Was able to reproduce it for point cloud BC. Though for RGB-D BC, the gradient does not fall to zero. (RGB-D BC also requires more memory). |
I'm having trouble reproducing results on Pinch-v0. I was able to get Write-v0 and Hang-v0 working though.
Here's the commands I'm running: demo conversion with general_soft_body_envs.txt and
scripts/example_training/bc_soft_body_pointcloud.sh
:I've also tried with
env_cfg.control_mode=pd_ee_target_delta_pose
How much memory would be needed to run with
replay_cfg.num_samples=-1
? Or is there a better way of training with all 1500+ demos usingreplay_cfg.dynamic_loading=True
?The text was updated successfully, but these errors were encountered: