-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Doc] Better doc for TensorDictModuleBase #1226
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This was referenced Feb 20, 2025
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 39.9240μs | 20.8727μs | 47.9095 KOps/s | 47.7438 KOps/s | |
test_plain_set_stack_nested | 48.3400μs | 20.9243μs | 47.7912 KOps/s | 46.7661 KOps/s | |
test_plain_set_nested_inplace | 76.5020μs | 22.3951μs | 44.6526 KOps/s | 43.9946 KOps/s | |
test_plain_set_stack_nested_inplace | 65.0610μs | 22.4626μs | 44.5184 KOps/s | 43.7593 KOps/s | |
test_items | 20.1670μs | 4.2823μs | 233.5190 KOps/s | 233.3558 KOps/s | |
test_items_nested | 0.4895ms | 0.4046ms | 2.4716 KOps/s | 2.4612 KOps/s | |
test_items_nested_locked | 0.4799ms | 0.4051ms | 2.4685 KOps/s | 2.4649 KOps/s | |
test_items_nested_leaf | 0.1490ms | 76.4831μs | 13.0748 KOps/s | 12.8638 KOps/s | |
test_items_stack_nested | 0.5262ms | 0.4069ms | 2.4576 KOps/s | 2.4390 KOps/s | |
test_items_stack_nested_leaf | 0.1377ms | 78.8611μs | 12.6805 KOps/s | 12.5477 KOps/s | |
test_items_stack_nested_locked | 0.5426ms | 0.4036ms | 2.4775 KOps/s | 2.4570 KOps/s | |
test_keys | 24.3150μs | 3.4733μs | 287.9094 KOps/s | 287.9055 KOps/s | |
test_keys_nested | 0.2540ms | 0.1634ms | 6.1218 KOps/s | 6.0139 KOps/s | |
test_keys_nested_locked | 1.7066ms | 0.1702ms | 5.8762 KOps/s | 5.9141 KOps/s | |
test_keys_nested_leaf | 0.2240ms | 0.1428ms | 7.0024 KOps/s | 6.9833 KOps/s | |
test_keys_stack_nested | 0.3385ms | 0.1597ms | 6.2618 KOps/s | 6.1306 KOps/s | |
test_keys_stack_nested_leaf | 0.2217ms | 0.1372ms | 7.2891 KOps/s | 7.1062 KOps/s | |
test_keys_stack_nested_locked | 0.3427ms | 0.1696ms | 5.8949 KOps/s | 5.8834 KOps/s | |
test_values | 4.4162μs | 1.1202μs | 892.7061 KOps/s | 965.1508 KOps/s | |
test_values_nested | 0.1178ms | 61.6038μs | 16.2328 KOps/s | 15.9374 KOps/s | |
test_values_nested_locked | 0.1034ms | 61.7860μs | 16.1849 KOps/s | 16.1703 KOps/s | |
test_values_nested_leaf | 0.1314ms | 70.9499μs | 14.0945 KOps/s | 13.9884 KOps/s | |
test_values_stack_nested | 0.1628ms | 64.0529μs | 15.6121 KOps/s | 15.0541 KOps/s | |
test_values_stack_nested_leaf | 0.1262ms | 68.7782μs | 14.5395 KOps/s | 14.0761 KOps/s | |
test_values_stack_nested_locked | 0.1190ms | 64.0544μs | 15.6117 KOps/s | 15.6655 KOps/s | |
test_membership | 4.8819μs | 0.7032μs | 1.4220 MOps/s | 1.1602 MOps/s | |
test_membership_nested | 22.4620μs | 2.9390μs | 340.2562 KOps/s | 345.6892 KOps/s | |
test_membership_nested_leaf | 51.3210μs | 2.9941μs | 333.9863 KOps/s | 343.6942 KOps/s | |
test_membership_stacked_nested | 68.5170μs | 2.9363μs | 340.5602 KOps/s | 345.6537 KOps/s | |
test_membership_stacked_nested_leaf | 25.6680μs | 2.9302μs | 341.2741 KOps/s | 347.0577 KOps/s | |
test_membership_nested_last | 48.8700μs | 4.3434μs | 230.2335 KOps/s | 229.3810 KOps/s | |
test_membership_nested_leaf_last | 30.6270μs | 4.3921μs | 227.6814 KOps/s | 229.4094 KOps/s | |
test_membership_stacked_nested_last | 39.4830μs | 13.3592μs | 74.8548 KOps/s | 228.3242 KOps/s | |
test_membership_stacked_nested_leaf_last | 48.0790μs | 13.3786μs | 74.7461 KOps/s | 227.6715 KOps/s | |
test_nested_getleaf | 34.2340μs | 10.4849μs | 95.3753 KOps/s | 93.5418 KOps/s | |
test_nested_get | 35.9370μs | 9.9463μs | 100.5401 KOps/s | 99.1623 KOps/s | |
test_stacked_getleaf | 30.1060μs | 10.4578μs | 95.6228 KOps/s | 94.2740 KOps/s | |
test_stacked_get | 26.7800μs | 9.9016μs | 100.9937 KOps/s | 100.3321 KOps/s | |
test_nested_getitemleaf | 32.3900μs | 11.0819μs | 90.2375 KOps/s | 88.9134 KOps/s | |
test_nested_getitem | 30.3160μs | 10.5010μs | 95.2290 KOps/s | 92.3356 KOps/s | |
test_stacked_getitemleaf | 43.1100μs | 10.8417μs | 92.2364 KOps/s | 88.3297 KOps/s | |
test_stacked_getitem | 29.5640μs | 10.5123μs | 95.1270 KOps/s | 92.5421 KOps/s | |
test_lock_nested | 0.7595ms | 0.4149ms | 2.4103 KOps/s | 2.3953 KOps/s | |
test_lock_stack_nested | 0.7159ms | 0.4106ms | 2.4357 KOps/s | 2.3388 KOps/s | |
test_unlock_nested | 0.6650ms | 0.3349ms | 2.9861 KOps/s | 3.0001 KOps/s | |
test_unlock_stack_nested | 0.4459ms | 0.3286ms | 3.0431 KOps/s | 2.9486 KOps/s | |
test_flatten_speed | 0.2022ms | 0.1004ms | 9.9645 KOps/s | 9.8479 KOps/s | |
test_unflatten_speed | 0.6196ms | 0.5209ms | 1.9199 KOps/s | 1.9402 KOps/s | |
test_common_ops | 6.5000ms | 0.8144ms | 1.2279 KOps/s | 1.2406 KOps/s | |
test_creation | 24.1340μs | 2.5614μs | 390.4050 KOps/s | 407.2768 KOps/s | |
test_creation_empty | 37.5890μs | 12.6086μs | 79.3112 KOps/s | 80.8793 KOps/s | |
test_creation_nested_1 | 34.6650μs | 15.9835μs | 62.5644 KOps/s | 65.1039 KOps/s | |
test_creation_nested_2 | 50.9450μs | 20.4859μs | 48.8141 KOps/s | 48.6440 KOps/s | |
test_clone | 73.7370μs | 13.5711μs | 73.6860 KOps/s | 74.4786 KOps/s | |
test_getitem[int] | 0.7814ms | 13.0103μs | 76.8622 KOps/s | 79.2740 KOps/s | |
test_getitem[slice_int] | 0.1385ms | 24.4700μs | 40.8663 KOps/s | 40.4392 KOps/s | |
test_getitem[range] | 0.1587ms | 49.0007μs | 20.4079 KOps/s | 21.0814 KOps/s | |
test_getitem[tuple] | 0.1225ms | 20.3893μs | 49.0452 KOps/s | 49.0680 KOps/s | |
test_getitem[list] | 0.3071ms | 43.8584μs | 22.8007 KOps/s | 23.3015 KOps/s | |
test_setitem_dim[int] | 41.5870μs | 24.9376μs | 40.1002 KOps/s | 40.2654 KOps/s | |
test_setitem_dim[slice_int] | 90.7880μs | 48.9017μs | 20.4492 KOps/s | 19.8601 KOps/s | |
test_setitem_dim[range] | 0.1221ms | 73.6237μs | 13.5826 KOps/s | 13.6629 KOps/s | |
test_setitem_dim[tuple] | 74.3890μs | 39.5768μs | 25.2673 KOps/s | 25.1400 KOps/s | |
test_setitem | 67.5650μs | 20.7777μs | 48.1285 KOps/s | 47.5932 KOps/s | |
test_set | 71.4120μs | 20.4261μs | 48.9570 KOps/s | 49.0297 KOps/s | |
test_set_shared | 0.3028ms | 0.1782ms | 5.6114 KOps/s | 5.6354 KOps/s | |
test_update | 81.2410μs | 23.7216μs | 42.1556 KOps/s | 41.8660 KOps/s | |
test_update_nested | 0.1205ms | 34.1844μs | 29.2531 KOps/s | 29.0198 KOps/s | |
test_update__nested | 0.5775ms | 33.1496μs | 30.1663 KOps/s | 29.6423 KOps/s | |
test_set_nested | 73.1260μs | 22.0995μs | 45.2498 KOps/s | 44.2157 KOps/s | |
test_set_nested_new | 72.4150μs | 27.4971μs | 36.3675 KOps/s | 36.8868 KOps/s | |
test_select | 0.1214ms | 44.1382μs | 22.6561 KOps/s | 23.7641 KOps/s | |
test_select_nested | 0.2888ms | 63.5553μs | 15.7343 KOps/s | 15.9682 KOps/s | |
test_exclude_nested | 0.1542ms | 80.9604μs | 12.3517 KOps/s | 12.4637 KOps/s | |
test_empty[True] | 0.5786ms | 0.4044ms | 2.4725 KOps/s | 2.4701 KOps/s | |
test_empty[False] | 6.4070μs | 1.3922μs | 718.2780 KOps/s | 735.1953 KOps/s | |
test_unbind_speed | 0.4144ms | 0.2686ms | 3.7236 KOps/s | 3.7692 KOps/s | |
test_unbind_speed_stack0 | 0.4274ms | 0.2580ms | 3.8759 KOps/s | 3.7980 KOps/s | |
test_unbind_speed_stack1 | 0.8191ms | 0.6261ms | 1.5973 KOps/s | 1.2800 KOps/s | |
test_split | 96.1401ms | 1.7534ms | 570.3352 Ops/s | 634.4184 Ops/s | |
test_chunk | 99.4976ms | 1.7738ms | 563.7691 Ops/s | 529.6098 Ops/s | |
test_consolidate_njt[False-None] | 8.3284ms | 8.0658ms | 123.9809 Ops/s | 120.1684 Ops/s | |
test_creation[device0] | 4.1357ms | 92.4158μs | 10.8207 KOps/s | 10.6392 KOps/s | |
test_creation_from_tensor | 0.2315ms | 92.4880μs | 10.8122 KOps/s | 10.4434 KOps/s | |
test_add_one[memmap_tensor0] | 0.1470ms | 4.7812μs | 209.1538 KOps/s | 204.7822 KOps/s | |
test_contiguous[memmap_tensor0] | 21.5500μs | 0.5131μs | 1.9490 MOps/s | 1.9292 MOps/s | |
test_stack[memmap_tensor0] | 23.3130μs | 3.4549μs | 289.4398 KOps/s | 296.2885 KOps/s | |
test_memmaptd_index | 1.3071ms | 0.2280ms | 4.3860 KOps/s | 4.3756 KOps/s | |
test_memmaptd_index_astensor | 0.6244ms | 0.3140ms | 3.1844 KOps/s | 3.1828 KOps/s | |
test_memmaptd_index_op | 0.7799ms | 0.5906ms | 1.6933 KOps/s | 1.6785 KOps/s | |
test_serialize_model | 0.2181s | 0.1345s | 7.4366 Ops/s | 8.6574 Ops/s | |
test_serialize_model_pickle | 0.4467s | 0.3867s | 2.5858 Ops/s | 2.5047 Ops/s | |
test_serialize_weights | 0.1210s | 0.1130s | 8.8507 Ops/s | 8.8589 Ops/s | |
test_serialize_weights_returnearly | 0.1729s | 0.1605s | 6.2304 Ops/s | 5.6750 Ops/s | |
test_serialize_weights_pickle | 0.5614s | 0.4458s | 2.2433 Ops/s | 2.4788 Ops/s | |
test_serialize_weights_filesystem | 0.2438s | 0.1564s | 6.3918 Ops/s | 7.1444 Ops/s | |
test_serialize_model_filesystem | 0.1572s | 0.1456s | 6.8662 Ops/s | 6.7861 Ops/s | |
test_reshape_pytree | 56.5650μs | 25.8071μs | 38.7491 KOps/s | 37.4230 KOps/s | |
test_reshape_td | 96.2980μs | 32.5460μs | 30.7257 KOps/s | 30.8231 KOps/s | |
test_view_pytree | 62.5160μs | 25.6498μs | 38.9866 KOps/s | 37.7104 KOps/s | |
test_view_td | 86.6710μs | 39.4698μs | 25.3358 KOps/s | 25.7801 KOps/s | |
test_unbind_pytree | 73.0650μs | 28.5401μs | 35.0384 KOps/s | 33.9070 KOps/s | |
test_unbind_td | 0.3387ms | 39.4756μs | 25.3321 KOps/s | 25.0757 KOps/s | |
test_split_pytree | 0.1019ms | 28.8975μs | 34.6050 KOps/s | 34.3963 KOps/s | |
test_split_td | 0.5156ms | 46.0719μs | 21.7052 KOps/s | 22.0990 KOps/s | |
test_add_pytree | 83.2650μs | 36.0721μs | 27.7223 KOps/s | 27.7561 KOps/s | |
test_add_td | 0.1353ms | 58.2762μs | 17.1597 KOps/s | 17.0550 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1419ms | 67.0799μs | 14.9076 KOps/s | 15.2434 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3689ms | 0.1717ms | 5.8234 KOps/s | 5.8678 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1566ms | 46.4169μs | 21.5439 KOps/s | 22.0084 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2353ms | 0.1195ms | 8.3709 KOps/s | 8.4768 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 70.7910μs | 28.0044μs | 35.7087 KOps/s | 35.3495 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1653ms | 59.7508μs | 16.7362 KOps/s | 17.1099 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1705ms | 79.2429μs | 12.6194 KOps/s | 12.1892 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1389ms | 66.7353μs | 14.9846 KOps/s | 14.6601 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2285ms | 0.1075ms | 9.3012 KOps/s | 9.4384 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4236ms | 0.2158ms | 4.6344 KOps/s | 4.6643 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1274ms | 47.2878μs | 21.1471 KOps/s | 21.3663 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1923ms | 67.4062μs | 14.8354 KOps/s | 15.2212 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1737ms | 0.1003ms | 9.9656 KOps/s | 10.0354 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3142ms | 0.2007ms | 4.9833 KOps/s | 4.9202 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4098ms | 0.2296ms | 4.3560 KOps/s | 4.3295 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2551ms | 0.1102ms | 9.0730 KOps/s | 9.2090 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.3451ms | 63.9535μs | 15.6364 KOps/s | 16.0954 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1007ms | 47.9348μs | 20.8617 KOps/s | 20.8221 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2595ms | 0.1565ms | 6.3918 KOps/s | 6.1969 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2206ms | 0.1007ms | 9.9265 KOps/s | 9.8968 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 73.4060μs | 21.6121μs | 46.2703 KOps/s | 46.1957 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1345ms | 70.3065μs | 14.2234 KOps/s | 14.9238 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1733ms | 79.2040μs | 12.6256 KOps/s | 12.1574 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1257ms | 66.0055μs | 15.1503 KOps/s | 14.8786 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3465ms | 0.2140ms | 4.6728 KOps/s | 4.6886 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.5677ms | 1.3563ms | 737.2834 Ops/s | 684.7717 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2993ms | 0.2095ms | 4.7727 KOps/s | 4.7847 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0087ms | 0.8161ms | 1.2254 KOps/s | 1.2082 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5902ms | 0.4598ms | 2.1750 KOps/s | 2.1838 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.9469ms | 2.7016ms | 370.1493 Ops/s | 362.0866 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 96.7800μs | 39.3366μs | 25.4216 KOps/s | 25.5082 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6230ms | 32.6362μs | 30.6408 KOps/s | 31.2352 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 87.0020μs | 30.3385μs | 32.9614 KOps/s | 31.8338 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 84.8980μs | 23.0062μs | 43.4666 KOps/s | 43.2451 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 85.7590μs | 31.4024μs | 31.8447 KOps/s | 30.7583 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 68.7580μs | 22.8199μs | 43.8213 KOps/s | 43.5325 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1157ms | 53.8545μs | 18.5685 KOps/s | 18.7258 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3767ms | 20.3792μs | 49.0697 KOps/s | 48.6900 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1494ms | 46.3704μs | 21.5655 KOps/s | 21.5571 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 73.2460μs | 18.6942μs | 53.4925 KOps/s | 52.9627 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1186ms | 47.3521μs | 21.1184 KOps/s | 21.2487 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 73.5470μs | 18.7381μs | 53.3672 KOps/s | 53.2555 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1373ms | 55.3030μs | 18.0822 KOps/s | 18.6781 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0216ms | 19.8981μs | 50.2561 KOps/s | 49.2791 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1178ms | 46.7502μs | 21.3903 KOps/s | 21.0888 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 57.4060μs | 18.2788μs | 54.7081 KOps/s | 53.4622 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1093ms | 46.6920μs | 21.4169 KOps/s | 21.3373 KOps/s | |
test_compile_indexing[int-pytree-eager] | 56.1040μs | 18.5334μs | 53.9566 KOps/s | 53.3442 KOps/s | |
test_mod_add[eager] | 90.6380μs | 35.4380μs | 28.2183 KOps/s | 27.6757 KOps/s | |
test_mod_add[compile] | 0.1453ms | 65.8617μs | 15.1833 KOps/s | 14.6689 KOps/s | |
test_mod_add[compile-overhead] | 0.1253ms | 64.4586μs | 15.5138 KOps/s | 14.7392 KOps/s | |
test_mod_wrap[eager] | 0.4096ms | 0.2167ms | 4.6147 KOps/s | 4.5039 KOps/s | |
test_mod_wrap[compile] | 1.7524ms | 0.2262ms | 4.4216 KOps/s | 4.2768 KOps/s | |
test_mod_wrap[compile-overhead] | 0.3764ms | 0.2227ms | 4.4904 KOps/s | 4.3326 KOps/s | |
test_mod_wrap_and_backward[eager] | 17.5111ms | 12.8796ms | 77.6425 Ops/s | 75.6653 Ops/s | |
test_mod_wrap_and_backward[compile] | 18.0047ms | 11.7346ms | 85.2177 Ops/s | 89.6911 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 17.7536ms | 11.1944ms | 89.3303 Ops/s | 86.7172 Ops/s | |
test_seq_add[eager] | 0.2086ms | 0.1162ms | 8.6092 KOps/s | 8.4012 KOps/s | |
test_seq_add[compile] | 0.1364ms | 78.1181μs | 12.8011 KOps/s | 12.8317 KOps/s | |
test_seq_add[compile-overhead] | 0.1292ms | 75.7386μs | 13.2033 KOps/s | 13.1368 KOps/s | |
test_seq_wrap[eager] | 0.5587ms | 0.4403ms | 2.2712 KOps/s | 2.2309 KOps/s | |
test_seq_wrap[compile] | 0.3480ms | 0.2395ms | 4.1761 KOps/s | 4.1194 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3616ms | 0.2403ms | 4.1610 KOps/s | 4.1248 KOps/s | |
test_func_call_runtime[False-eager] | 1.2134ms | 0.5107ms | 1.9580 KOps/s | 1.9168 KOps/s | |
test_func_call_runtime[False-compile] | 0.5499ms | 0.4413ms | 2.2659 KOps/s | 2.2555 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5600ms | 0.4418ms | 2.2636 KOps/s | 2.2360 KOps/s | |
test_func_call_runtime[True-eager] | 0.8490ms | 0.7304ms | 1.3691 KOps/s | 1.3303 KOps/s | |
test_func_call_runtime[True-compile] | 0.6649ms | 0.4604ms | 2.1719 KOps/s | 2.1298 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.8232ms | 0.4646ms | 2.1526 KOps/s | 2.1308 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9266ms | 0.5066ms | 1.9739 KOps/s | 1.9275 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.7079ms | 0.4395ms | 2.2753 KOps/s | 2.2225 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7370ms | 0.4373ms | 2.2866 KOps/s | 2.2395 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4422ms | 0.8724ms | 1.1463 KOps/s | 1.1205 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.4082ms | 0.7670ms | 1.3038 KOps/s | 1.2762 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2534ms | 0.7749ms | 1.2906 KOps/s | 1.2618 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 3.0679ms | 1.8928ms | 528.3208 Ops/s | 525.2957 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8471ms | 0.5359ms | 1.8662 KOps/s | 1.8460 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7444ms | 0.5335ms | 1.8745 KOps/s | 1.8416 KOps/s | |
test_distributed | 0.6789ms | 0.1225ms | 8.1614 KOps/s | 7.9381 KOps/s | |
test_tdmodule | 0.1262ms | 27.3241μs | 36.5978 KOps/s | 35.5226 KOps/s | |
test_tdmodule_dispatch | 71.3230μs | 49.3140μs | 20.2782 KOps/s | 19.7077 KOps/s | |
test_tdseq | 90.4120μs | 28.5725μs | 34.9987 KOps/s | 33.7476 KOps/s | |
test_tdseq_dispatch | 92.2110μs | 54.5716μs | 18.3246 KOps/s | 17.9582 KOps/s | |
test_instantiation_functorch | 2.2593ms | 1.5097ms | 662.3783 Ops/s | 637.0373 Ops/s | |
test_exec_functorch | 0.2637ms | 0.1738ms | 5.7552 KOps/s | 5.5371 KOps/s | |
test_exec_functional_call | 0.2822ms | 0.1679ms | 5.9562 KOps/s | 5.8002 KOps/s | |
test_exec_td_decorator | 0.4985ms | 0.2294ms | 4.3600 KOps/s | 4.2224 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8318ms | 0.6487ms | 1.5415 KOps/s | 1.5197 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8407ms | 0.6553ms | 1.5260 KOps/s | 1.5303 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8213ms | 0.5212ms | 1.9185 KOps/s | 1.9108 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7348ms | 0.5198ms | 1.9240 KOps/s | 1.9122 KOps/s | |
test_to_module_speed[True] | 1.8883ms | 1.3315ms | 751.0560 Ops/s | 734.5349 Ops/s | |
test_to_module_speed[False] | 1.5623ms | 1.3010ms | 768.6318 Ops/s | 753.0804 Ops/s | |
test_tc_init | 0.1023ms | 47.5052μs | 21.0503 KOps/s | 20.4471 KOps/s | |
test_tc_init_nested | 0.1568ms | 93.4227μs | 10.7040 KOps/s | 10.2769 KOps/s | |
test_tc_first_layer_tensor | 25.8380μs | 1.6109μs | 620.7647 KOps/s | 632.7125 KOps/s | |
test_tc_first_layer_nontensor | 23.1430μs | 4.8279μs | 207.1295 KOps/s | 206.0971 KOps/s | |
test_tc_second_layer_tensor | 25.6380μs | 2.9316μs | 341.1157 KOps/s | 340.4805 KOps/s | |
test_tc_second_layer_nontensor | 28.4230μs | 6.2812μs | 159.2063 KOps/s | 161.6778 KOps/s | |
test_unbind | 0.2221s | 13.8387ms | 72.2609 Ops/s | 79.3138 Ops/s | |
test_full_like | 8.7760ms | 7.0524ms | 141.7953 Ops/s | 128.1882 Ops/s | |
test_zeros_like | 4.5030ms | 2.6869ms | 372.1791 Ops/s | 372.5175 Ops/s | |
test_ones_like | 3.9620ms | 3.0755ms | 325.1543 Ops/s | 200.7045 Ops/s | |
test_clone | 5.2735ms | 4.8237ms | 207.3105 Ops/s | 153.7428 Ops/s | |
test_squeeze | 75.4600μs | 12.8275μs | 77.9576 KOps/s | 76.6966 KOps/s | |
test_unsqueeze | 0.1462ms | 93.6526μs | 10.6778 KOps/s | 10.8278 KOps/s | |
test_split | 0.3460ms | 0.1975ms | 5.0642 KOps/s | 5.1592 KOps/s | |
test_permute | 0.2869ms | 0.1999ms | 5.0034 KOps/s | 4.9321 KOps/s | |
test_stack | 31.2497ms | 24.1418ms | 41.4219 Ops/s | 40.2470 Ops/s | |
test_cat | 31.2146ms | 23.9565ms | 41.7424 Ops/s | 40.3922 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 32.1010μs | 12.6891μs | 78.8081 KOps/s | 80.6923 KOps/s | |
test_plain_set_stack_nested | 33.8810μs | 12.8430μs | 77.8634 KOps/s | 80.4038 KOps/s | |
test_plain_set_nested_inplace | 43.2110μs | 13.8870μs | 72.0096 KOps/s | 74.6946 KOps/s | |
test_plain_set_stack_nested_inplace | 37.6000μs | 13.7904μs | 72.5140 KOps/s | 74.4205 KOps/s | |
test_items | 33.3900μs | 2.8580μs | 349.8912 KOps/s | 346.6872 KOps/s | |
test_items_nested | 0.4226ms | 0.3628ms | 2.7563 KOps/s | 2.7547 KOps/s | |
test_items_nested_locked | 0.4493ms | 0.3643ms | 2.7452 KOps/s | 2.7256 KOps/s | |
test_items_nested_leaf | 95.3610μs | 60.5266μs | 16.5216 KOps/s | 16.5710 KOps/s | |
test_items_stack_nested | 0.4243ms | 0.3627ms | 2.7571 KOps/s | 2.7733 KOps/s | |
test_items_stack_nested_leaf | 95.5120μs | 60.3953μs | 16.5576 KOps/s | 16.5383 KOps/s | |
test_items_stack_nested_locked | 0.4319ms | 0.3604ms | 2.7748 KOps/s | 2.7606 KOps/s | |
test_keys | 44.8810μs | 3.3981μs | 294.2849 KOps/s | 291.1430 KOps/s | |
test_keys_nested | 0.1352ms | 89.1112μs | 11.2219 KOps/s | 11.3286 KOps/s | |
test_keys_nested_locked | 0.6945ms | 94.5838μs | 10.5726 KOps/s | 10.6237 KOps/s | |
test_keys_nested_leaf | 0.1446ms | 79.4935μs | 12.5796 KOps/s | 12.6395 KOps/s | |
test_keys_stack_nested | 0.1268ms | 88.6211μs | 11.2840 KOps/s | 11.3716 KOps/s | |
test_keys_stack_nested_leaf | 0.1090ms | 80.0213μs | 12.4967 KOps/s | 12.6383 KOps/s | |
test_keys_stack_nested_locked | 0.1312ms | 94.3724μs | 10.5963 KOps/s | 10.6153 KOps/s | |
test_values | 6.1335μs | 0.8493μs | 1.1774 MOps/s | 1.1729 MOps/s | |
test_values_nested | 63.2110μs | 37.6048μs | 26.5923 KOps/s | 26.7400 KOps/s | |
test_values_nested_locked | 67.8210μs | 39.5344μs | 25.2945 KOps/s | 25.5350 KOps/s | |
test_values_nested_leaf | 66.9510μs | 42.6983μs | 23.4201 KOps/s | 23.6952 KOps/s | |
test_values_stack_nested | 80.4210μs | 37.7361μs | 26.4998 KOps/s | 26.5918 KOps/s | |
test_values_stack_nested_leaf | 79.8710μs | 42.9140μs | 23.3024 KOps/s | 23.5651 KOps/s | |
test_values_stack_nested_locked | 65.7310μs | 39.4088μs | 25.3751 KOps/s | 25.4754 KOps/s | |
test_membership | 1.9215μs | 0.5023μs | 1.9908 MOps/s | 2.0077 MOps/s | |
test_membership_nested | 13.0900μs | 1.9312μs | 517.8253 KOps/s | 514.9152 KOps/s | |
test_membership_nested_leaf | 14.1500μs | 1.9259μs | 519.2259 KOps/s | 518.9741 KOps/s | |
test_membership_stacked_nested | 28.3510μs | 2.0203μs | 494.9663 KOps/s | 489.6993 KOps/s | |
test_membership_stacked_nested_leaf | 38.3110μs | 1.9859μs | 503.5537 KOps/s | 489.1505 KOps/s | |
test_membership_nested_last | 38.6910μs | 2.9654μs | 337.2251 KOps/s | 329.6755 KOps/s | |
test_membership_nested_leaf_last | 29.7310μs | 2.9670μs | 337.0405 KOps/s | 332.2169 KOps/s | |
test_membership_stacked_nested_last | 32.6700μs | 3.0275μs | 330.3057 KOps/s | 332.5683 KOps/s | |
test_membership_stacked_nested_leaf_last | 26.9700μs | 2.9441μs | 339.6646 KOps/s | 335.2174 KOps/s | |
test_nested_getleaf | 53.5410μs | 6.1705μs | 162.0612 KOps/s | 160.4807 KOps/s | |
test_nested_get | 35.2710μs | 5.9538μs | 167.9600 KOps/s | 168.7198 KOps/s | |
test_stacked_getleaf | 52.1310μs | 6.1276μs | 163.1973 KOps/s | 162.5444 KOps/s | |
test_stacked_get | 40.5400μs | 5.6910μs | 175.7153 KOps/s | 173.6307 KOps/s | |
test_nested_getitemleaf | 42.7110μs | 6.3521μs | 157.4273 KOps/s | 157.7651 KOps/s | |
test_nested_getitem | 36.6410μs | 6.0756μs | 164.5940 KOps/s | 164.3603 KOps/s | |
test_stacked_getitemleaf | 32.9710μs | 6.3443μs | 157.6215 KOps/s | 156.3463 KOps/s | |
test_stacked_getitem | 52.6520μs | 5.9467μs | 168.1593 KOps/s | 167.2429 KOps/s | |
test_lock_nested | 9.5859ms | 0.3466ms | 2.8852 KOps/s | 3.0010 KOps/s | |
test_lock_stack_nested | 0.4004ms | 0.3425ms | 2.9198 KOps/s | 2.9406 KOps/s | |
test_unlock_nested | 0.3699ms | 0.2831ms | 3.5322 KOps/s | 3.6103 KOps/s | |
test_unlock_stack_nested | 0.3478ms | 0.2822ms | 3.5433 KOps/s | 3.5756 KOps/s | |
test_flatten_speed | 0.1105ms | 77.7415μs | 12.8631 KOps/s | 12.9062 KOps/s | |
test_unflatten_speed | 0.3850ms | 0.3216ms | 3.1098 KOps/s | 3.1292 KOps/s | |
test_common_ops | 0.7680ms | 0.6125ms | 1.6327 KOps/s | 1.6499 KOps/s | |
test_creation | 94.2920μs | 1.6893μs | 591.9454 KOps/s | 589.4021 KOps/s | |
test_creation_empty | 34.4610μs | 9.0362μs | 110.6664 KOps/s | 128.6991 KOps/s | |
test_creation_nested_1 | 47.3200μs | 10.6285μs | 94.0867 KOps/s | 105.0088 KOps/s | |
test_creation_nested_2 | 46.2710μs | 13.2350μs | 75.5572 KOps/s | 83.7115 KOps/s | |
test_clone | 40.2900μs | 10.4884μs | 95.3430 KOps/s | 92.8192 KOps/s | |
test_getitem[int] | 1.2547ms | 10.4061μs | 96.0970 KOps/s | 95.3588 KOps/s | |
test_getitem[slice_int] | 0.1070ms | 20.1698μs | 49.5792 KOps/s | 48.3197 KOps/s | |
test_getitem[range] | 0.1282ms | 37.3274μs | 26.7900 KOps/s | 26.8113 KOps/s | |
test_getitem[tuple] | 0.1072ms | 17.4337μs | 57.3603 KOps/s | 57.4747 KOps/s | |
test_getitem[list] | 0.1291ms | 33.3453μs | 29.9892 KOps/s | 30.4040 KOps/s | |
test_setitem_dim[int] | 41.7710μs | 19.2836μs | 51.8576 KOps/s | 52.3451 KOps/s | |
test_setitem_dim[slice_int] | 92.6320μs | 37.6447μs | 26.5642 KOps/s | 27.4353 KOps/s | |
test_setitem_dim[range] | 96.8010μs | 52.0787μs | 19.2017 KOps/s | 19.1421 KOps/s | |
test_setitem_dim[tuple] | 56.4000μs | 31.6095μs | 31.6361 KOps/s | 31.5237 KOps/s | |
test_setitem | 52.3110μs | 15.2301μs | 65.6595 KOps/s | 66.1077 KOps/s | |
test_set | 72.7310μs | 14.8300μs | 67.4311 KOps/s | 68.0815 KOps/s | |
test_set_shared | 0.5231ms | 0.1563ms | 6.3983 KOps/s | 6.2067 KOps/s | |
test_update | 0.3466ms | 18.1732μs | 55.0261 KOps/s | 58.0385 KOps/s | |
test_update_nested | 56.7000μs | 24.3151μs | 41.1267 KOps/s | 43.0088 KOps/s | |
test_update__nested | 0.5472ms | 25.4525μs | 39.2888 KOps/s | 39.2510 KOps/s | |
test_set_nested | 50.9810μs | 16.1690μs | 61.8467 KOps/s | 63.0495 KOps/s | |
test_set_nested_new | 61.1310μs | 19.7323μs | 50.6783 KOps/s | 55.8567 KOps/s | |
test_select | 66.1520μs | 30.5881μs | 32.6924 KOps/s | 33.8702 KOps/s | |
test_select_nested | 72.8510μs | 42.6986μs | 23.4200 KOps/s | 22.9828 KOps/s | |
test_exclude_nested | 98.9720μs | 62.2799μs | 16.0565 KOps/s | 15.9929 KOps/s | |
test_empty[True] | 0.3678ms | 0.2918ms | 3.4270 KOps/s | 3.3846 KOps/s | |
test_empty[False] | 3.3580μs | 0.8186μs | 1.2216 MOps/s | 1.2178 MOps/s | |
test_to | 85.3810μs | 54.4580μs | 18.3628 KOps/s | 17.8278 KOps/s | |
test_to_nonblocking | 82.9610μs | 46.2168μs | 21.6371 KOps/s | 21.3681 KOps/s | |
test_unbind_speed | 0.2945ms | 0.2399ms | 4.1676 KOps/s | 4.2378 KOps/s | |
test_unbind_speed_stack0 | 0.2903ms | 0.2391ms | 4.1819 KOps/s | 4.2306 KOps/s | |
test_unbind_speed_stack1 | 93.2962ms | 0.7388ms | 1.3535 KOps/s | 1.1962 KOps/s | |
test_split | 94.4214ms | 1.5949ms | 626.9951 Ops/s | 628.0203 Ops/s | |
test_chunk | 96.3043ms | 1.6125ms | 620.1494 Ops/s | 627.9458 Ops/s | |
test_consolidate[False-None] | 3.1463ms | 2.7165ms | 368.1150 Ops/s | 374.7569 Ops/s | |
test_consolidate[default-None] | 1.7999ms | 1.6904ms | 591.5890 Ops/s | 587.3412 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8356ms | 1.7289ms | 578.4183 Ops/s | 572.5341 Ops/s | |
test_consolidate_njt[False-None] | 6.8145ms | 6.3756ms | 156.8468 Ops/s | 151.5726 Ops/s | |
test_to[False-False-None] | 1.8430ms | 1.7491ms | 571.7202 Ops/s | 568.0969 Ops/s | |
test_to[True-False-None] | 1.5665ms | 1.3091ms | 763.9011 Ops/s | 748.6369 Ops/s | |
test_to[within-False-None] | 4.3764ms | 4.1097ms | 243.3268 Ops/s | 241.9526 Ops/s | |
test_to[True-default-None] | 5.3100ms | 5.0949ms | 196.2736 Ops/s | 187.6623 Ops/s | |
test_to_njt[False-False-None] | 7.0160ms | 6.7561ms | 148.0138 Ops/s | 142.4288 Ops/s | |
test_to_njt[True-False-None] | 5.8502ms | 5.3346ms | 187.4538 Ops/s | 187.8588 Ops/s | |
test_to_njt[within-False-None] | 12.7921ms | 12.2370ms | 81.7197 Ops/s | 60.2290 Ops/s | |
test_creation[device0] | 0.7364ms | 81.4964μs | 12.2705 KOps/s | 12.6158 KOps/s | |
test_creation_from_tensor | 0.4482ms | 86.0251μs | 11.6245 KOps/s | 11.9705 KOps/s | |
test_add_one[memmap_tensor0] | 0.2350ms | 6.6473μs | 150.4376 KOps/s | 146.4527 KOps/s | |
test_contiguous[memmap_tensor0] | 0.3635ms | 0.4035μs | 2.4782 MOps/s | 2.4821 MOps/s | |
test_stack[memmap_tensor0] | 37.5110μs | 4.2808μs | 233.6012 KOps/s | 229.8800 KOps/s | |
test_memmaptd_index | 1.6514ms | 0.2365ms | 4.2292 KOps/s | 4.1955 KOps/s | |
test_memmaptd_index_astensor | 0.4352ms | 0.3000ms | 3.3338 KOps/s | 3.3204 KOps/s | |
test_memmaptd_index_op | 0.7517ms | 0.5988ms | 1.6701 KOps/s | 1.7544 KOps/s | |
test_serialize_model | 0.1338s | 0.1309s | 7.6405 Ops/s | 7.6433 Ops/s | |
test_serialize_model_pickle | 1.3474s | 1.2176s | 0.8213 Ops/s | 0.8211 Ops/s | |
test_serialize_weights | 0.1312s | 0.1301s | 7.6864 Ops/s | 7.6654 Ops/s | |
test_serialize_weights_returnearly | 0.3050s | 52.3369ms | 19.1070 Ops/s | 23.5621 Ops/s | |
test_serialize_weights_pickle | 1.4083s | 1.2244s | 0.8167 Ops/s | 0.8206 Ops/s | |
test_reshape_pytree | 53.2110μs | 21.7340μs | 46.0108 KOps/s | 46.0261 KOps/s | |
test_reshape_td | 56.4910μs | 26.9990μs | 37.0384 KOps/s | 37.7694 KOps/s | |
test_view_pytree | 48.3210μs | 21.4090μs | 46.7094 KOps/s | 46.2644 KOps/s | |
test_view_td | 59.5810μs | 31.6604μs | 31.5852 KOps/s | 31.6919 KOps/s | |
test_unbind_pytree | 64.7210μs | 27.7880μs | 35.9867 KOps/s | 35.9775 KOps/s | |
test_unbind_td | 0.6334ms | 35.8703μs | 27.8782 KOps/s | 27.7398 KOps/s | |
test_split_pytree | 63.4120μs | 29.3322μs | 34.0922 KOps/s | 33.5883 KOps/s | |
test_split_td | 0.7517ms | 37.7292μs | 26.5047 KOps/s | 25.7793 KOps/s | |
test_add_pytree | 63.1610μs | 34.2460μs | 29.2005 KOps/s | 28.8034 KOps/s | |
test_add_td | 0.1058ms | 50.2023μs | 19.9194 KOps/s | 19.9520 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1923ms | 0.1272ms | 7.8633 KOps/s | 7.9597 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2309ms | 0.1314ms | 7.6099 KOps/s | 7.5551 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1484ms | 97.2944μs | 10.2781 KOps/s | 10.4776 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3435ms | 0.1588ms | 6.2960 KOps/s | 6.6117 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 64.6910μs | 26.3102μs | 38.0080 KOps/s | 40.4648 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1108ms | 28.6819μs | 34.8652 KOps/s | 33.6078 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3956ms | 63.0016μs | 15.8726 KOps/s | 15.7799 KOps/s | |
test_compile_copy_nested[pytree-eager] | 78.7320μs | 48.1561μs | 20.7658 KOps/s | 20.3075 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1960ms | 0.1420ms | 7.0407 KOps/s | 7.1228 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3172ms | 0.2168ms | 4.6133 KOps/s | 4.6409 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1477ms | 98.3215μs | 10.1707 KOps/s | 9.8480 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1282ms | 56.8553μs | 17.5885 KOps/s | 17.5331 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2129ms | 0.1413ms | 7.0786 KOps/s | 7.3141 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6622ms | 0.5200ms | 1.9229 KOps/s | 2.0400 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4055ms | 0.2609ms | 3.8324 KOps/s | 3.8189 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2106ms | 0.1476ms | 6.7757 KOps/s | 6.7126 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1716ms | 68.3117μs | 14.6388 KOps/s | 14.3685 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1660ms | 0.1023ms | 9.7759 KOps/s | 9.9033 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4996ms | 0.4302ms | 2.3248 KOps/s | 2.4124 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1893ms | 0.1393ms | 7.1771 KOps/s | 7.2752 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1104ms | 19.7943μs | 50.5197 KOps/s | 55.5974 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 60.4510μs | 31.2070μs | 32.0441 KOps/s | 31.8379 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1044ms | 68.6990μs | 14.5562 KOps/s | 14.6189 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1321ms | 51.5606μs | 19.3947 KOps/s | 19.2954 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.5956ms | 0.3861ms | 2.5901 KOps/s | 2.2508 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9182ms | 2.6604ms | 375.8795 Ops/s | 380.2046 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6399ms | 0.4505ms | 2.2199 KOps/s | 2.3080 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8044ms | 2.6604ms | 375.8864 Ops/s | 369.8953 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1754ms | 0.1196ms | 8.3594 KOps/s | 8.6507 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5619ms | 78.5564μs | 12.7297 KOps/s | 11.8075 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1671ms | 0.1105ms | 9.0470 KOps/s | 9.0701 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1707ms | 67.9126μs | 14.7248 KOps/s | 13.9110 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2074ms | 0.1129ms | 8.8546 KOps/s | 9.2942 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1634ms | 69.0021μs | 14.4923 KOps/s | 13.8934 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1883ms | 0.1004ms | 9.9572 KOps/s | 10.1059 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1549ms | 16.7801μs | 59.5944 KOps/s | 55.9972 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2090ms | 95.1705μs | 10.5075 KOps/s | 10.6613 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 55.2510μs | 15.2929μs | 65.3898 KOps/s | 64.5464 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1429ms | 95.9909μs | 10.4177 KOps/s | 10.1793 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 51.6800μs | 15.2460μs | 65.5911 KOps/s | 65.5256 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1683ms | 99.7014μs | 10.0299 KOps/s | 9.9567 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6867ms | 16.7418μs | 59.7306 KOps/s | 59.2974 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1521ms | 94.5637μs | 10.5749 KOps/s | 10.6362 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 64.4610μs | 15.2537μs | 65.5578 KOps/s | 64.7884 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1480ms | 94.5222μs | 10.5795 KOps/s | 10.3810 KOps/s | |
test_compile_indexing[int-pytree-eager] | 51.1400μs | 15.3643μs | 65.0860 KOps/s | 64.7318 KOps/s | |
test_mod_add[eager] | 0.1179ms | 37.9223μs | 26.3697 KOps/s | 26.7276 KOps/s | |
test_mod_add[compile] | 0.1392ms | 79.6207μs | 12.5596 KOps/s | 11.7003 KOps/s | |
test_mod_add[compile-overhead] | 0.3246ms | 0.1660ms | 6.0250 KOps/s | 5.7417 KOps/s | |
test_mod_wrap[eager] | 0.3262ms | 0.2434ms | 4.1082 KOps/s | 3.7341 KOps/s | |
test_mod_wrap[compile] | 0.6896ms | 0.2773ms | 3.6062 KOps/s | 3.4273 KOps/s | |
test_mod_wrap[compile-overhead] | 7.3449ms | 3.8558ms | 259.3467 Ops/s | 263.1443 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7474ms | 1.3518ms | 739.7794 Ops/s | 686.5008 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3791ms | 1.2595ms | 793.9851 Ops/s | 719.9184 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.4107ms | 0.9263ms | 1.0796 KOps/s | 974.7987 Ops/s | |
test_seq_add[eager] | 0.1645ms | 0.1150ms | 8.6929 KOps/s | 8.8235 KOps/s | |
test_seq_add[compile] | 0.1453ms | 90.0297μs | 11.1074 KOps/s | 10.5199 KOps/s | |
test_seq_add[compile-overhead] | 0.1807ms | 0.1284ms | 7.7881 KOps/s | 7.7382 KOps/s | |
test_seq_wrap[eager] | 0.5176ms | 0.4252ms | 2.3516 KOps/s | 2.3404 KOps/s | |
test_seq_wrap[compile] | 0.3630ms | 0.2971ms | 3.3663 KOps/s | 3.3157 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2708ms | 0.2235ms | 4.4736 KOps/s | 4.4537 KOps/s | |
test_func_call_runtime[False-eager] | 0.7906ms | 0.7226ms | 1.3838 KOps/s | 1.3414 KOps/s | |
test_func_call_runtime[False-compile] | 1.0317ms | 0.7379ms | 1.3552 KOps/s | 1.3387 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4254ms | 0.3607ms | 2.7724 KOps/s | 2.7752 KOps/s | |
test_func_call_runtime[True-eager] | 0.9986ms | 0.8849ms | 1.1300 KOps/s | 1.0910 KOps/s | |
test_func_call_runtime[True-compile] | 0.9039ms | 0.7597ms | 1.3164 KOps/s | 1.2956 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4488ms | 0.3815ms | 2.6214 KOps/s | 2.6254 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7832ms | 0.7122ms | 1.4040 KOps/s | 1.3488 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8138ms | 0.7371ms | 1.3567 KOps/s | 1.3228 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4240ms | 0.3617ms | 2.7647 KOps/s | 2.7545 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1041ms | 0.9822ms | 1.0182 KOps/s | 982.1228 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1887ms | 0.9765ms | 1.0240 KOps/s | 999.1567 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0770ms | 0.9801ms | 1.0203 KOps/s | 990.8881 Ops/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4990ms | 2.0878ms | 478.9682 Ops/s | 472.2209 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9018ms | 0.8063ms | 1.2403 KOps/s | 1.2129 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4863ms | 0.4111ms | 2.4323 KOps/s | 2.4130 KOps/s | |
test_distributed | 0.6823ms | 0.1662ms | 6.0161 KOps/s | 8.7705 KOps/s | |
test_tdmodule | 0.2907ms | 20.5914μs | 48.5639 KOps/s | 50.0057 KOps/s | |
test_tdmodule_dispatch | 56.0810μs | 36.3669μs | 27.4976 KOps/s | 28.7273 KOps/s | |
test_tdseq | 39.1810μs | 20.7726μs | 48.1403 KOps/s | 49.3381 KOps/s | |
test_tdseq_dispatch | 76.4410μs | 39.4265μs | 25.3636 KOps/s | 26.1587 KOps/s | |
test_instantiation_functorch | 1.6754ms | 1.5220ms | 657.0203 Ops/s | 644.5842 Ops/s | |
test_exec_functorch | 0.1931ms | 0.1415ms | 7.0673 KOps/s | 6.9384 KOps/s | |
test_exec_functional_call | 0.1864ms | 0.1358ms | 7.3615 KOps/s | 7.2601 KOps/s | |
test_exec_td_decorator | 0.3786ms | 0.1850ms | 5.4067 KOps/s | 5.2583 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8053ms | 0.6841ms | 1.4618 KOps/s | 1.4486 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7942ms | 0.6839ms | 1.4623 KOps/s | 1.4490 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7115ms | 0.5962ms | 1.6774 KOps/s | 1.6697 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7076ms | 0.5948ms | 1.6812 KOps/s | 1.6663 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.8105ms | 19.1685ms | 52.1689 Ops/s | 51.6709 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.5093ms | 19.1809ms | 52.1353 Ops/s | 51.7575 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.2079ms | 19.0202ms | 52.5758 Ops/s | 52.1825 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.2431ms | 19.0516ms | 52.4889 Ops/s | 52.0909 Ops/s | |
test_to_module_speed[True] | 1.4349ms | 0.9646ms | 1.0367 KOps/s | 1.0374 KOps/s | |
test_to_module_speed[False] | 1.0289ms | 0.9523ms | 1.0501 KOps/s | 1.0525 KOps/s | |
test_tc_init | 80.7220μs | 34.8490μs | 28.6952 KOps/s | 28.1963 KOps/s | |
test_tc_init_nested | 0.1046ms | 67.6050μs | 14.7918 KOps/s | 14.3625 KOps/s | |
test_tc_first_layer_tensor | 33.5610μs | 0.7934μs | 1.2604 MOps/s | 1.4219 MOps/s | |
test_tc_first_layer_nontensor | 34.4800μs | 2.1957μs | 455.4277 KOps/s | 453.7813 KOps/s | |
test_tc_second_layer_tensor | 8.5625μs | 1.4061μs | 711.1922 KOps/s | 712.7487 KOps/s | |
test_tc_second_layer_nontensor | 31.4000μs | 2.9148μs | 343.0778 KOps/s | 344.3718 KOps/s | |
test_unbind | 7.0738ms | 6.8606ms | 145.7592 Ops/s | 145.6527 Ops/s | |
test_full_like | 12.9137ms | 9.1170ms | 109.6848 Ops/s | 108.2454 Ops/s | |
test_zeros_like | 5.8487ms | 4.2624ms | 234.6095 Ops/s | 231.8154 Ops/s | |
test_ones_like | 4.5168ms | 4.2204ms | 236.9425 Ops/s | 232.0774 Ops/s | |
test_clone | 11.2060ms | 9.0059ms | 111.0382 Ops/s | 158.5309 Ops/s | |
test_squeeze | 61.3610μs | 9.4156μs | 106.2065 KOps/s | 106.3141 KOps/s | |
test_unsqueeze | 0.1230ms | 73.3248μs | 13.6380 KOps/s | 13.3711 KOps/s | |
test_split | 0.2099s | 0.2148ms | 4.6564 KOps/s | 6.1617 KOps/s | |
test_permute | 0.2256ms | 0.1766ms | 5.6610 KOps/s | 5.5949 KOps/s | |
test_stack | 50.9488ms | 49.9849ms | 20.0061 Ops/s | 19.9810 Ops/s | |
test_cat | 50.4419ms | 49.6511ms | 20.1406 Ops/s | 24.0099 Ops/s |
vmoens
added a commit
that referenced
this pull request
Feb 20, 2025
ghstack-source-id: 1c5cbcc87c0c076059d3c6c822d996c32d8f3195 Pull Request resolved: #1226
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
documentation
Improvements or additions to documentation
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):