Bilal Piot

Cited by

	All	Since 2019
Citations	17211	16357
h-index	37	35
i10-index	49	47

Citations per year
Year	Citations
2014	48
2015	45
2016	91
2017	130
2018	470
2019	851
2020	1321
2021	2466
2022	3707
2023	4465
2024	3521

Public access

3 articles available
0 articles not available

Based on funding mandates

Co-authors

Olivier Pietquin	Cohere | ex Google DeepMind (On leave - Professor at University of Lille)	Verified email at univ-lille.fr
Mohammad Gheshlaghi Azar	Cohere	Verified email at cohere.com
Zhaohan Daniel Guo	DeepMind	Verified email at google.com
Rémi Munos	Google DeepMind	Verified email at inria.fr
Michal Valko	Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind	Verified email at meta.com
Florent Altché	Research Engineer, DeepMind	Verified email at google.com
Jean-bastien Grill		Verified email at google.com
Florian STRUB	Cohere	Verified email at cohere.com
Matthieu Geist	Cohere (ex Google, on leave of Professor, Université de Lorraine)	Verified email at univ-lorraine.fr
Corentin Tallec	DeepMind	Verified email at google.com
Pierre Richemond	Google DeepMind	Verified email at deepmind.com
Charles Blundell	Research Scientist at DeepMind	Verified email at google.com
Todd Hester	Waymo	Verified email at waymo.com
Pablo Sprechmann	Research Scientist at Google DeepMind	Verified email at google.com
Steven Kapturowski	DeepMind	Verified email at google.com
Mel Vecerik	DeepMind, University College London	Verified email at ucl.ac.uk
Dan Horgan	Google DeepMind	Verified email at google.com
Adrià Puigdomènech Badia	DeepMind	Verified email at google.com
Alex Vitvitskyi	DeepMind	Verified email at google.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com

Bilal Piot

Google Deepmind

Verified email at google.com

reinforcement learning, inverse reinforcement learning


Title	Authors	Publication	Cited by	Year
Bootstrap your own latent: A new approach to self-supervised learning	JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...	arXiv preprint arXiv:2006.07733, 2020	6469	2020
Rainbow: Combining improvements in deep reinforcement learning	M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ...	Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2720	2018
Deep q-learning from demonstrations	T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...	Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1262	2018
Noisy Networks for Exploration	M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ...	arXiv preprint arXiv:1706.10295 2018, 2017	1205*	2017
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards	M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...	arXiv preprint arXiv:1707.08817, 2017	808	2017
Agent57: Outperforming the atari human benchmark	AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...	International conference on machine learning, 507-517, 2020	671	2020
k. kavukcuoglu, R	JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ...	Munos, and M. Valko,“Bootstrap your own latent-a new approach to self …, 2020	474*	2020
Never give up: Learning directed exploration strategies	AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...	arXiv preprint arXiv:2002.06038, 2020	350	2020
Acme: A research framework for distributed reinforcement learning	MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...	arXiv preprint arXiv:2006.00979, 2020	249	2020
Mastering the game of Stratego with model-free multiagent reinforcement learning	J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ...	Science 378 (6623), 990-996, 2022	195	2022
A general theoretical paradigm to understand learning from human preferences	MG Azar, ZD Guo, B Piot, R Munos, M Rowland, M Valko, D Calandriello	International Conference on Artificial Intelligence and Statistics, 4447-4455, 2024	188	2024
Learning from demonstrations for real world reinforcement learning	T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ...	arXiv preprint arXiv:1704.03732, 2017	180	2017
Bootstrap latent-predictive representations for multitask reinforcement learning	ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar	International Conference on Machine Learning, 3875-3886, 2020	152	2020
Observe and look further: Achieving consistent performance on atari	T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...	arXiv preprint arXiv:1805.11593, 2018	139	2018
Inverse reinforcement learning through structured classification	E Klein, M Geist, B Piot, O Pietquin	Advances in neural information processing systems 25, 2012	123	2012
Approximate dynamic programming for two-player zero-sum Markov games	J Perolat, B Scherrer, B Piot, O Pietquin	International Conference on Machine Learning, 1321-1329, 2015	122	2015
Bridging the gap between imitation learning and inverse reinforcement learning	B Piot, M Geist, O Pietquin	IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016	110	2016
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning	A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos	arXiv preprint arXiv:1704.04651, 2017	106	2017
Byol works even without batch statistics	PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ...	arXiv preprint arXiv:2010.10241, 2020	99	2020
Hindsight credit assignment	A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ...	Advances in neural information processing systems 32, 2019	97	2019

Articles 1–20
