default search action

combined dblp search
author search
venue search
publication search

ask others

Piotr Milos

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/ZhaoQSTLMWM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZhaoQSTLMWM24
Yu Zhao, Yuanbin Qu, Konrad Staniszewski, Szymon Tworkowski, Wei Liu, Piotr Milos, Yuxiang Wu, Pasquale Minervini:
Analysing The Impact of Sequence Composition on Language Model Pre-Training. ACL (1) 2024: 7897-7912
[c20]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/MikulaTAPJZSKMW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MikulaTAPJZSKMW24
Maciej Mikula, Szymon Tworkowski, Szymon Antoniak, Bartosz Piotrowski, Albert Q. Jiang, Jin Peng Zhou, Christian Szegedy, Lukasz Kucinski, Piotr Milos, Yuhuai Wu:
Magnushammer: A Transformer-Based Approach to Premise Selection. ICLR 2024
[c19]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/NaumanBMTOC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/NaumanBMTOC24
Michal Nauman, Michal Bortkiewicz, Piotr Milos, Tomasz Trzcinski, Mateusz Ostaszewski, Marek Cygan:
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning. ICML 2024
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/WolczykCOB0PKM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WolczykCOB0PKM24
Maciej Wolczyk, Bartlomiej Cupial, Mateusz Ostaszewski, Michal Bortkiewicz, Michal Zajac, Razvan Pascanu, Lukasz Kucinski, Piotr Milos:
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem. ICML 2024
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-02868
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02868
Maciej Wolczyk, Bartlomiej Cupial, Mateusz Ostaszewski, Michal Bortkiewicz, Michal Zajac, Razvan Pascanu, Lukasz Kucinski, Piotr Milos:
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem. CoRR abs/2402.02868 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-13991
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-13991
Yu Zhao, Yuanbin Qu, Konrad Staniszewski, Szymon Tworkowski, Wei Liu, Piotr Milos, Yuxiang Wu, Pasquale Minervini:
Analysing The Impact of Sequence Composition on Language Model Pre-Training. CoRR abs/2402.13991 (2024)
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-00514
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-00514
Michal Nauman, Michal Bortkiewicz, Mateusz Ostaszewski, Piotr Milos, Tomasz Trzcinski, Marek Cygan:
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning. CoRR abs/2403.00514 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-05713
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-05713
Lukasz Kucinski, Witold Drzewakowski, Mateusz Olko, Piotr Kozakowski, Lukasz Maziarka, Marta Emilia Nowakowska, Lukasz Kaiser, Piotr Milos:
tsGT: Stochastic Time Series Modeling With Transformer. CoRR abs/2403.05713 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-16158
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16158
Michal Nauman, Mateusz Ostaszewski, Krzysztof Jankowski, Piotr Milos, Marek Cygan:
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control. CoRR abs/2405.16158 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-03361
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-03361
Michal Zawalski, Gracjan Góral, Michal Tyrolski, Emilia Wisnios, Franciszek Budrowski, Lukasz Kucinski, Piotr Milos:
What Matters in Hierarchical Search for Combinatorial Reasoning Problems? CoRR abs/2406.03361 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04165
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04165
Alicja Ziarko, Albert Q. Jiang, Bartosz Piotrowski, Wenda Li, Mateja Jamnik, Piotr Milos:
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe. CoRR abs/2406.04165 (2024)
2023
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/collas/KesslerOBZWPRM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/collas/KesslerOBZWPRM23
Samuel Kessler, Mateusz Ostaszewski, Michal Pawel Bortkiewicz, Mateusz Zarski, Maciej Wolczyk, Jack Parker-Holder, Stephen J. Roberts, Piotr Milos:
The Effectiveness of World Models for Continual Reinforcement Learning. CoLLAs 2023: 184-204
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/ZawalskiTCOSPWK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZawalskiTCOSPWK23
Michal Zawalski, Michal Tyrolski, Konrad Czechowski, Tomasz Odrzygózdz, Damian Stachura, Piotr Piekos, Yuhuai Wu, Lukasz Kucinski, Piotr Milos:
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search. ICLR 2023
[c15]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/MasarczykOIPMT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MasarczykOIPMT23
Wojciech Masarczyk, Mateusz Ostaszewski, Ehsan Imani, Razvan Pascanu, Piotr Milos, Tomasz Trzcinski:
The Tunnel Effect: Building Data Representations in Deep Neural Networks. NeurIPS 2023
[c14]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Olko00SABKM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Olko00SABKM23
Mateusz Olko, Michal Zajac, Aleksandra Nowak, Nino Scherrer, Yashas Annadani, Stefan Bauer, Lukasz Kucinski, Piotr Milos:
Trust Your 𝛁: Gradient-based Intervention Targeting for Causal Discovery. NeurIPS 2023
[c13]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/TworkowskiSPWMM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/TworkowskiSPWMM23
Szymon Tworkowski, Konrad Staniszewski, Mikolaj Pacek, Yuhuai Wu, Henryk Michalewski, Piotr Milos:
Focused Transformer: Contrastive Training for Context Scaling. NeurIPS 2023
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-04488
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-04488
Maciej Mikula, Szymon Antoniak, Szymon Tworkowski, Albert Qiaochu Jiang, Jin Peng Zhou, Christian Szegedy, Lukasz Kucinski, Piotr Milos, Yuhuai Wu:
Magnushammer: A Transformer-based Approach to Premise Selection. CoRR abs/2303.04488 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-15342
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-15342
Michal Zajac, Kamil Deja, Anna Kuzina, Jakub M. Tomczak, Tomasz Trzcinski, Florian Shkurti, Piotr Milos:
Exploring Continual Learning of Diffusion Models. CoRR abs/2303.15342 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-19753
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-19753
Wojciech Masarczyk, Mateusz Ostaszewski, Ehsan Imani, Razvan Pascanu, Piotr Milos, Tomasz Trzcinski:
The Tunnel Effect: Building Data Representations in Deep Neural Networks. CoRR abs/2305.19753 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-03170
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-03170
Szymon Tworkowski, Konrad Staniszewski, Mikolaj Pacek, Yuhuai Wu, Henryk Michalewski, Piotr Milos:
Focused Transformer: Contrastive Training for Context Scaling. CoRR abs/2307.03170 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-17296
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-17296
Konrad Staniszewski, Szymon Tworkowski, Sebastian Jaszczur, Henryk Michalewski, Lukasz Kucinski, Piotr Milos:
Structured Packing in LLM Training Improves Long Context Utilization. CoRR abs/2312.17296 (2023)
2022
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/ZawalskiOMM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/ZawalskiOMM22
Michal Zawalski, Blazej Osinski, Henryk Michalewski, Piotr Milos:
Off-Policy Correction For Multi-Agent Reinforcement Learning. AAMAS 2022: 1774-1776
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/KozakowskiPM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/KozakowskiPM22
Piotr Kozakowski, Mikolaj Pacek, Piotr Milos:
Planning and Learning using Adaptive Entropy Tree Search. IJCNN 2022: 1-8
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/JiangLTCOMWJ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/JiangLTCOMWJ22
Albert Qiaochu Jiang, Wenda Li, Szymon Tworkowski, Konrad Czechowski, Tomasz Odrzygózdz, Piotr Milos, Yuhuai Wu, Mateja Jamnik:
Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers. NeurIPS 2022
[c9]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Wolczyk0PKM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Wolczyk0PKM22
Maciej Wolczyk, Michal Zajac, Razvan Pascanu, Lukasz Kucinski, Piotr Milos:
Disentangling Transfer in Continual Reinforcement Learning. NeurIPS 2022
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-10893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-10893
Albert Q. Jiang, Wenda Li, Szymon Tworkowski, Konrad Czechowski, Tomasz Odrzygózdz, Piotr Milos, Yuhuai Wu, Mateja Jamnik:
Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers. CoRR abs/2205.10893 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-00702
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-00702
Michal Zawalski, Michal Tyrolski, Konrad Czechowski, Damian Stachura, Piotr Piekos, Tomasz Odrzygózdz, Yuhuai Wu, Lukasz Kucinski, Piotr Milos:
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search. CoRR abs/2206.00702 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-13900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-13900
Maciej Wolczyk, Michal Zajac, Razvan Pascanu, Lukasz Kucinski, Piotr Milos:
Disentangling Transfer in Continual Reinforcement Learning. CoRR abs/2209.13900 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-13715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-13715
Mateusz Olko, Michal Zajac, Aleksandra Nowak, Nino Scherrer, Yashas Annadani, Stefan Bauer, Lukasz Kucinski, Piotr Milos:
Trust Your ∇: Gradient-based Intervention Targeting for Causal Discovery. CoRR abs/2211.13715 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-15944
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-15944
Samuel Kessler, Piotr Milos, Jack Parker-Holder, Stephen J. Roberts:
The Surprising Effectiveness of Latent World Models for Continual Reinforcement Learning. CoRR abs/2211.15944 (2022)
2021
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/CzechowskiJKKM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/CzechowskiJKKM21
Konrad Czechowski, Piotr Januszewski, Piotr Kozakowski, Lukasz Kucinski, Piotr Milos:
Structure and Randomness in Planning and Reinforcement Learning. IJCNN 2021: 1-8
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/CzechowskiOIZKM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/CzechowskiOIZKM21
Konrad Czechowski, Tomasz Odrzygózdz, Michal Izworski, Marek Zbysinski, Lukasz Kucinski, Piotr Milos:
Trust, but Verify: Alleviating Pessimistic Errors in Model-Based Exploration. IJCNN 2021: 1-8
[c6]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/CzechowskiOZZOW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/CzechowskiOZZOW21
Konrad Czechowski, Tomasz Odrzygózdz, Marek Zbysinski, Michal Zawalski, Krzysztof Olejnik, Yuhuai Wu, Lukasz Kucinski, Piotr Milos:
Subgoal Search For Complex Reasoning Tasks. NeurIPS 2021: 624-638
[c5]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/KucinskiKKM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/KucinskiKKM21
Lukasz Kucinski, Tomasz Korbak, Pawel Kolodziej, Piotr Milos:
Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication. NeurIPS 2021: 23075-23088
[c4]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/WolczykZPKM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WolczykZPKM21
Maciej Wolczyk, Michal Zajac, Razvan Pascanu, Lukasz Kucinski, Piotr Milos:
Continual World: A Robotic Benchmark For Continual Reinforcement Learning. NeurIPS 2021: 28496-28510
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-06808
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-06808
Piotr Kozakowski, Mikolaj Pacek, Piotr Milos:
Robust and Efficient Planning using Adaptive Entropy Tree Search. CoRR abs/2102.06808 (2021)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2105-10919
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-10919
Maciej Wolczyk, Michal Zajac, Razvan Pascanu, Lukasz Kucinski, Piotr Milos:
Continual World: A Robotic Benchmark For Continual Reinforcement Learning. CoRR abs/2105.10919 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-11204
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-11204
Konrad Czechowski, Tomasz Odrzygózdz, Marek Zbysinski, Michal Zawalski, Krzysztof Olejnik, Yuhuai Wu, Lukasz Kucinski, Piotr Milos:
Subgoal Search For Complex Reasoning Tasks. CoRR abs/2108.11204 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-06464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-06464
Lukasz Kucinski, Tomasz Korbak, Pawel Kolodziej, Piotr Milos:
Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication. CoRR abs/2111.06464 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-11229
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-11229
Michal Zawalski, Blazej Osinski, Henryk Michalewski, Piotr Milos:
Off-Policy Correction For Multi-Agent Reinforcement Learning. CoRR abs/2111.11229 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-15382
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-15382
Piotr Januszewski, Mateusz Olko, Michal Królikowski, Jakub Swiatkowski, Marcin Andrychowicz, Lukasz Kucinski, Piotr Milos:
Continuous Control With Ensemble Deep Deterministic Policy Gradients. CoRR abs/2111.15382 (2021)
2020
[c3]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/KaiserBMOCCEFKL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KaiserBMOCCEFKL20
Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H. Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski:
Model Based Reinforcement Learning for Atari. ICLR 2020
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/OsinskiJZMGHM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/OsinskiJZMGHM20
Blazej Osinski, Adam Jakubowski, Pawel Ziecina, Piotr Milos, Christopher Galias, Silviu Homoceanu, Henryk Michalewski:
Simulation-Based Reinforcement Learning for Real-World Autonomous Driving. ICRA 2020: 6411-6418
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-11329
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-11329
Blazej Osinski, Piotr Milos, Adam Jakubowski, Pawel Ziecina, Michal Martyniak, Christopher Galias, Antonia Breuer, Silviu Homoceanu, Henryk Michalewski:
CARLA Real Traffic Scenarios - novel training ground and benchmark for autonomous driving. CoRR abs/2012.11329 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1903-00374
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-00374
Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H. Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Ryan Sepassi, George Tucker, Henryk Michalewski:
Model-Based Reinforcement Learning for Atari. CoRR abs/1903.00374 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-06079
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-06079
Tomasz Korbak, Julian Zubek, Lukasz Kucinski, Piotr Milos, Joanna Raczaszek-Leonardi:
Developmentally motivated emergence of compositional communication via template transfer. CoRR abs/1910.06079 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1911-12905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-12905
Blazej Osinski, Adam Jakubowski, Piotr Milos, Pawel Ziecina, Christopher Galias, Silviu Homoceanu, Henryk Michalewski:
Simulation-based reinforcement learning for real-world autonomous driving. CoRR abs/1911.12905 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1912-09996
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-09996
Piotr Milos, Lukasz Kucinski, Konrad Czechowski, Piotr Kozakowski, Maciej Klimek:
Uncertainty-sensitive Learning and Planning with Ensembles. CoRR abs/1912.09996 (2019)
2018
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00361
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00361
Lukasz Kidzinski, Sharada Prasanna Mohanty, Carmichael F. Ong, Zhewei Huang, Shuchang Zhou, Anton Pechenko, Adam Stelmaszczyk, Piotr Jarosik, Mikhail Pavlov, Sergey Kolesnikov, Sergey M. Plis, Zhibo Chen, Zhizheng Zhang, Jiale Chen, Jun Shi, Zhuobin Zheng, Chun Yuan, Zhihui Lin, Henryk Michalewski, Piotr Milos, Blazej Osinski, Andrew Melnik, Malte Schilling, Helge J. Ritter, Sean F. Carroll, Jennifer L. Hicks, Sergey Levine, Marcel Salathé, Scott L. Delp:
Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments. CoRR abs/1804.00361 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1809-03447
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-03447
Michal Garmulewicz, Henryk Michalewski, Piotr Milos:
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge. CoRR abs/1809.03447 (2018)
2017
[c1]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/corl/KlimekMM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/corl/KlimekMM17
Maciej Klimek, Henryk Michalewski, Piotr Milos:
Hierarchical Reinforcement Learning with Parameters. CoRL 2017: 301-313

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.