dblp: Rohin Shah

Channel: dblp: Rohin Shah

Image may be NSFW.
Clik here to view.

Chlorophyll: synthesis-aided compiler for low-power spatial architectures.

December 31, 2013, 3:00 pm

Phitchaya Mangpo Phothilimthana, Tikhon Jelvis, Rohin Shah, Nishant Totla, Sarah E. Chasins, Rastislav Bodík: Chlorophyll: synthesis-aided compiler for low-power spatial architectures. PLDI 2014: 396-407

View Article

Image may be NSFW.
Clik here to view.

SIMPL: A DSL for Automatic Specialization of Inference Algorithms.

December 31, 2015, 3:00 pm

Rohin Shah, Emina Torlak, Rastislav Bodík: SIMPL: A DSL for Automatic Specialization of Inference Algorithms. CoRR abs/1604.04729 (2016)

View Article

Image may be NSFW.
Clik here to view.

Active Inverse Reward Design.

December 31, 2017, 3:00 pm

Sören Mindermann, Rohin Shah, Adam Gleave, Dylan Hadfield-Menell: Active Inverse Reward Design. CoRR abs/1809.03060 (2018)

View Article

Image may be NSFW.
Clik here to view.

On the Utility of Learning about Humans for Human-AI Coordination.

December 31, 2018, 3:00 pm

Micah Carroll, Rohin Shah, Mark K. Ho, Thomas L. Griffiths, Sanjit A. Seshia, Pieter Abbeel, Anca D. Dragan: On the Utility of Learning about Humans for Human-AI Coordination. CoRR abs/1910.05789 (2019)

View Article

Image may be NSFW.
Clik here to view.

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward...

December 31, 2018, 3:00 pm

Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan: On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference. CoRR abs/1906.09624 (2019)

View Article

Image may be NSFW.
Clik here to view.

Preferences Implicit in the State of the World.

December 31, 2018, 3:00 pm

Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel, Anca D. Dragan: Preferences Implicit in the State of the World. CoRR abs/1902.04198 (2019)

View Article

Image may be NSFW.
Clik here to view.

On the Utility of Learning about Humans for Human-AI Coordination.

December 31, 2018, 3:00 pm

Micah Carroll, Rohin Shah, Mark K. Ho, Tom Griffiths, Sanjit A. Seshia, Pieter Abbeel, Anca D. Dragan: On the Utility of Learning about Humans for Human-AI Coordination. NeurIPS 2019: 5175-5186

View Article

Image may be NSFW.
Clik here to view.

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward...

December 31, 2018, 3:00 pm

Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan: On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference. ICML 2019: 5670-5679

View Article

Image may be NSFW.
Clik here to view.

Preferences Implicit in the State of the World.

December 31, 2018, 3:00 pm

Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel, Anca D. Dragan: Preferences Implicit in the State of the World. ICLR (Poster) 2019

View Article

Image may be NSFW.
Clik here to view.

The MAGICAL Benchmark for Robust Imitation.

December 31, 2019, 3:00 pm

Sam Toyer, Rohin Shah, Andrew Critch, Stuart Russell: The MAGICAL Benchmark for Robust Imitation. CoRR abs/2011.00401 (2020)

View Article

Image may be NSFW.
Clik here to view.

The MAGICAL Benchmark for Robust Imitation.

December 31, 2019, 3:00 pm

Sam Toyer, Rohin Shah, Andrew Critch, Stuart Russell: The MAGICAL Benchmark for Robust Imitation. NeurIPS 2020

View Article

Image may be NSFW.
Clik here to view.

Choice Set Misspecification in Reward Inference.

December 31, 2019, 3:00 pm

Rachel Freedman, Rohin Shah, Anca D. Dragan: Choice Set Misspecification in Reward Inference. AISafety@IJCAI 2020

View Article

Image may be NSFW.
Clik here to view.

Extracting and Using Preference Information from the State of the World.

December 31, 2019, 3:00 pm

Rohin Shah: Extracting and Using Preference Information from the State of the World. University of California, Berkeley, USA, 2020

View Article

Image may be NSFW.
Clik here to view.

The MineRL BASALT Competition on Learning from Human Feedback.

December 31, 2020, 3:00 pm

Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William H. Guss, Sharada P. Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca D....

View Article

Image may be NSFW.
Clik here to view.

Learning What To Do by Simulating the Past.

December 31, 2020, 3:00 pm

David Lindner, Rohin Shah, Pieter Abbeel, Anca D. Dragan: Learning What To Do by Simulating the Past. CoRR abs/2104.03946 (2021)

View Article

Image may be NSFW.
Clik here to view.

Combining Reward Information from Multiple Sources.

December 31, 2020, 3:00 pm

Dmitrii Krasheninnikov, Rohin Shah, Herke van Hoof: Combining Reward Information from Multiple Sources. CoRR abs/2103.12142 (2021)

View Article

Image may be NSFW.
Clik here to view.

Choice Set Misspecification in Reward Inference.

December 31, 2020, 3:00 pm

Rachel Freedman, Rohin Shah, Anca D. Dragan: Choice Set Misspecification in Reward Inference. CoRR abs/2101.07691 (2021)

View Article

Image may be NSFW.
Clik here to view.

Evaluating the Robustness of Collaborative Agents.

December 31, 2020, 3:00 pm

Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, Anca D. Dragan, Rohin Shah: Evaluating the Robustness of Collaborative Agents. CoRR abs/2101.05507 (2021)

View Article

Image may be NSFW.
Clik here to view.

Optimal Policies Tend To Seek Power.

December 31, 2020, 3:00 pm

Alexander Matt Turner, Logan Smith, Rohin Shah, Andrew Critch, Prasad Tadepalli: Optimal Policies Tend To Seek Power. NeurIPS 2021: 23063-23074

View Article

Image may be NSFW.
Clik here to view.

Retrospective on the 2021 MineRL BASALT Competition on Learning from Human...

December 31, 2020, 3:00 pm

Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas R. Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries,...

View Article

Image may be NSFW.
Clik here to view.

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the...

December 31, 2020, 3:00 pm

Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada P. Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing...

View Article

Image may be NSFW.
Clik here to view.

An Empirical Investigation of Representation Learning for Imitation.

December 31, 2020, 3:00 pm

Cynthia Chen, Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah: An Empirical Investigation of...

View Article

Image may be NSFW.
Clik here to view.

Learning What To Do by Simulating the Past.

December 31, 2020, 3:00 pm

David Lindner, Rohin Shah, Pieter Abbeel, Anca D. Dragan: Learning What To Do by Simulating the Past. ICLR 2021

View Article

Image may be NSFW.
Clik here to view.

Evaluating the Robustness of Collaborative Agents.

December 31, 2020, 3:00 pm

Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, Anca D. Dragan, Rohin Shah: Evaluating the Robustness of Collaborative Agents. AAMAS 2021: 1560-1562

View Article

Image may be NSFW.
Clik here to view.

Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct...

December 31, 2021, 3:00 pm

Rohin Shah, Vikrant Varma, Ramana Kumar, Mary Phuong, Victoria Krakovna, Jonathan Uesato, Zac Kenton: Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals. CoRR...

View Article

Image may be NSFW.
Clik here to view.

An Empirical Investigation of Representation Learning for Imitation.

December 31, 2021, 3:00 pm

Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah: An Empirical Investigation of Representation...

View Article

Image may be NSFW.
Clik here to view.

Retrospective on the 2021 BASALT Competition on Learning from Human Feedback.

December 31, 2021, 3:00 pm

View Article

Image may be NSFW.
Clik here to view.

Challenges with unsupervised LLM knowledge discovery.

December 31, 2022, 3:00 pm

Sebastian Farquhar, Vikrant Varma, Zachary Kenton, Johannes Gasteiger, Vladimir Mikulik, Rohin Shah: Challenges with unsupervised LLM knowledge discovery. CoRR abs/2312.10029 (2023)

View Article

Image may be NSFW.
Clik here to view.

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training...

December 31, 2022, 3:00 pm

Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Rohin Shah: BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking...

View Article

Image may be NSFW.
Clik here to view.

Explaining grokking through circuit efficiency.

December 31, 2022, 3:00 pm

Vikrant Varma, Rohin Shah, Zachary Kenton, János Kramár, Ramana Kumar: Explaining grokking through circuit efficiency. CoRR abs/2309.02390 (2023)

View Article

Image may be NSFW.
Clik here to view.

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice...

December 31, 2022, 3:00 pm

Tom Lieberum, Matthew Rahtz, János Kramár, Neel Nanda, Geoffrey Irving, Rohin Shah, Vladimir Mikulik: Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in...

View Article

Image may be NSFW.
Clik here to view.

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the...

December 31, 2022, 3:00 pm

View Article

Image may be NSFW.
Clik here to view.

SIRL: Similarity-based Implicit Representation Learning.

December 31, 2022, 3:00 pm

Andreea Bobu, Yi Liu, Rohin Shah, Daniel S. Brown, Anca D. Dragan: SIRL: Similarity-based Implicit Representation Learning. CoRR abs/2301.00810 (2023)

View Article

Image may be NSFW.
Clik here to view.

SIRL: Similarity-based Implicit Representation Learning.

December 31, 2022, 3:00 pm

Andreea Bobu, Yi Liu, Rohin Shah, Daniel S. Brown, Anca D. Dragan: SIRL: Similarity-based Implicit Representation Learning. HRI 2023: 565-574

View Article

Image may be NSFW.
Clik here to view.

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training...

December 31, 2022, 3:00 pm

Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Rohin Shah: BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking...

View Article

AtP*: An efficient and scalable method for localizing LLM behaviour to...

December 31, 2023, 3:00 pm

János Kramár, Tom Lieberum, Rohin Shah, Neel Nanda: AtP*: An efficient and scalable method for localizing LLM behaviour to components. CoRR abs/2403.00745 (2024)

View Article

Evaluating Frontier Models for Dangerous Capabilities.

December 31, 2023, 3:00 pm

Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar,...

View Article

More Pages to Explore .....

Latest Images