Quantcast
Channel: dblp: Rohin Shah
Browsing latest articles
Browse All 37 View Live

Image may be NSFW.
Clik here to view.

Chlorophyll: synthesis-aided compiler for low-power spatial architectures.

Phitchaya Mangpo Phothilimthana, Tikhon Jelvis, Rohin Shah, Nishant Totla, Sarah E. Chasins, Rastislav Bodík: Chlorophyll: synthesis-aided compiler for low-power spatial architectures. PLDI 2014: 396-407

View Article



Image may be NSFW.
Clik here to view.

SIMPL: A DSL for Automatic Specialization of Inference Algorithms.

Rohin Shah, Emina Torlak, Rastislav Bodík: SIMPL: A DSL for Automatic Specialization of Inference Algorithms. CoRR abs/1604.04729 (2016)

View Article

Image may be NSFW.
Clik here to view.

Active Inverse Reward Design.

Sören Mindermann, Rohin Shah, Adam Gleave, Dylan Hadfield-Menell: Active Inverse Reward Design. CoRR abs/1809.03060 (2018)

View Article

Image may be NSFW.
Clik here to view.

On the Utility of Learning about Humans for Human-AI Coordination.

Micah Carroll, Rohin Shah, Mark K. Ho, Thomas L. Griffiths, Sanjit A. Seshia, Pieter Abbeel, Anca D. Dragan: On the Utility of Learning about Humans for Human-AI Coordination. CoRR abs/1910.05789 (2019)

View Article

Image may be NSFW.
Clik here to view.

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward...

Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan: On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference. CoRR abs/1906.09624 (2019)

View Article


Image may be NSFW.
Clik here to view.

Preferences Implicit in the State of the World.

Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel, Anca D. Dragan: Preferences Implicit in the State of the World. CoRR abs/1902.04198 (2019)

View Article

Image may be NSFW.
Clik here to view.

On the Utility of Learning about Humans for Human-AI Coordination.

Micah Carroll, Rohin Shah, Mark K. Ho, Tom Griffiths, Sanjit A. Seshia, Pieter Abbeel, Anca D. Dragan: On the Utility of Learning about Humans for Human-AI Coordination. NeurIPS 2019: 5175-5186

View Article

Image may be NSFW.
Clik here to view.

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward...

Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan: On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference. ICML 2019: 5670-5679

View Article


Image may be NSFW.
Clik here to view.

Preferences Implicit in the State of the World.

Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel, Anca D. Dragan: Preferences Implicit in the State of the World. ICLR (Poster) 2019

View Article


Image may be NSFW.
Clik here to view.

The MAGICAL Benchmark for Robust Imitation.

Sam Toyer, Rohin Shah, Andrew Critch, Stuart Russell: The MAGICAL Benchmark for Robust Imitation. CoRR abs/2011.00401 (2020)

View Article

Image may be NSFW.
Clik here to view.

The MAGICAL Benchmark for Robust Imitation.

Sam Toyer, Rohin Shah, Andrew Critch, Stuart Russell: The MAGICAL Benchmark for Robust Imitation. NeurIPS 2020

View Article

Image may be NSFW.
Clik here to view.

Choice Set Misspecification in Reward Inference.

Rachel Freedman, Rohin Shah, Anca D. Dragan: Choice Set Misspecification in Reward Inference. AISafety@IJCAI 2020

View Article

Image may be NSFW.
Clik here to view.

Extracting and Using Preference Information from the State of the World.

Rohin Shah: Extracting and Using Preference Information from the State of the World. University of California, Berkeley, USA, 2020

View Article


Image may be NSFW.
Clik here to view.

The MineRL BASALT Competition on Learning from Human Feedback.

Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William H. Guss, Sharada P. Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca D....

View Article

Image may be NSFW.
Clik here to view.

Learning What To Do by Simulating the Past.

David Lindner, Rohin Shah, Pieter Abbeel, Anca D. Dragan: Learning What To Do by Simulating the Past. CoRR abs/2104.03946 (2021)

View Article


Image may be NSFW.
Clik here to view.

Combining Reward Information from Multiple Sources.

Dmitrii Krasheninnikov, Rohin Shah, Herke van Hoof: Combining Reward Information from Multiple Sources. CoRR abs/2103.12142 (2021)

View Article

Image may be NSFW.
Clik here to view.

Choice Set Misspecification in Reward Inference.

Rachel Freedman, Rohin Shah, Anca D. Dragan: Choice Set Misspecification in Reward Inference. CoRR abs/2101.07691 (2021)

View Article


Image may be NSFW.
Clik here to view.

Evaluating the Robustness of Collaborative Agents.

Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, Anca D. Dragan, Rohin Shah: Evaluating the Robustness of Collaborative Agents. CoRR abs/2101.05507 (2021)

View Article

Image may be NSFW.
Clik here to view.

Optimal Policies Tend To Seek Power.

Alexander Matt Turner, Logan Smith, Rohin Shah, Andrew Critch, Prasad Tadepalli: Optimal Policies Tend To Seek Power. NeurIPS 2021: 23063-23074

View Article

Image may be NSFW.
Clik here to view.

Retrospective on the 2021 MineRL BASALT Competition on Learning from Human...

Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas R. Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries,...

View Article

Image may be NSFW.
Clik here to view.

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the...

Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada P. Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing...

View Article


Image may be NSFW.
Clik here to view.

An Empirical Investigation of Representation Learning for Imitation.

Cynthia Chen, Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah: An Empirical Investigation of...

View Article


Image may be NSFW.
Clik here to view.

Learning What To Do by Simulating the Past.

David Lindner, Rohin Shah, Pieter Abbeel, Anca D. Dragan: Learning What To Do by Simulating the Past. ICLR 2021

View Article

Image may be NSFW.
Clik here to view.

Evaluating the Robustness of Collaborative Agents.

Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, Anca D. Dragan, Rohin Shah: Evaluating the Robustness of Collaborative Agents. AAMAS 2021: 1560-1562

View Article

Image may be NSFW.
Clik here to view.

Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct...

Rohin Shah, Vikrant Varma, Ramana Kumar, Mary Phuong, Victoria Krakovna, Jonathan Uesato, Zac Kenton: Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals. CoRR...

View Article


Image may be NSFW.
Clik here to view.

An Empirical Investigation of Representation Learning for Imitation.

Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah: An Empirical Investigation of Representation...

View Article

Image may be NSFW.
Clik here to view.

Retrospective on the 2021 BASALT Competition on Learning from Human Feedback.

Rohin Shah, Steven H. Wang, Cody Wild, Stephanie Milani, Anssi Kanervisto, Vinicius G. Goecks, Nicholas R. Waytowich, David Watkins-Valls, Bharat Prakash, Edmund Mills, Divyansh Garg, Alexander Fries,...

View Article

Image may be NSFW.
Clik here to view.

Challenges with unsupervised LLM knowledge discovery.

Sebastian Farquhar, Vikrant Varma, Zachary Kenton, Johannes Gasteiger, Vladimir Mikulik, Rohin Shah: Challenges with unsupervised LLM knowledge discovery. CoRR abs/2312.10029 (2023)

View Article

Image may be NSFW.
Clik here to view.

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training...

Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Rohin Shah: BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking...

View Article



Image may be NSFW.
Clik here to view.

Explaining grokking through circuit efficiency.

Vikrant Varma, Rohin Shah, Zachary Kenton, János Kramár, Ramana Kumar: Explaining grokking through circuit efficiency. CoRR abs/2309.02390 (2023)

View Article

Image may be NSFW.
Clik here to view.

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice...

Tom Lieberum, Matthew Rahtz, János Kramár, Neel Nanda, Geoffrey Irving, Rohin Shah, Vladimir Mikulik: Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in...

View Article

Image may be NSFW.
Clik here to view.

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the...

Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada P. Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing...

View Article

Image may be NSFW.
Clik here to view.

SIRL: Similarity-based Implicit Representation Learning.

Andreea Bobu, Yi Liu, Rohin Shah, Daniel S. Brown, Anca D. Dragan: SIRL: Similarity-based Implicit Representation Learning. CoRR abs/2301.00810 (2023)

View Article


Image may be NSFW.
Clik here to view.

SIRL: Similarity-based Implicit Representation Learning.

Andreea Bobu, Yi Liu, Rohin Shah, Daniel S. Brown, Anca D. Dragan: SIRL: Similarity-based Implicit Representation Learning. HRI 2023: 565-574

View Article

Image may be NSFW.
Clik here to view.

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training...

Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Rohin Shah: BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking...

View Article

AtP*: An efficient and scalable method for localizing LLM behaviour to...

János Kramár, Tom Lieberum, Rohin Shah, Neel Nanda: AtP*: An efficient and scalable method for localizing LLM behaviour to components. CoRR abs/2403.00745 (2024)

View Article


Evaluating Frontier Models for Dangerous Capabilities.

Mary Phuong, Matthew Aitchison, Elliot Catt, Sarah Cogan, Alexandre Kaskasoli, Victoria Krakovna, David Lindner, Matthew Rahtz, Yannis Assael, Sarah Hodkinson, Heidi Howard, Tom Lieberum, Ramana Kumar,...

View Article

Browsing latest articles
Browse All 37 View Live




Latest Images