Note: this competition is now over. You can find the retrospective of the competition here. If you wish to tackle the BASALT tasks yourself, see our benchmark paper here.


Click here to sign-up to participate on AIcrowd!

Click here to join our Discord for updates.

We are running a new iteration of the BASALT competition in 2022! Find the old page here, the summary of the results in this paper and the list of changes this year here.

In the Benchmark for Agents that Solve Almost-Lifelike Task (BASALT) competition, your task is to solve tasks based on human judgement, instead of pre-defined reward functions. The goal is to produce agents that are judged by real humans to be effective at solving a given task. This calls for training on human-feedback, whether it is training from demonstrations, training on human preferences or using humans to correct agents’ actions.

Like the Diamond competition, BASALT provides a set of Gym environments paired with human demonstrations, since methods based on imitation are an important building block for solving hard-to-specify tasks. Along with the demonstrations, we will provide a set of pretrained models, trained on various tasks in Minecraft, for you to use or finetune as you please. Combined with human-feedback training, we vision that this finetuning from human-feedback will be a succesful method in the competition, with solutions applicable outside Minecraft.

Does all this sound too difficult or obscure to approach? Fret not! We will also have an intro track, which will have easier “obtain item X” tasks for you to solve. These will help you get familiar with the codebase and the tools we provide, and hopefully encourage you to partake in the main BASALT tasks! We will also have cash prizes for the best solutions and the most scientifically interesting ones (see Prizes).

The BASALT Tasks

FindCave

The agent should search for a cave, and terminate the episode when it is inside one.

These videos do not represent the new MineRL environment or the new dataset.

MakeWaterfall

After spawning in a mountainous area, the agent should build a beautiful waterfall and then reposition itself to take a scenic picture of the same waterfall.

These videos do not represent the new MineRL environment or the new dataset.

CreateVillageAnimalPen

After spawning in a village, the agent should build an animal pen containing two of the same kind of animal next to one of the houses in a village.

These videos do not represent the new MineRL environment or the new dataset.

BuildVillageHouse

Using items in its starting inventory, the agent should build a new house in the style of the village, in an appropriate location (e.g. next to the path through the village), without harming the village in the process.

These videos do not represent the new MineRL environment or the new dataset.


Schedule (preliminary)

Competition Overview

All submissions are done through AIcrowd. Please follow the AICrowd page for the most up-to-date information and instructions, which are subject to change prior to the competition start.

Submission: Submit Trained Agents

Online Leaderboard

Evaluation Phase 1

Evaluation Phase 2

Validation


Material and baselines

Baselines will be shared prior to or at the competition start (July 1st). Please check the AICrowd page and Discord for updates (links at the top).

Meanwhile, you may find the following material useful for learning about training on human-feedback.

We do not require (or expect) all of the participants to understand all of these topics. A simple solution with outside-the-box thinking may prove to be the winning solution, and we wish to see curious people exploring their ideas, regardless of if they are based on academic works or not!

Prizes

Thanks to the overwhelming generosity of sponsors, there will be $20,000 worth of cash prizes with a conditional $50,000 - $100,000 milestone prize! In addition, the top three solutions will be invited to co-author the competition report.

Note that as we expect to be unable to evaluate all submissions, prizes may be restricted to entries that reach the second evaluation phase, or the validation phase, at the organizers’ discretion. Prize winners are expected to present their solutions at NeurIPS.

The milestone prize is awarded if a solution reaches a considerable (but reachable) milestone in this competition, such as reaching a certain level of performance.

Details of the different prizes will be provided later on the AICrowd page.

Changes from BASALT 2021

This all may sound familiar to you if you know the previous version of this competition, BASALT 2021. We do have some notable differences this year:

  1. We will provide a number of pretrained “foundational” models, trained on a wide range of Minecraft tasks, for you to use in your submission and/or to finetune. You are free to use any methods as part of your submission, but we believe these models will empower participants to better solve the given tasks.
  2. A new MineRL simulator, which replicates human observation and action space almost one-to-one. This means there are no more craft or place actions. Instead, you have to open the inventory UI and use the mouse pointer to craft items. While this is a more challenging way of playing Minecraft, being closer to human’s way of playing the game makes obtaining human demonstrations easier.
  3. A new dataset for the BASALT tasks using the new MineRL environment.
  4. To encourage the use of methods utilizing human-feedback, we will be giving out specialization prizes for solutions even if they did not win in the main metric. For example, if your solution is based mainly on human demonstrations (e.g., behavioural cloning) and reached the fourth place, you may be awarded the prize for the best solution using human demonstrations (Note: this was just an example, not a description of a prize!). More details will be provided on the AICrowd page.
  5. To ease your entry into the competition, we will have an intro track which uses an ObtainX environment, akin to the ObtainDiamond challenge in the Diamond 2021 challenge. This is designed to help you get familiar with the new environment, but we strongly encourage you to participate in the BASALT tasks.

Team

The organizing team consists of:

Advisors:

Sponsors:

Contact

If you have any questions, please feel free to contact us at basalt (at) minerl.io or on our Discord.