Data description

Training datasets

The public instances can be downloaded here and should be placed inside the instances folder. Once the datasets are set up, the folder structure looks as follows:

instances/
  1_item_placement/
    train/           -> 9900 instances
    valid/           -> 100 instances
  2_load_balancing/
    train/           -> 9900 instances
    valid/           -> 100 instances
  3_anonymous/
    train/           -> 98 instances
    valid/           -> 20 instances

Important note: for each benchmark we propose a pre-defined split of the public instances into a training set (train) and a validation set (valid). Participants do not have to respect this arbitrary choice, and are free to use all the provided instances in whichever way they like without any restriction. All the instances included in train and valid can be considered training instances.
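Since the pre-defined split is optional, one possible way to pool the public instances and draw a custom split is sketched below. The function name `pooled_split`, the `valid_fraction` and `seed` parameters, and the choice of a uniform random shuffle are illustrative assumptions, not part of the competition tooling. The sketch only relies on the folder layout shown above and on the `.mps.gz` file extension.

```python
import random
from pathlib import Path


def pooled_split(benchmark_dir, valid_fraction=0.1, seed=0):
    """Pool train/ and valid/ of one benchmark and draw a fresh random split.

    benchmark_dir is a folder such as instances/1_item_placement.
    Returns (train_files, valid_files) as lists of Path objects.
    """
    root = Path(benchmark_dir)
    # Collect every public MILP file from both pre-defined subsets.
    files = sorted(
        path
        for subset in ("train", "valid")
        for path in (root / subset).glob("*.mps.gz")
    )
    # Shuffle with a fixed seed so that the split is reproducible.
    random.Random(seed).shuffle(files)
    n_valid = int(len(files) * valid_fraction)
    return files[n_valid:], files[:n_valid]
```

For example, `pooled_split("instances/1_item_placement", valid_fraction=0.2)` would hold out 20% of the pooled instances for validation.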

Test datasets

The test instances used to evaluate the submissions will be kept hidden until the end of the competition.

instances/
  1_item_placement/
    test/           -> 100 instances
  2_load_balancing/
    test/           -> 100 instances
  3_anonymous/
    test/           -> 20 instances

File formats

Each problem instance is composed of two files that follow the same naming pattern, for instance:

item_placement_147.mps.gz  -> the MILP instance file in compressed MPS format
item_placement_147.json    -> a JSON file with pre-computed information about the instance

In the JSON files we store a pre-computed initial primal bound and initial dual bound for each instance, which are used in the computation of our evaluation metrics. The JSON content looks as follows:

{"dual_bound": 4.063450550000058, "primal_bound": 671.5409895199994}

Those initial bounds were obtained as follows:

primal bound: the value of the first feasible solution found by the SCIP solver
dual bound: the value of the first LP relaxation solved by the SCIP solver
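As a minimal sketch of how the two files of an instance can be paired, the snippet below derives the JSON file name from the MPS file name and reads the two bounds. The helper name `load_initial_bounds` is hypothetical; the only things taken from this document are the naming pattern and the `primal_bound` and `dual_bound` keys.

```python
import json
from pathlib import Path


def load_initial_bounds(mps_path):
    """Return (primal_bound, dual_bound) for the instance stored at mps_path."""
    mps_path = Path(mps_path)
    # "item_placement_147.mps.gz" -> "item_placement_147.json"
    stem = mps_path.name[: -len(".mps.gz")]
    json_path = mps_path.with_name(stem + ".json")
    with open(json_path) as f:
        info = json.load(f)
    return info["primal_bound"], info["dual_bound"]
```

Only the JSON file is opened here; the MPS file itself would be read separately by a MILP solver.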

Problem benchmarks

Here we give a short description of each problem benchmark. In particular, we describe how each problem instance is modeled as a Mixed-Integer Linear Program (MILP).

Benchmark 1: Balanced Item Placement

There are $I$ items, $B$ bins, and $R$ resource types. Each item $i$ has a fixed resource requirement for each resource type $r$. Each bin $b$ has a fixed capacity for each resource type $r$. The goal is to place all items in bins, while minimizing the imbalance of the resources used across all bins.

Constants

$\textit{Capacity}_{b,r}$ the amount of resource $r$ available in bin $b$.

$\textit{Size}_{i,r}$ the amount of resource $r$ required by item $i$.

Decision variables

place_$i_$b: a binary variable indicating whether to place item $i$ in bin $b$.

$\forall{i,b},\quad \textit{place}_{i,b} \in \{0,1\}$

Implicit decision variables

deficit_$b_$r: a continuous variable between 0 and 1 tracking the normalized imbalance of resource $r$ in bin $b$.

$\forall{b,r},\quad \textit{deficit}_{b,r} \in [0,1]$

max_deficit_$r: a continuous variable between 0 and 1 tracking the max normalized imbalance of resource $r$ across all bins.

$\forall{r},\quad \textit{max\_deficit}_{r} \in [0,1]$

Constraints

copies_ct_$i: all items must be placed once.

$\forall{i},\quad \sum_b \textit{place}_{i,b} = 1$

supply_ct_$b_$r: bin capacities must be respected.

$\forall{b,r},\quad \sum_i \textit{Size}_{i,r} \times \textit{place}_{i,b} \leq \textit{Capacity}_{b,r}$

deficit_ct_$b_$r: normalized imbalance of resources is tracked for each bin and resource.

$\forall{b,r},\quad 1 - \frac{B}{\sum_i \textit{Size}_{i,r}}\sum_i \textit{Size}_{i,r} \times \textit{place}_{i,b} = \textit{deficit}_{b,r}$

max_deficit_ct_$r: max normalized imbalance of resources across all bins is tracked for each resource.

$\forall{b,r},\quad \textit{deficit}_{b,r} \leq \textit{max\_deficit}_{r}$
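To make the placement constraints concrete, here is a small sketch that checks supply_ct for a candidate placement. The encoding is an assumption made for illustration: `bin_of_item[i]` gives the single bin of item `i`, so copies_ct holds by construction, and `size[i][r]` and `capacity[b][r]` are plain nested lists. The function names are hypothetical.

```python
def bin_usage(size, bin_of_item, n_bins):
    """Sum the resource requirements of the items placed in each bin."""
    n_resources = len(size[0])
    used = [[0] * n_resources for _ in range(n_bins)]
    for item, b in enumerate(bin_of_item):
        for r in range(n_resources):
            used[b][r] += size[item][r]
    return used


def respects_capacities(size, capacity, bin_of_item):
    """Check supply_ct: usage must not exceed capacity for any bin and resource."""
    used = bin_usage(size, bin_of_item, len(capacity))
    return all(
        u <= c
        for used_row, cap_row in zip(used, capacity)
        for u, c in zip(used_row, cap_row)
    )


# Toy data: 4 identical items, 2 bins, 2 resource types.
size = [[2, 4], [2, 4], [2, 4], [2, 4]]
capacity = [[5, 10], [5, 10]]
balanced_ok = respects_capacities(size, capacity, [0, 0, 1, 1])    # True
overloaded_ok = respects_capacities(size, capacity, [0, 0, 0, 1])  # False
```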

Objective

Minimize the imbalance of resources used across all bins.

$\text{minimize}\quad 10\times B\times R \times\sum_r \textit{max\_deficit}_{r}+\sum_{b,r}\textit{deficit}_{b,r}$
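The objective can be evaluated by hand once the values of $\textit{deficit}_{b,r}$ are known. The sketch below assumes these values are given as a nested list `deficit[b][r]` and that each $\textit{max\_deficit}_{r}$ takes its smallest admissible value, namely the largest deficit over all bins. The function name is hypothetical.

```python
def item_placement_objective(deficit):
    """Evaluate 10 * B * R * sum_r max_deficit_r + sum_{b,r} deficit_{b,r}."""
    n_bins = len(deficit)
    n_resources = len(deficit[0])
    # Smallest max_deficit_r allowed by max_deficit_ct: the max over bins.
    max_deficit = [
        max(deficit[b][r] for b in range(n_bins)) for r in range(n_resources)
    ]
    total_deficit = sum(sum(row) for row in deficit)
    return 10 * n_bins * n_resources * sum(max_deficit) + total_deficit


# Toy values with B = 2 bins and R = 2 resource types.
example_objective = item_placement_objective([[0.0, 0.5], [0.25, 0.0]])  # 30.75
```

The weight $10 \times B \times R$ makes the worst-case deficits dominate the sum of all individual deficits, which can never exceed $B \times R$.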

Benchmark 2: Workload Apportionment

There are $I$ workers and $J$ workloads. Each worker $i$ has a fixed capacity and activation cost. Each workload $j$ has a fixed amount of work required and a set of allowed workers. The goal is to minimize the total cost for processing all workloads, under the constraint that any one worker is allowed to fail (robust apportionment).

Constants

$\textit{Capacity}_{i}$ the capacity of worker $i$.

$\textit{Cost}_{i}$ the activation cost of worker $i$.

$\textit{Load}_{j}$ the amount of work required to process workload $j$.

$\textit{Allowed}_{j}$ the set of workers allowed to process workload $j$.

Decision variables

reserved_capacity_$i_$j: a non-negative continuous variable indicating the amount of work reserved on worker $i$ for workload $j$.

$\forall{i,j},\quad \textit{reserved\_capacity}_{i,j} \in [0,\infty)$

worker_used_$i: a binary variable indicating whether worker $i$ must be activated.

$\forall{i},\quad \textit{worker\_used}_{i} \in \{0,1\}$

Constraints

(encoded as variable upper bound for reserved_capacity_$i_$j): only allowed workers can process workloads.

$\forall{i \notin \textit{Allowed}_{j}},\forall{j},\quad \textit{reserved\_capacity}_{i,j}\leq 0$

worker_capacity_ct_$i: worker capacity must be respected.

$\forall{i},\quad \sum_{j}\textit{reserved\_capacity}_{i,j}\leq \textit{Capacity}_{i}$

worker_used_ct_$i_$j: activation indicator is tracked for each worker.

$\forall{i,j},\quad \textit{reserved\_capacity}_{i,j}\leq \max(\textit{Capacity}_{i},\textit{Load}_{j})\times\textit{worker\_used}_{i}$

workload_ct_$j_failure_$i: there must be sufficient capacity for each workload in the scenario where any one of the workers is unavailable.

$\forall{i \in \textit{Allowed}_{j}},\forall{j},\quad \sum_{i'\neq i}\textit{reserved\_capacity}_{i',j}\geq \textit{Load}_{j}$
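The constraints above can be checked for a candidate reservation with a few loops. This is a sketch under an assumed encoding: `reserved[i][j]` is a nested list, `capacity[i]` and `load[j]` are lists, and `allowed[j]` is a set of worker indices. The function name is hypothetical, and the worker_used_ct linking constraint is left out because it only ties the reservation to the activation variable.

```python
def reservation_is_robust(reserved, capacity, load, allowed):
    """Check bounds, worker_capacity_ct and workload_ct_*_failure_*."""
    n_workers, n_workloads = len(capacity), len(load)
    for i in range(n_workers):
        # worker_capacity_ct: total reservation on a worker fits its capacity.
        if sum(reserved[i]) > capacity[i]:
            return False
        for j in range(n_workloads):
            # Reservations are non-negative and zero for non-allowed workers.
            if reserved[i][j] < 0:
                return False
            if i not in allowed[j] and reserved[i][j] > 0:
                return False
    for j in range(n_workloads):
        for failed in allowed[j]:
            # Capacity left for workload j when worker `failed` is unavailable.
            remaining = sum(
                reserved[i][j] for i in range(n_workers) if i != failed
            )
            if remaining < load[j]:
                return False
    return True


# Toy data: 3 workers, 1 workload of size 4 that any worker may process.
capacity = [10, 10, 10]
load = [4]
allowed = [{0, 1, 2}]
spread_ok = reservation_is_robust([[2], [2], [2]], capacity, load, allowed)  # True
single_ok = reservation_is_robust([[4], [0], [0]], capacity, load, allowed)  # False
```

In the second case the whole workload sits on worker 0, so nothing remains when that worker fails.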

Objective

Minimize the total cost for processing all workloads.

$\text{minimize}\quad \sum_{i} \textit{Cost}_{i}\times\textit{worker\_used}_{i}$
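As an illustration of the objective, the sketch below computes the activation cost implied by a reservation. It assumes that a worker is activated exactly when it holds a positive reservation, which is the cheapest choice compatible with worker_used_ct. The function name and the nested-list encoding `reserved[i][j]` are hypothetical.

```python
def activation_cost(reserved, cost):
    """Sum the activation costs of all workers with a positive reservation."""
    return sum(
        worker_cost
        for row, worker_cost in zip(reserved, cost)
        if any(amount > 0 for amount in row)
    )


# Toy data: workers 0 and 1 are used, worker 2 stays inactive.
example_cost = activation_cost([[2], [2], [0]], [3, 5, 7])  # 8
```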

Benchmark 3: Anonymous Problem

The third problem benchmark is anonymous, and thus we do not provide a description of the problem instances.
