Skip to content

Paper Review | Scalable Computation of High-Order Optimization Queries

Published: at 06:56 PM

Scalable Computation of High-Order Optimization Queries

Paper link

Overview with simple words

The paper proposes a solution for finding a set of tuples that collectively satisfy certain group constraints. The set of tuples could be filtered initially using the traditional DBMS predicates. The group constraints limits the aggregate value of a group of tuples. It finds the solution by turning the question into an optimization problem and solved by a black-box solver. Additionally, the paper simplifies the optimization problem by finding out a few representative tuples of the base relation, and first construct a solution based on them. It then iteratively substitute the representative tuples by the real tuples in a smaller optimization problem.

Details

Construction of a package query

  1. Basic package query: select relevant attributes from the desired relation
  2. Repetition constraints: limits the number of times one tuple appears in the package
  3. Predicates
    1. Base predicates: individual predicate on each tuple, filtered by the DBMS
    2. Global predicates: constraints on the group of tuples, solved by ILP solver
  4. Objective Clause: ranking of packages, either minimize or maximize

Query→ILP using Direct

Each tuple in the base relation is assigned a variable xix_i, that indicates the number of times it could appear in the package. xi=0x_i=0 for tuples that does not satisfy the base predicates. It then expresses the global predicates as inequalities. Finally, we minimize/maximize the objective clause under the previous constraints.

SketchRefine

Direct is too slow and is not scalable, also, the optimization problem is hard to solve, need to decompose into smaller problems.

Inspiration

Pros

Cons

May have false infeasible query answer