merge new planner into master #188

jpacik · 2018-10-26T17:58:55Z

No description provided.

wolffcm · 2018-10-26T18:05:32Z

control/controller.go

@@ -318,6 +319,7 @@ func (c *Controller) processQuery(q *Query) (pop bool, err error) {
 			return true, errors.New("failed to transition query into executing state")
 		}
 		q.alloc = new(execute.Allocator)
+		// TODO: pass the plan to the executor here


This comment can go away.

nathanielc

LGTM! Just gave it a read through.

jsternberg · 2018-10-26T19:46:40Z

docs/Planner.md


-## Data Frames
+type PhysicalPlanner interface {
+	Plan(*PlanSpec) (*PlanSpec, error)


From an API design standpoint, is it possible to use the logical planner, skip the physical planner, send the plan spec to the executor, and get a valid (albeit not optimized) result?

If it's not, I think we should change the API to have the physical planner return a physical plan object or something that gives us a strong type guarantee that the correct path has been followed.

Another thing I haven't been clear about is if there's a reason why, on the interface level alone, these two are separated.

Practically, today it's not possible to use just the logical planner and skip physical planning, because each from needs a range to be pushed into it, and that push down is done as a physical optimization.

Some more depth: right now all our procedure specs are both logical procedure specs and physical procedure specs. That is, they all have a Cost method and can be translated directly into an instance of Transformation or Source. But that won't always be the case; at some point we'll have (for example) a JoinProcedureSpec which is logical (no Cost method and can't be converted to a Transformation) and the physical planner will need to choose either MergeJoinProcedureSpec or NestedLoopProcedureSpec for the physical plan.

(I need to add the above to the docs.)
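To make the logical/physical distinction above concrete, here is a minimal sketch in Go. It is not the code in this PR. The interfaces, the `On` field, the `choosePhysicalJoin` helper, and both cost formulas are assumptions invented for illustration; only the spec names `JoinProcedureSpec`, `MergeJoinProcedureSpec` and `NestedLoopProcedureSpec` come from the comment.

```go
package main

import (
	"errors"
	"fmt"
)

// ProcedureSpec is what every plan node carries (hypothetical shape).
type ProcedureSpec interface {
	Kind() string
}

// PhysicalProcedureSpec is a spec that can be costed and executed.
type PhysicalProcedureSpec interface {
	ProcedureSpec
	Cost(inputRows []int) int
}

// JoinProcedureSpec is purely logical: it has no Cost method.
type JoinProcedureSpec struct{ On []string }

func (JoinProcedureSpec) Kind() string { return "join" }

// MergeJoinProcedureSpec is physical; the linear cost is a made-up placeholder.
type MergeJoinProcedureSpec struct{ On []string }

func (MergeJoinProcedureSpec) Kind() string      { return "mergeJoin" }
func (MergeJoinProcedureSpec) Cost(in []int) int { return in[0] + in[1] }

// NestedLoopProcedureSpec is physical; the quadratic cost is a made-up placeholder.
type NestedLoopProcedureSpec struct{ On []string }

func (NestedLoopProcedureSpec) Kind() string      { return "nestedLoopJoin" }
func (NestedLoopProcedureSpec) Cost(in []int) int { return in[0] * in[1] }

// choosePhysicalJoin replaces a logical join with the cheapest physical one.
func choosePhysicalJoin(j JoinProcedureSpec, inputRows []int) (PhysicalProcedureSpec, error) {
	if len(inputRows) != 2 {
		return nil, errors.New("join requires exactly two inputs")
	}
	candidates := []PhysicalProcedureSpec{
		MergeJoinProcedureSpec{On: j.On},
		NestedLoopProcedureSpec{On: j.On},
	}
	best := candidates[0]
	for _, c := range candidates[1:] {
		if c.Cost(inputRows) < best.Cost(inputRows) {
			best = c
		}
	}
	return best, nil
}

func main() {
	spec, err := choosePhysicalJoin(JoinProcedureSpec{On: []string{"host"}}, []int{1000, 10})
	if err != nil {
		fmt.Println("planning failed:", err)
		return
	}
	fmt.Println("chosen physical join:", spec.Kind())
}
```

Because `JoinProcedureSpec` does not satisfy `PhysicalProcedureSpec`, anything that requires a costed spec cannot be handed the logical join by accident.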

I think you're right that the logical planner should return something more opaque. Josh and I have talked about breaking up PlanSpec into logical and physical counterparts.

As to why the logical/physical divide is exposed on the interface level, that's a good question. Honestly, I was following the example of what was there in the control package already. I can think of good reasons for the divide, like maybe wanting to cache logical plans to use later on, but that seems to be a ways off.

👍 I like the answer and I'm really looking forward to a further API-level differentiation between the two. I can easily see now why the difference would be needed, but we do need to have them as separate APIs. The influxql query engine suffered for a long time because the outward-facing APIs made a lot of assumptions about which previous steps had been invoked, without encoding those assumptions in the type system. It's important that we do that before having a stable API.
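One way the type-level separation discussed in this thread could look is sketched below. This is an assumption, not the API in this PR: `LogicalPlanSpec`, `PhysicalPlanSpec`, `fromRangePlanner` and `Execute` are hypothetical names, and plan nodes are reduced to strings to keep the example short.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// LogicalPlanSpec and PhysicalPlanSpec are hypothetical counterparts of PlanSpec.
type LogicalPlanSpec struct{ Nodes []string }

type PhysicalPlanSpec struct{ Nodes []string }

// PhysicalPlanner consumes a logical plan and produces a distinct type.
type PhysicalPlanner interface {
	Plan(*LogicalPlanSpec) (*PhysicalPlanSpec, error)
}

// fromRangePlanner merges each from/range pair into a single fromRange node.
type fromRangePlanner struct{}

func (fromRangePlanner) Plan(lp *LogicalPlanSpec) (*PhysicalPlanSpec, error) {
	pp := &PhysicalPlanSpec{}
	for i := 0; i < len(lp.Nodes); i++ {
		n := lp.Nodes[i]
		if n == "from" {
			// A from without a range directly after it cannot be planned.
			if i+1 >= len(lp.Nodes) || lp.Nodes[i+1] != "range" {
				return nil, errors.New("from has no range to push down")
			}
			pp.Nodes = append(pp.Nodes, "fromRange")
			i++ // skip the range node that was merged
			continue
		}
		pp.Nodes = append(pp.Nodes, n)
	}
	return pp, nil
}

// Execute accepts only a physical plan, so passing a *LogicalPlanSpec
// is rejected by the compiler instead of failing at run time.
func Execute(p *PhysicalPlanSpec) string {
	return strings.Join(p.Nodes, " -> ")
}

func main() {
	var planner PhysicalPlanner = fromRangePlanner{}
	lp := &LogicalPlanSpec{Nodes: []string{"from", "range", "filter", "yield"}}
	pp, err := planner.Plan(lp)
	if err != nil {
		fmt.Println("physical planning failed:", err)
		return
	}
	fmt.Println(Execute(pp))
	// Execute(lp) would not compile: *LogicalPlanSpec is not *PhysicalPlanSpec.
}
```

With this shape, skipping the physical planner is a compile error rather than an unchecked assumption about which steps ran earlier.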

jsternberg · 2018-10-26T20:15:17Z

functions/inputs/from.go

+// Rewrite attempts to rewrite a `from -> range` into a `FromRange`
+func (rule MergeFromRangeRule) Rewrite(node plan.PlanNode) (plan.PlanNode, bool, error) {
+	from := node.Predecessors()[0]
+	fromSpec := from.ProcedureSpec().(*FromProcedureSpec)


It would be nice if we did matching rules on the actual types and the kind was only used for JSON encoding. I always feel a bit uncomfortable with encoding the kind into logic like this, but it's everywhere else so this comment is by no means a suggestion to change this.

I would like to revisit it in the future though.

I'm open to using something else. Ultimately, we want our patterns to be able to not just recognize a procedure of a particular type, but some class of procedures within a taxonomy (like any source that can have a filter pushed into it, any aggregate function, etc).
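A rough sketch of matching on a class of procedures rather than on a kind string, assuming Go interfaces are used to express the taxonomy. `FilterPushDownSource`, `PushDownFilter`, `CSVSourceSpec`, `tryPushFilter` and the `Bucket`/`Filters` fields are hypothetical and do not exist in this PR.

```go
package main

import "fmt"

// ProcedureSpec is the minimal spec interface assumed for this sketch.
type ProcedureSpec interface {
	Kind() string
}

// FilterPushDownSource is a hypothetical capability interface: any source
// that can absorb a filter implements it, whatever its kind is.
type FilterPushDownSource interface {
	ProcedureSpec
	PushDownFilter(predicate string) ProcedureSpec
}

// FromProcedureSpec can accept filters (fields are illustrative only).
type FromProcedureSpec struct {
	Bucket  string
	Filters []string
}

func (*FromProcedureSpec) Kind() string { return "from" }

// PushDownFilter returns a copy of the spec with the predicate appended.
func (s *FromProcedureSpec) PushDownFilter(predicate string) ProcedureSpec {
	c := *s
	c.Filters = append(append([]string(nil), s.Filters...), predicate)
	return &c
}

// CSVSourceSpec is a made-up source that cannot accept filters.
type CSVSourceSpec struct{ Path string }

func (*CSVSourceSpec) Kind() string { return "csv" }

// tryPushFilter matches on capability via a type assertion, not on Kind().
func tryPushFilter(src ProcedureSpec, predicate string) (ProcedureSpec, bool) {
	if s, ok := src.(FilterPushDownSource); ok {
		return s.PushDownFilter(predicate), true
	}
	return src, false
}

func main() {
	sources := []ProcedureSpec{
		&FromProcedureSpec{Bucket: "telegraf"},
		&CSVSourceSpec{Path: "data.csv"},
	}
	for _, src := range sources {
		_, ok := tryPushFilter(src, `r.host == "a"`)
		fmt.Printf("%s: filter pushed down = %v\n", src.Kind(), ok)
	}
}
```

The kind string is then only needed for naming and encoding, and a new source opts into the rule by implementing the interface.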

jsternberg · 2018-10-26T20:18:19Z

Even if the commits are messy, this is likely best to merge without attempting any rebase or squashing any commits.

interface for planner

integrate planner and execution engine

* add plan resources
* validate plan has correct number of yield nodes

* remove old planner
* rename planner package to plan package
* rename all occurrences of planner to plan

* incorrect removal of predecessors and successors
* move GroupMode to functions package due to circular dependency
* need to register sources
* push range down into from
* make rule names consistent
  Also use intersect semantics for pushing multiple range operations into a from operation.
* fix remove yields
* change rule name

* register procedures with side effects
* do not generate yield nodes for side effect operations
* use mock procedure spec in plantest

jpacik requested review from nathanielc, aanthony1243, jsternberg, stuartcarnie and wolffcm October 26, 2018 18:00

wolffcm reviewed Oct 26, 2018

nathanielc approved these changes Oct 26, 2018

wolffcm approved these changes Oct 26, 2018

jsternberg approved these changes Oct 26, 2018

jlapacik and others added 20 commits October 26, 2018 14:34

how I envision physical operations looking (WIP)

f3809cf

Logical and physical plan nodes with common interface

9e4f347

interface for abstract transformation rules

542e2b5

interface for planner

first pass logical-to-physical planner

a6bc7f4

A Pattern interface for rules (#94)

cc991fa

top-level QueryPlan struct

4f3b656

convert a query spec into a logical plan

f8da9f6

Modify plan-traversal algorithm (#101)

2d80926

Fix plan_traversal_test

a5554a2

Reorder imports in plan_traversal_test

4bba2c1

tests for translating a Flux spec into logical plan

815d397

improve query plan comparison

4f9b060

Point code in "functions" at new planner (#117)

09a3aa1

Rename QueryPlan to PlanSpec (#118)

2a0849e

Make control package use new planner (#124)

a288c37

Add BottomUpWalk method to PlanSpec (#125)

3bcdc66

Integrate new query plans with the execution engine (#126)

e6b5a12

integrate planner and execution engine

* remove stream context (#128)

2a09dbe

* add plan resources
* validate plan has correct number of yield nodes

Add rule registration to planners (#130)

bdc9ee1

remove yields during physical planning (#134)

dfbf1be

Christopher M. Wolff and others added 11 commits October 26, 2018 14:42

Add unit tests that exercise rules; fix bugs found along the way (#140)

190d39e

add the notion of bounds to plan nodes (#142)

47e4695

remove old planner and replace with new one (#147)

14fc14a

* remove old planner
* rename planner package to plan package
* rename all occurrences of planner to plan

Refactor how yields are handled in planner (#160)

40e86b0

go mod tidy

88b268c

remove notion of ProcedureID (#175)

bc4b0b1

Rule to push filters into from (#159)

a931b66

update planner docs (#168)

144beca

register side effect procedures (#183)

7ea1cb9

* register procedures with side effects
* do not generate yield nodes for side effect operations
* use mock procedure spec in plantest

Fail physical planning for from with no range pushed down (#185)

187beb8

jpacik force-pushed the feat-planner branch from 55bd925 to 187beb8 Compare October 26, 2018 22:03

jpacik merged commit 39f39ce into master Oct 26, 2018

jpacik deleted the feat-planner branch October 26, 2018 22:05
