junrushao1994 edited a comment on pull request #9729:
URL: https://github.com/apache/tvm/pull/9729#issuecomment-993208836


   Thank you everyone for raising the PR and providing important context on 
this PR! I really appreciate that we come together and discuss about 
architectural changes. These discussions makes TVM better.
   
   ## Motivation of This PR
   
   The main motivation is to bring a runtime Module onto IRModule. Runtime 
Module that is generated through BYOC is previously stored in the attributes of 
an IRModule. One motivation of this PR is to make it more explicit by moving 
this to a field of IRModule. Notably, both the existing attribute-based 
approach and proposed field-based approached could equivalently support the 
feature we need.
   
   To make the discussion more concise, we denote:
   - A1. Adding new runtime::Modules as a required field to IRModule
   - A2. Optionally storing BYOC runtime::Module inside IRModule's attributes
   
   ## Desired Solution
   
   The fundamental need here is that we need to store compiled artifact as part 
of the IRModule so the IRModule can serve a single source of truth throughout 
compilation.
   
   However, as Mark pointed out, the runtime::Module right now has been two 
overloaded properties:
   - P1. Used to represent compiled artifact
   - P2. Used to represent in-memory data structure as a collection of packed 
functions
   
   Runtime::Module was designed for P2 but not very clearly for P1. As a 
result, most of the runtime::Modules are not bidirectionally serializable. 
However, such properties are important for IRModule.
   
   Therefore, an ultimate solution is to introduce another object for compiled 
functions, i.e. Artifact. We had some discussion previously 
(https://discuss.tvm.apache.org/t/introduce-artifact-a-container-for-generated-code/11099)
 but it hasn't been pinned down yet.
   
   The main difference between runtime::Module and Artifact is: in the case of 
runtime::Module, serializability is a less of a concern; However, in the case 
of Artifact, while there are possible cases, things are still not always 
serializable. For example, in-memory JIT'ed artifacts. But serializability is 
always a first-class concern for designers of the IR itself.
   
   ## Design Discussion
   
   A fundamental data structure, say an IR, for example, CFG in LLVM, is a 
common ground for all the developers. Changing the IR usually needs 
deliberation because it affects all the developers of the compiler. An ancient 
example (which I believe has been fixed recently) is dead-code elimination in 
Relay - at the time of implementing that pass, it relied on an assumption that 
there were no effects in Relay, which became untrue immediately after 
RefRead/RefWrite were introduced. And after that IR change, the promise of this 
pass broke and led to overall collapse of the compiler infrastructure when fed 
with it the new IR nodes.
   
   Introducing this field to IRModule right now can bring a different set of 
expectations. More specifically,
   - pass writers need to consider if they have to deal with runtime::Modules 
collections
   - Additionally, it will bring up the concern of first-class fields' 
serializability
   Putting it in the attributes, on the other hand, is less a serious 
commitment.
   
   Of course, it's hard to jump to the ultimate solution (Artifacts). 
Considering the path of migration, it's better to keep the fields as the 
optional one in IRModule.attrs for now, so migrating to Artifact-based solution 
is easier.
   
   It is also a common practice in compilers. Usually a solution starts with 
intrinsics, attributes and pragmas, and evolves gradually into first-class 
fields as design matures. 
   
   I believe there is good motivation for the feature change. The IR change, 
motivated by the same feature, could unintentionally bring broad impact that 
affects other ongoing parallel developments. It's great to have to have such 
discussion as part of the RFC. Constructive discussion of all technical 
perspectives makes the design better! Thank you all for having constructive 
discussion!
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@tvm.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to