This is an automated email from the ASF dual-hosted git repository. masahi pushed a change to branch ci-docker-staging in repository https://gitbox.apache.org/repos/asf/tvm.git
omit 869a8f9591 [CI] Add Zephyr-SDK binaries to PATH env. in ci_cortexm add e8aeb4adf3 [CI] Add Zephyr-SDK binaries to PATH env. in ci_cortexm (#12884) add eba75e4640 [METASCHEDULE] Mark work_dir as not optional in docs (#12888) add 428269f80c [FIX,PROFILING] Fix PAPI docs (#12861) add fbb500e92f [TIR][Schedule] Relax cache read/write's restriction and fix unexpected behavior (#12766) add 71f25b3d6c [IR] Use TVM_DEFINE_OBJECT_REF_METHODS macro for Op (#12893) add a61c1ad0f0 [TIR] Fix plan buffer allocation location for loop carried dependencies (#12757) add c8423a6843 [Meta Schedule][XGBoost] Update the custom callback function of xgboost in meta schedule (#12141) add 46ea2ed42e [MetaSchedule][UX] User Interface for Jupyter Notebook (#12866) add cc6e01edc6 [frontend][pytorch]support aten::zero_ operator (#12872) add 87085b0e0d [frontend][pytorch]Support aten::Tensor_split operator (#12871) add 4ef1465d40 [skip ci] Temporarily disable comments bot (#12903) add b6a660be58 [BUILD] Re-enable ccache by default (#12839) add 8711ba44b9 [TVMScript] Import TIR methods into the IRBuilder (#12900) add fd26813723 [TVMScript] Infer T.match_buffer parameters for region (#12890) add e1f3f90588 [TOPI][Hexagon] Implement quantize op for hexagon (#12820) add f25a702a1f [TOPI][Hexagon] Add schedule and test for maxpool uint8 layout (#12826) add d4fb957ae1 [microTVM][ARM] Improve dense DSP micro kernel (#12908) add 830ebc4ec8 [TIR] Refactor IndexMap::Inverse in terms of NonSurjectiveInverse (#12904) add 5ddd35c377 [Relay][TE] Add default param name if needed (#12912) add 4d5ed07325 [TIR] Fix GetProducer/Consumer for duplicating dep edges (#12910) add f64e933246 [LLVM] Emit fp16/fp32 builtins directly into target module (#12877) add b61f633e10 [TVM PyTorch Integration] optimized_torch & as_torch how-to guide (#12318) add 7a4c10c44a [TIR][Transform] Remove num_unpacked_args from MakePackedAPI (#12892) add 7dbc68d108 [ONNX] Fix test_roi_align failure (#12906) add 77d8eef514 [Runtime][Bugfix] Added type-checking for Array::insert (#12691) add 9a673faa74 [ci] Initialize git during deploys (#12909) add 332b1469b7 [Hexagon] depth_to_space slice op (#12669) add 5a807e27c0 [Hexagon] [runtime] Add thread manager to resource management (#12905) add 82e6fc41f8 [microTVM] add the option to open a saved micro project for debugging (#12495) add a07a46ed19 [TIR] add unit-tests for upcoming primfunc-slicing (#12794) add bec9f16d42 [TIR][Transform] Clear buffer_map during MakeUnpackedAPI (#12891) add c89a8baeeb [usmp] Also remap VarNode to USMP-allocated buffer (#12880) add 178f82dc48 [TOPI] Implement Einsum with reduction axes (#12913) add d1c9febeca [ETHOSN] Remove support for 22.05 version of the driver stack (#12770) add 17e4644019 [TIR][MetaSchedule] Add regression test for layout_rewrite extent=1 (#12916) add e3a6cb6a1b [microTVM] Generalize depthwise_conv2d schedule (#12856) add 9d1fe6d8d1 [Target] Add Ampere GPUs CUDA tags (#12930) add 8af43d3c11 [Hexagon] [runtime] Add user DMA to device API resource management (#12918) add 68f9509b0c [TIR] Fix int64 dtype mismatch in Reindex (#12934) add 8c88aab778 [Bugfix][CMake] Update the minimum CMake version to 3.18 (#12682) add 5f132fd6c1 [ETHOSN] Support conversion of add/mul to requantize where possible (#12887) add 5634a1a17a [CODEGEN][OPENCL] Compatibility for OpenCL version 3.0 (#12938) add 0d8c9cef72 [Relay] Extend split for blocked ConvertLayout pass (#12886) add 9a45141165 [TIR] Use buffer's dtype when converting pad_value to TIR (#12925) add 3e3d900c66 [Virtual Machine] Implementation of 'set_output_zero_copy' (#11358) add ea01e3ffb4 [TIR] Preserve loop annotations in inject_software_pipeline pass (#12937) add 2379917985 [MetaSchedule] Add Script for TorchBench Model Tuning & Benchmarking (#12914) add 595f0b3975 [HEXAGON][QHL] Clippling the inputs of HVX version of QHL Sigmoid operation (#12919) add 25a54fb791 [TIR] Remove unused iters from the result of reindex (#12946) add 77c8b6e163 [Support] Add fallback definition of ccache in libinfo (#12945) add 4e4089edda [MetaSchedule] Fix XGBoost Import Issue (#12936) add e9eb0bc660 [LLVM] Change CHECK_NE(x, nullptr) to CHECK(x != nullptr), NFC (#12943) add dedf6393f1 [Hexagon] Change NULL to nullptr, NFC (#12944) add d4bf9ecf55 [Target] Add target_device_type attribute to override default device_type (#12509) add bf5637dc32 [DOCS][COMMUNITY] Elaborate Independence Principle for Project Participation (#12962) add c3357f6820 [Relay][Op] Register some forgotten op in Python side (#12963) add fa17da22c7 [Hexagon] Support template-free meta schedule tuning (#12854) add f121e5e355 [Hexagon] [runtime] VTCM Allocator (#12947) add f3d3ecebe1 [Hexagon] vrmpy tensorization for e2e compilation of int8 models (#12911) add 4e260d183f [BugFix][Pattern] Fixed a bug in PatternGrouper (#12901) add de6d806775 [CMSIS-NN] Support for int16 conv2d (#12950) add 1ea1a0bc88 [Hexagon] 3-stage pipeline; multi queue async DMA for cache read / write (#12954) add 61a7632b55 [microTVM] Use default Project Options in template projects and add Makefile for Arduino template project (#12818) add 3a125375a3 [Hexagon] Don't print simulator protocol messages (#12958) add a661864f85 [skip ci] Modify issue templates to align with Issue Tracking RFC (#12898) add af01526ae2 [skip ci] Edits to the Bug & Flaky test Issue templates to reduce word count (#12985) add a997c23e94 [CODEGEN][OPENCL] Sampler definition should be at outermost scope (#12951) add 9618e6a457 [TIR] Use IndexMap to transform NDArray (#12949) add 393d5f7fd2 [FIX,PROFILER] Fix PAPI test and test in CI (#12953) add 2e257f0376 [Hexagon] Do not use `target` test fixture in Hexagon tests (#12981) add 2860a50ff9 [Hexagon] Fix RPC session close by adding shutdown PackedFunc (#12960) add d023ef47fc [TIR, MetaSchedule] Preserve unit block iters for auto-tensorization (#12974) add 59b945742d [FIX,AUTOSCHEDULER,METASCHEDULE] Handle negative extents in featurization (#12990) add 3d5e440fff [ci] Disable flaky ethosu + roofline tests (#12956) add 2d50979606 [TVMScript] Allow T.bool type annotations (#12975) add 7fc35da3b9 [TEST] CPU feature detection for x86 and ARM dot product instructions (#12980) add 1b9e20a807 [ETHOSN] Transpose fully connected weights (#12970) add 6147c31b36 [ETHOSN] Remove backwards copy in the runtime (#12968) add 5e24aa2b78 [Hexagon] [runtime] Query for total and available VTCM (#12992) add 87c466cdee [Textures] Improve error reporting (#12986) add e375c311da [Arith][IndexMap] Correct MapShape result for small vectorized dims (#12927) add 7ff42a1d4e [TIR] Require dtype.is_float() inside FloatImm (#12928) add 50df4abf3b [Relay] optimize dumpir time (#12792) add 78df9c1d79 [LOGGING] Optionally print backtrace on segfault (#12959) add bb2cec1941 [Fix,MetaSchedule] Correct log usage in arithmetic intensity feature (#12806) add 6780c9f87d [MetaSchedule] Tuning API cleanup & ergonomics (#12895) add 7804a9886c [Hexagon] disable cache_write schedule type from sw pipeline test (#13004) add fc333f90e8 [TIR][Arith] Prove conditionals by transitively applying knowns (#12863) add d9e01475af [UnitTest][TIR] Support IRModule comparisons in CompareBeforeAfter (#12920) add 20aa0cf2f7 [ci] Re-enable roofline test (#13007) add 189338c919 [MetaSchedule] Support RewriteLayout postproc on AllocateConst (#12991) add d92d47ad7f [docs] fixed codebase_walkthrough document bug (#13008) add f21b5ca4ae [TIR][Schedule] Add cache_inplace primitive to cache opaque buffer (#12939) add aedbe45b6a [Node] Expose StructuralEqual/Hash handler implemenation to header (#13001) add f8666a9469 [ci] Template build steps (#12983) add 84c50ef045 [ci][docs] Add external network resources to S3 (#12989) add fcbcd156c7 [microTVM] Add Cortex-M DSP schedules for optimal conv2d layouts (#12969) add 0b034d77bd [Doc] Fix the typo in the debugging doc (#13032) add cf96072a04 [COMMUNITY] Add andrewzhaoluo key for release (#13025) add afeab6e8a3 [TVMScript] AST, Source and diagnostics for Parser (#12978) add cd077575f3 Update to 0.11.dev0 (#13036) add df0ffc3b50 [FIX,LOWERING] Add attrs from Relay Functions to PrimFuncs in Metaschedule Lowering (#13034) add 05d30f2502 [Docs] Update Release Process Docs -- Add Ver. Bump Step (#13039) add 4f975c7506 [Hexagon] [runtime] Support VTCM alignments of 128 or 2k (#12999) add cc235f88a4 [TIR] Update ir_comparator message to be more clear about what is being compared (#13038) add a752b74956 [Conda] Specify python version in build-environment (#13047) add 48829756e1 [TIR] Fix the stride calculation in InjectRollingBuffer (#13016) add b8cfc4ca82 [CI] Update Compute Library to v22.08 (#13035) add 256075566b [MetaSchedule] Allow skipping exact NDArray rewrite in RemoveWeightLayoutRewriteBlock (#13052) add b18f6425f9 [Hexagon] [runtime] Manage RPC and runtime buffers separately (#13028) add 90c666f860 [Relay][Pass] ConcretizeCastLikeRewrite for SimplifyExpr (#12923) add 61c9742ea7 [Hexagon] Enable multi input Async DMA; same queue / stage (#13037) add 46c16eed83 [microTVM][Windows] Enable building TVM on Windows with USE_MICRO (#12851) add f06896f38f [Relay] Fix simplifying consecutive casts when intermediate type has fewer bits (#13056) add 29a8f06066 [Arith] Optional rewriting and simplification into AND of ORs (#12972) add b389d4dac4 [Torch] Fix torch contrib issues (#13061) add f232272e75 [TIR] Refactor NarrowDataType into DataTypeLegalizer (#13049) add 493458e552 [TE] Raise error for non-bijective transformation (#12926) add 605876e638 [TEST] Fix the broken VNNI MetaSchedule test (#13067) add 44c35dcd96 [TVMScript] Fix parsing int64 loop with optional loop start (#13068) add 342ffb91d6 [Hexagon]Register fast softmax schedule with default schedule (#13083) add 5eab64885a [ROOFLINE] Add support for different dtypes (#13003) add 5ed94eefad [Node] Fix structural equal path tracing pointer usage (#13082) add 71f32ca4e8 [MetaSchedule][UX] Support Interactive Performance Table Printing in Notebook (#13006) add ec5c692148 Add include directory for OpenBLAS on RedHat (#13087) add 5e862d4e41 [Frontend][PyTorch]Fix keywords to canonicalize scale and zero point access for FX-quantized graphs (#13071) add 9299a29fd3 [TVMScript] Evaluator, core parser, var table (#13088) add cbca28da00 [MetaSchedule] Consolidate module hashing and equality testing (#13050) add e8ba1dc4cc [CI] Update Docker Image tag to 20221013-060115-61c9742ea (#13078) add 3f0d3f24b7 [AOT] Sanitize input/output name in runtime (#13046) add b1c8c90384 [skip ci] Added label tags links to the wiki page in issue templates to align with Issue Tracking RFC (#12988) add 34c43d43ea [MetaSchedule] Introduce a variant of ModuleEquality to enable ignoring NDArray raw data (#13091) add 42d9c38a4d [Doc] Fix typo in the document of installing from_source (#13090) add 69ba30e509 [TIR][FIX] Fix crash when using 'if' without 'else' in TVMScript (#13054) add 5dd786b3c9 [Hexagon] [runtime] VTCM bugfix, runtime buffer clarification (#13066) add 4074127b71 quic-sanirudh -> Reviewer (#13098) add c14f5e1e50 [ONNX] Handle multiple imports (#13065) add f4e9179601 [TIR] Fix handling of int64 extent in blockize and tensorize (#13069) add e3d3a1a62e [Relay][Frontend][ONNX] Add LayerNormalization operator (#13074) add 8ccc43445a [Hexagon] Async DMA pipelining test suite (#13005) add 8d2e887dbb [HotFix] Fix python import (#13099) add ee55333222 [Hexagon] Add feature to copy logcat to --hexagon-debug and add new --sysmon-profile option to run sysmon profiler during the test (#13107) add 468732c6b3 [PopenPool] Enable Stdout & Stderr Redirect in PopenPool & PopenWorker (#13112) add 9f047c0627 [DOCKER][Adreno]Docker infra for Adreno target with CLML support (#12833) add 48be4ff344 [Docs] Add instructions on downloads page updating on release process (#13106) add 64975a425f [skip ci][COMMUNITY] gigiblender -> Reviewer (#13122) add 6056e13db9 [Adreno] Fix winograd accuracy (#13117) add 010d05c680 [QNN][Hexagon] Disable QNN canonicalization pass (#12398) add 3d22dbffd0 [Relay] fix: add compute tag for trilu (#13120) add e3b722b70d [Hexagon] [runtime] Use malloc/free for RPC buffers (#13125) add 9c9f32536a Update Pytorch to version 1.12.0 and TorchVision to 0.13.0 (#13126) add 0b4836739c Skip stride check if shape is 1 in IsContiguous (#13121) add a4840e7de3 [ci] Lint for trailing newlines and spaces (#13058) new ab914915c9 Testing a new GPU image This update added new revisions after undoing existing revisions. That is to say, some revisions that were in the old version of the branch are not in the new version. This situation occurs when a user --force pushes a change and generates a repository containing something like this: * -- * -- B -- O -- O -- O (869a8f9591) \ N -- N -- N refs/heads/ci-docker-staging (ab914915c9) You should already have received notification emails for all of the O revisions, and so the following emails describe only the N revisions from the common base, B. Any revisions marked "omit" are not gone; other references still refer to them. Any revisions marked "discard" are gone forever. The 1 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference. Summary of changes: .gitattributes | 1 - .github/ISSUE_TEMPLATE/bug-report.md | 11 +- .github/ISSUE_TEMPLATE/ci-image.md | 29 - .github/ISSUE_TEMPLATE/ci-problem.md | 13 +- .github/ISSUE_TEMPLATE/documentation.md | 10 +- .github/ISSUE_TEMPLATE/feature-tracking.md | 11 +- .github/ISSUE_TEMPLATE/flaky-test.md | 10 +- .github/actions/setup/action.yml | 1 - .github/dependabot.yml | 2 +- .../pr_comment_bot.yml | 0 .github/workflows/main.yml | 4 +- 3rdparty/libcrc/tab/gentab_ccitt.inc | 1 - CMakeLists.txt | 18 +- CONTRIBUTORS.md | 2 + Jenkinsfile | 286 ++++--- KEYS | 59 ++ NEWS.md | 2 - apps/android_camera/app/src/main/jni/Android.mk | 2 +- .../app/src/main/res/layout/activity_main.xml | 2 +- apps/android_camera/models/requirements.txt | 2 +- apps/android_deploy/app/build.gradle | 4 +- apps/android_deploy/app/download-models.gradle | 1 - apps/android_deploy/dev_tools/gen_keystore.sh | 4 +- apps/android_deploy/dev_tools/sign_apk.sh | 4 +- apps/android_deploy/gradle.properties | 2 +- apps/android_rpc/dev_tools/gen_keystore.sh | 4 +- apps/android_rpc/dev_tools/sign_apk.sh | 4 +- apps/bundle_deploy/README.md | 6 +- apps/cpp_rpc/CMakeLists.txt | 2 +- apps/hexagon_launcher/cmake/hexagon/CMakeLists.txt | 1 - apps/ios_rpc/.gitignore | 1 - apps/ios_rpc/README.md | 4 +- .../AppIcon.appiconset/Contents.json | 2 +- apps/microtvm/README.md | 2 - .../arduino/template_project/Makefile.template | 64 ++ .../template_project/microtvm_api_server.py | 271 +++++-- apps/microtvm/cmsisnn/README.md | 2 +- apps/microtvm/cmsisnn/corstone300.ld | 2 +- apps/microtvm/reference-vm/README.md | 4 +- .../reference-vm/base-box/base_box_provision.sh | 4 +- .../reference-vm/base-box/base_box_test.sh | 6 +- apps/microtvm/zephyr/README.md | 1 - .../zephyr/template_project/microtvm_api_server.py | 80 +- .../src/host_driven/fvp/semihost.c | 2 +- apps/microtvm/zephyr_cmsisnn/CMakeLists.txt | 18 +- apps/microtvm/zephyr_cmsisnn/README.md | 2 +- apps/microtvm/zephyr_cmsisnn/model/labels.txt | 2 +- apps/microtvm/zephyr_cmsisnn/run_demo.sh | 2 +- apps/pt_tvmdsoop/prepare_and_test_pt_tvm_class.sh | 5 +- apps/pt_tvmdsoop/tests/test_as_torch.py | 20 +- apps/tf_tvmdsoop/prepare_and_test_tfop_module.sh | 5 +- apps/vta_rpc/start_rpc_server.sh | 4 +- apps/wasm-standalone/README.md | 4 +- ci/jenkins/.gitignore | 2 +- ci/jenkins/Build.groovy.j2 | 328 ++++---- ci/jenkins/Deploy.groovy.j2 | 10 +- ci/jenkins/Jenkinsfile.j2 | 20 +- ci/jenkins/macros.j2 | 47 +- ci/scripts/git_skip_ci_globs.py | 2 + cmake/config.cmake | 15 + cmake/modules/Arduino.cmake | 1 + cmake/modules/Git.cmake | 2 +- cmake/modules/LibInfo.cmake | 2 + cmake/modules/Logging.cmake | 5 + cmake/modules/StandaloneCrt.cmake | 30 +- cmake/modules/contrib/BLAS.cmake | 5 + cmake/modules/contrib/BNNS.cmake | 1 - cmake/modules/contrib/CMSISNN.cmake | 4 +- cmake/modules/contrib/CODEGENC.cmake | 1 - cmake/modules/contrib/DNNL.cmake | 1 - cmake/modules/contrib/EthosN.cmake | 3 +- cmake/modules/contrib/Posit.cmake | 2 +- cmake/modules/contrib/TF_TVMDSOOP.cmake | 9 +- cmake/modules/contrib/Verilator.cmake | 1 - cmake/utils/CCache.cmake | 52 ++ cmake/utils/FindEthosN.cmake | 2 +- cmake/utils/FindVulkan.cmake | 2 +- cmake/utils/Summary.cmake | 1 + conda/build-environment.yaml | 1 + conda/condarc | 8 +- conda/recipe/meta.yaml | 2 +- ...nstall_cmake_source.sh => Dockerfile.ci_adreno} | 28 +- docker/Dockerfile.ci_minimal | 2 +- docker/README.md | 6 +- docker/bash.sh | 15 +- docker/install/ubuntu1804_manual_install_llvm.sh | 1 - .../ubuntu_download_arm_compute_lib_binaries.sh | 2 +- docker/install/ubuntu_install_androidsdk.sh | 5 +- docker/install/ubuntu_install_caffe2.sh | 4 +- docker/install/ubuntu_install_cmake_source.sh | 2 +- docker/install/ubuntu_install_cmsis.sh | 1 - docker/install/ubuntu_install_coreml.sh | 4 +- docker/install/ubuntu_install_dnnl.sh | 4 +- docker/install/ubuntu_install_gluoncv.sh | 4 +- docker/install/ubuntu_install_golang.sh | 4 +- docker/install/ubuntu_install_gradle.sh | 4 +- docker/install/ubuntu_install_hexagon.sh | 4 +- docker/install/ubuntu_install_java.sh | 4 +- docker/install/ubuntu_install_nodejs.sh | 2 +- docker/install/ubuntu_install_onnx.sh | 4 +- docker/install/ubuntu_install_opencl.sh | 4 +- docker/install/ubuntu_install_rocm.sh | 4 +- docker/install/ubuntu_install_universal.sh | 2 +- docker/install/ubuntu_install_vitis_ai_core.sh | 4 +- .../install/ubuntu_install_vitis_ai_packages_ci.sh | 4 +- docker/utils/apt-install-and-clear.sh | 1 - docs/_static/img/README | 2 +- docs/arch/hybrid_script.rst | 2 +- docs/conf.py | 2 +- docs/contribute/committer_guide.rst | 13 + docs/contribute/release_process.rst | 13 +- docs/dev/how_to/debugging_tvm.rst | 2 +- docs/dev/how_to/relay_add_op.rst | 56 +- docs/dev/tutorial/codebase_walkthrough.rst | 5 +- docs/errors.rst | 1 - docs/how_to/deploy/arm_compute_lib.rst | 2 +- docs/how_to/deploy/index.rst | 14 +- docs/how_to/deploy/vitis_ai.rst | 154 ++-- docs/how_to/profile/papi.rst | 10 +- docs/install/from_source.rst | 11 +- docs/reference/api/python/auto_scheduler.rst | 1 - docs/reference/api/python/topi.rst | 2 - docs/reference/langref/relay_pattern.rst | 2 +- docs/topic/vta/.gitignore | 2 +- docs/topic/vta/dev/config.rst | 1 - docs/topic/vta/dev/hardware.rst | 2 - docs/topic/vta/dev/index.rst | 2 +- docs/topic/vta/install.rst | 1 - .../tune_with_autoscheduler/tune_network_cuda.py | 2 +- .../tune_with_autoscheduler/tune_network_x86.py | 2 +- gallery/how_to/work_with_microtvm/micro_aot.py | 6 +- .../how_to/work_with_microtvm/micro_autotune.py | 6 +- .../work_with_microtvm/micro_reference_vm.py | 4 +- gallery/how_to/work_with_microtvm/micro_tflite.py | 2 +- gallery/how_to/work_with_microtvm/micro_train.py | 2 +- gallery/how_to/work_with_pytorch/using_as_torch.py | 159 ++++ .../work_with_pytorch/using_optimized_torch.py | 149 ++++ gallery/tutorial/tvmc_python.py | 2 +- include/tvm/arith/analyzer.h | 122 ++- include/tvm/ir/op.h | 13 +- include/tvm/meta_schedule/database.h | 63 +- include/tvm/meta_schedule/measure_callback.h | 7 +- include/tvm/meta_schedule/mutator.h | 15 +- include/tvm/meta_schedule/postproc.h | 9 + include/tvm/meta_schedule/schedule_rule.h | 18 +- include/tvm/meta_schedule/search_strategy.h | 32 +- include/tvm/meta_schedule/space_generator.h | 53 +- include/tvm/meta_schedule/task_scheduler.h | 282 +++---- include/tvm/meta_schedule/tune_context.h | 87 +- include/tvm/node/structural_equal.h | 42 + include/tvm/node/structural_hash.h | 38 + include/tvm/runtime/c_runtime_api.h | 2 +- include/tvm/runtime/container/array.h | 5 + include/tvm/runtime/crt/logging.h | 4 + include/tvm/runtime/crt/platform.h | 4 + include/tvm/runtime/crt/rpc_common/session.h | 13 + include/tvm/runtime/crt/rpc_common/write_stream.h | 2 + include/tvm/runtime/data_type.h | 10 + .../tvm/runtime/name_transforms.h | 36 +- include/tvm/runtime/ndarray.h | 9 + include/tvm/runtime/vm/vm.h | 91 ++- include/tvm/script/ir_builder/tir/ir.h | 8 + include/tvm/support/random_engine.h | 53 +- include/tvm/target/compilation_config.h | 2 +- include/tvm/target/target.h | 6 +- include/tvm/target/target_kind.h | 15 +- include/tvm/target/virtual_device.h | 10 +- include/tvm/tir/index_map.h | 24 +- include/tvm/tir/schedule/schedule.h | 10 + include/tvm/tir/stmt_functor.h | 51 ++ include/tvm/tir/transform.h | 19 +- include/tvm/topi/einsum.h | 889 +-------------------- jvm/conf/log4j.properties | 4 +- jvm/core/src/main/java/org/apache/tvm/Base.java | 1 - .../tvm/rpc/ConnectProxyServerProcessor.java | 4 +- .../src/test/java/org/apache/tvm/TestUtils.java | 4 +- licenses/LICENSE.builtin_fp16.txt | 2 +- licenses/LICENSE.cma.txt | 2 +- licenses/LICENSE.concurrentqueue.txt | 2 +- licenses/LICENSE.libbacktrace.txt | 6 +- licenses/LICENSE.picojson.txt | 2 +- mypy.ini | 1 - python/tvm/_ffi/libinfo.py | 2 +- python/tvm/auto_scheduler/feature.py | 8 +- python/tvm/contrib/hexagon/build.py | 137 +++- python/tvm/contrib/hexagon/pytest_plugin.py | 56 +- python/tvm/contrib/hexagon/session.py | 2 + python/tvm/contrib/popen_pool.py | 36 +- python/tvm/contrib/torch/as_torch.py | 83 +- python/tvm/contrib/torch/optimize_torch.py | 183 ++--- python/tvm/ir/expr.py | 7 +- python/tvm/meta_schedule/__init__.py | 20 +- python/tvm/meta_schedule/builder/builder.py | 44 +- python/tvm/meta_schedule/builder/local_builder.py | 8 +- python/tvm/meta_schedule/cost_model/cost_model.py | 46 +- python/tvm/meta_schedule/cost_model/mlp_model.py | 6 +- .../tvm/meta_schedule/cost_model/random_model.py | 3 +- python/tvm/meta_schedule/cost_model/xgb_model.py | 311 ++++--- python/tvm/meta_schedule/database/database.py | 89 ++- python/tvm/meta_schedule/database/json_database.py | 10 +- .../tvm/meta_schedule/database/memory_database.py | 20 +- .../meta_schedule/database/schedule_fn_database.py | 17 +- python/tvm/meta_schedule/default_config.py | 401 ---------- .../feature_extractor/feature_extractor.py | 26 +- .../feature_extractor/random_feature_extractor.py | 8 +- python/tvm/meta_schedule/logging.py | 263 ++++++ .../tvm/meta_schedule/measure_callback/__init__.py | 7 +- .../measure_callback/echo_statistics.py | 30 - .../measure_callback/measure_callback.py | 16 +- python/tvm/meta_schedule/mutator/mutator.py | 44 +- python/tvm/meta_schedule/postproc/postproc.py | 36 +- python/tvm/meta_schedule/profiler.py | 4 - python/tvm/meta_schedule/relay_integration.py | 378 +++++++-- python/tvm/meta_schedule/runner/local_runner.py | 6 +- python/tvm/meta_schedule/runner/rpc_runner.py | 4 +- python/tvm/meta_schedule/runner/runner.py | 37 +- .../meta_schedule/schedule_rule/schedule_rule.py | 35 + .../search_strategy/evolutionary_search.py | 10 - .../meta_schedule/search_strategy/replay_func.py | 11 +- .../meta_schedule/search_strategy/replay_trace.py | 12 +- .../search_strategy/search_strategy.py | 86 +- .../space_generator/post_order_apply.py | 26 +- .../meta_schedule/space_generator/schedule_fn.py | 20 +- .../space_generator/space_generator.py | 130 ++- .../space_generator/space_generator_union.py | 20 +- .../meta_schedule/task_scheduler/gradient_based.py | 49 +- .../meta_schedule/task_scheduler/round_robin.py | 77 +- .../meta_schedule/task_scheduler/task_scheduler.py | 276 +++---- .../meta_schedule/testing/dataset_extract_tasks.py | 4 +- .../testing/dataset_sample_candidates.py | 17 +- python/tvm/meta_schedule/testing/relay_workload.py | 52 +- python/tvm/meta_schedule/testing/schedule_rule.py | 36 - .../tvm/meta_schedule/testing/space_generation.py | 49 +- python/tvm/meta_schedule/testing/tlcbench.py | 5 +- .../testing/torchbench}/__init__.py | 0 python/tvm/meta_schedule/testing/torchbench/run.py | 584 ++++++++++++++ .../tvm/meta_schedule/testing/torchbench/utils.py | 103 +++ python/tvm/meta_schedule/testing/tune_onnx.py | 52 +- python/tvm/meta_schedule/testing/tune_relay.py | 44 +- python/tvm/meta_schedule/testing/tune_te.py | 41 +- python/tvm/meta_schedule/tir_integration.py | 155 ++++ python/tvm/meta_schedule/tune.py | 664 ++------------- python/tvm/meta_schedule/tune_context.py | 201 ++--- python/tvm/meta_schedule/utils.py | 159 ++-- python/tvm/micro/build.py | 26 +- python/tvm/micro/contrib/stm32/emitter.py | 8 +- python/tvm/micro/model_library_format.py | 32 +- python/tvm/micro/project_api/server.py | 84 ++ python/tvm/micro/session.py | 56 +- python/tvm/micro/testing/evaluation.py | 41 +- .../tvm/relay/backend/contrib/ethosu/tir/utils.py | 30 + .../backend/contrib/ethosu/tir_to_cs_translator.py | 37 +- python/tvm/relay/backend/name_transforms.py | 11 - python/tvm/relay/backend/te_compiler.py | 33 +- python/tvm/relay/build_module.py | 4 +- python/tvm/relay/collage/collage.py | 2 +- python/tvm/relay/frontend/onnx.py | 48 +- python/tvm/relay/frontend/pytorch.py | 59 ++ python/tvm/relay/frontend/qnn_torch.py | 4 +- python/tvm/relay/frontend/tflite.py | 2 +- python/tvm/relay/op/contrib/cmsisnn.py | 47 +- python/tvm/relay/op/contrib/ethosn.py | 84 +- python/tvm/relay/op/op_attrs.py | 60 ++ python/tvm/relay/op/reduce.py | 7 +- python/tvm/relay/op/strategy/arm_cpu.py | 89 ++- python/tvm/relay/op/strategy/hexagon.py | 53 +- python/tvm/relay/qnn/op/_qnn.py | 35 +- python/tvm/relay/qnn/op/qnn.py | 7 - .../tvm/relay/qnn/strategy/__init__.py | 7 +- python/tvm/relay/qnn/strategy/generic.py | 249 ++++++ python/tvm/relay/qnn/strategy/hexagon.py | 136 ++++ .../__init__.py => runtime/name_transforms.py} | 20 +- python/tvm/runtime/vm.py | 36 + .../tvm/script/_parser/__init__.py | 8 +- .../tvm/script/_parser/_core.py | 9 +- .../tvm/script/_parser/core/__init__.py | 6 +- python/tvm/script/_parser/core/diagnostics.py | 241 ++++++ python/tvm/script/_parser/core/dispatch.py | 156 ++++ python/tvm/script/_parser/core/doc.py | 462 +++++++++++ .../script/{printer => _parser/core}/doc_core.py | 0 .../dense.py => script/_parser/core/entry.py} | 34 +- python/tvm/script/_parser/core/evaluator.py | 509 ++++++++++++ python/tvm/script/_parser/core/parser.py | 647 +++++++++++++++ python/tvm/script/_parser/core/utils.py | 61 ++ python/tvm/script/ir_builder/tir/ir.py | 396 ++++++++- python/tvm/script/tir/__init__.py | 1 + python/tvm/script/tir/scope_handler.py | 42 +- python/tvm/script/tir/special_stmt.py | 68 +- python/tvm/script/tir/ty.py | 12 +- python/tvm/target/target.py | 4 + python/tvm/te/__init__.py | 1 + python/tvm/testing/utils.py | 185 ++++- python/tvm/tir/__init__.py | 1 + python/tvm/tir/function.py | 16 + python/tvm/tir/schedule/schedule.py | 99 ++- python/tvm/tir/tensor_intrin/cuda.py | 7 +- python/tvm/tir/transform/transform.py | 55 +- python/tvm/topi/adreno/conv2d_alter_op.py | 3 + python/tvm/topi/arm_cpu/conv2d.py | 16 + python/tvm/topi/arm_cpu/conv2d_alter_op.py | 38 +- python/tvm/topi/arm_cpu/depthwise_conv2d.py | 19 +- .../topi/arm_cpu/mprofile/dsp/depthwise_conv2d.py | 126 +-- .../arm_cpu/mprofile/dsp/micro_kernel/common.py | 15 + .../topi/arm_cpu/mprofile/dsp/micro_kernel/gemm.py | 8 +- .../dsp/micro_kernel/multi_channel_convolve.py | 210 +++++ .../dsp/micro_kernel/quad_channel_convolve.py | 180 ----- .../arm_cpu/mprofile/dsp/micro_kernel/tensordot.py | 155 ++++ .../topi/arm_cpu/mprofile/dsp/tensordot_conv2ds.py | 271 +++++++ python/tvm/topi/generic/conv2d.py | 11 +- python/tvm/topi/hexagon/__init__.py | 2 + python/tvm/topi/hexagon/conv2d.py | 49 +- python/tvm/topi/hexagon/conv2d_alter_op.py | 111 +++ python/tvm/topi/hexagon/dense.py | 73 +- python/tvm/topi/hexagon/dense_alter_op.py | 147 ++++ python/tvm/topi/hexagon/injective.py | 3 +- python/tvm/topi/hexagon/qnn/__init__.py | 3 + python/tvm/topi/hexagon/qnn/nn.py | 667 ++++++++++++++++ python/tvm/topi/hexagon/qnn/quantize.py | 80 ++ python/tvm/topi/hexagon/slice_ops/__init__.py | 1 + .../tvm/topi/hexagon/slice_ops/depth_to_space.py | 43 + python/tvm/topi/hexagon/slice_ops/max_pool2d.py | 55 +- python/tvm/topi/hexagon/tensor_intrin.py | 86 ++ python/tvm/topi/hexagon/utils.py | 5 + python/tvm/topi/transform.py | 2 +- python/tvm/topi/utils.py | 28 +- python/tvm/utils/roofline/__init__.py | 64 +- python/tvm/utils/roofline/cuda.py | 167 +++- python/tvm/utils/roofline/registry.py | 46 +- python/tvm/utils/roofline/x86.py | 261 +++--- rust/.rustfmt.toml | 1 - src/arith/analyzer.cc | 3 + src/arith/canonical_simplify.cc | 10 +- src/arith/conjunctive_normal_form.cc | 430 ++++++++++ .../conjunctive_normal_form.h} | 40 +- src/arith/iter_affine_map.cc | 45 +- src/arith/rewrite_simplify.cc | 90 ++- src/arith/rewrite_simplify.h | 32 +- src/arith/transitive_comparison_analyzer.cc | 791 ++++++++++++++++++ src/auto_scheduler/feature.cc | 7 +- src/auto_scheduler/search_policy/utils.h | 15 +- src/auto_scheduler/search_task.cc | 4 +- src/driver/driver_api.cc | 11 +- src/ir/expr.cc | 3 + src/meta_schedule/arg_info.cc | 3 +- src/meta_schedule/database/database.cc | 20 +- src/meta_schedule/database/json_database.cc | 22 +- src/meta_schedule/database/memory_database.cc | 13 +- src/meta_schedule/database/schedule_fn_database.cc | 7 +- .../feature_extractor/per_store_feature.cc | 27 +- .../measure_callback/add_to_database.cc | 6 +- .../measure_callback/echo_statistics.cc | 134 ---- .../measure_callback/measure_callback.cc | 10 + .../measure_callback/update_cost_model.cc | 11 +- src/meta_schedule/module_equality.cc | 87 ++ src/meta_schedule/module_equality.h | 75 ++ src/meta_schedule/mutator/mutator.cc | 30 + src/meta_schedule/postproc/postproc.cc | 47 ++ .../postproc/rewrite_cooperative_fetch.cc | 2 +- src/meta_schedule/postproc/rewrite_layout.cc | 58 +- .../schedule_rule/cross_thread_reduction.cc | 4 +- .../schedule_rule/multi_level_tiling.cc | 4 +- .../schedule_rule/multi_level_tiling.h | 2 +- .../multi_level_tiling_tensor_core.cc | 28 +- .../multi_level_tiling_wide_vector.cc | 6 + .../multi_level_tiling_with_intrin.cc | 6 +- src/meta_schedule/schedule_rule/schedule_rule.cc | 154 ++++ .../search_strategy/evolutionary_search.cc | 174 ++-- src/meta_schedule/search_strategy/replay_func.cc | 98 +-- src/meta_schedule/search_strategy/replay_trace.cc | 97 +-- .../search_strategy/search_strategy.cc | 5 +- .../space_generator/post_order_apply.cc | 35 +- src/meta_schedule/space_generator/schedule_fn.cc | 11 +- .../space_generator/space_generator.cc | 96 +++ .../space_generator/space_generator_union.cc | 16 +- src/meta_schedule/task_scheduler/gradient_based.cc | 195 ++--- src/meta_schedule/task_scheduler/round_robin.cc | 23 +- src/meta_schedule/task_scheduler/task_scheduler.cc | 408 +++++++--- src/meta_schedule/tune_context.cc | 180 +---- src/meta_schedule/utils.h | 107 ++- src/node/ndarray_hash_equal.h | 52 ++ src/node/structural_equal.cc | 140 +++- src/node/structural_hash.cc | 115 +-- src/printer/relay_text_printer.cc | 9 +- src/relay/backend/aot/aot_lower_main.cc | 5 +- src/relay/backend/aot_executor_codegen.cc | 3 +- src/relay/backend/build_module.cc | 4 +- src/relay/backend/contrib/cmsisnn/buffer_size.cc | 80 +- src/relay/backend/contrib/cmsisnn/buffer_size.h | 36 +- .../backend/contrib/cmsisnn/generate_constants.cc | 20 +- src/relay/backend/contrib/cmsisnn/relay_to_tir.cc | 52 +- .../contrib/cmsisnn/scalar_to_tensor_constant.cc | 6 +- .../backend/contrib/cmsisnn/tir_to_runtime.cc | 4 +- src/relay/backend/contrib/constant_transforms.cc | 58 ++ src/relay/backend/contrib/constant_transforms.h | 59 ++ src/relay/backend/contrib/ethosn/codegen.cc | 43 +- src/relay/backend/contrib/ethosn/codegen_ethosn.h | 1 + .../backend/contrib/ethosn/convert_equivalent.cc | 325 +++++--- src/relay/backend/contrib/ethosn/ethosn_api.cc | 85 +- src/relay/backend/contrib/ethosn/ethosn_api.h | 29 +- src/relay/backend/contrib/uma/targets.cc | 2 +- src/relay/backend/interpreter.cc | 6 +- src/relay/backend/name_transforms.cc | 12 +- src/relay/backend/name_transforms.h | 7 - src/relay/backend/task_extraction.cc | 34 +- src/relay/backend/te_compiler.cc | 2 +- src/relay/backend/te_compiler_cache.cc | 283 ++++++- src/relay/backend/utils.h | 7 + src/relay/backend/vm/compiler.cc | 6 +- src/relay/collage/README.md | 2 +- src/relay/ir/dataflow_matcher.cc | 5 + src/relay/op/tensor/math.cc | 2 +- src/relay/op/tensor/transform.cc | 24 +- src/relay/qnn/pass/legalize.cc | 2 +- src/relay/transforms/fuse_ops.cc | 4 +- .../transforms/meta_schedule_layout_rewrite.cc | 11 +- src/relay/transforms/partition_graph.cc | 5 +- src/relay/transforms/simplify_expr.cc | 33 +- src/runtime/aot_executor/aot_executor.cc | 11 +- src/runtime/builtin_fp16.cc | 3 - src/runtime/contrib/ethosn/ethosn_device.cc | 64 +- src/runtime/crt/host/Makefile | 2 +- src/runtime/hexagon/README.md | 1 - src/runtime/hexagon/hexagon_buffer.cc | 47 +- src/runtime/hexagon/hexagon_buffer_manager.h | 13 + src/runtime/hexagon/hexagon_device_api.cc | 48 +- src/runtime/hexagon/hexagon_device_api.h | 76 +- src/runtime/hexagon/hexagon_user_dma.cc | 28 +- src/runtime/hexagon/hexagon_user_dma.h | 36 +- src/runtime/hexagon/hexagon_vtcm_pool.cc | 164 ++++ src/runtime/hexagon/hexagon_vtcm_pool.h | 100 +++ src/runtime/hexagon/ring_buffer.h | 41 +- src/runtime/hexagon/rpc/android/session.cc | 2 +- src/runtime/hexagon/rpc/hexagon/rpc_server.cc | 34 +- src/runtime/hexagon/rpc/simulator/session.cc | 1 - src/runtime/logging.cc | 29 + src/runtime/micro/micro_session.cc | 9 + .../legalize.cc => runtime/name_transforms.cc} | 35 +- src/runtime/rpc/rpc_endpoint.cc | 5 + src/runtime/rpc/rpc_endpoint.h | 18 +- src/runtime/rpc/rpc_module.cc | 4 + src/runtime/rpc/rpc_session.h | 5 + src/runtime/vm/vm.cc | 215 ++++- src/runtime/vulkan/vulkan_device.h | 2 +- src/script/ir_builder/tir/ir.cc | 11 + src/support/libinfo.cc | 6 + src/target/compilation_config.cc | 23 +- src/target/llvm/codegen_llvm.cc | 227 ++++++ src/target/llvm/codegen_llvm.h | 8 + src/target/llvm/intrin_rule_hexagon.cc | 10 +- src/target/source/codegen_opencl.cc | 16 +- src/target/source/codegen_opencl.h | 1 + src/target/source/interface_c.cc | 1 + src/target/source/source_module.cc | 9 +- src/target/spirv/spirv_support.cc | 2 +- src/target/tag.cc | 9 + src/target/target.cc | 14 +- src/target/virtual_device.cc | 8 +- src/te/operation/create_primfunc.cc | 24 +- src/te/schedule/schedule_lang.cc | 2 +- src/tir/analysis/buffer_access_lca_detector.cc | 106 ++- src/tir/analysis/estimate_flops.cc | 6 +- src/tir/analysis/verify_memory.cc | 2 +- src/tir/ir/data_type_rewriter.cc | 195 +++++ src/tir/ir/index_map.cc | 199 +++-- src/tir/ir/stmt.cc | 2 +- src/tir/op/op.cc | 2 +- src/tir/schedule/analysis.h | 12 + src/tir/schedule/analysis/analysis.cc | 9 + src/tir/schedule/concrete_schedule.cc | 15 +- src/tir/schedule/concrete_schedule.h | 2 + src/tir/schedule/ir_comparator.cc | 20 +- src/tir/schedule/primitive.h | 14 +- src/tir/schedule/primitive/blockize_tensorize.cc | 2 +- src/tir/schedule/primitive/cache_read_write.cc | 362 +++++++-- src/tir/schedule/primitive/get_block_loop.cc | 10 +- .../schedule/primitive/layout_transformation.cc | 59 +- src/tir/schedule/schedule.cc | 2 + src/tir/schedule/state.cc | 1 + src/tir/schedule/traced_schedule.cc | 16 + src/tir/schedule/traced_schedule.h | 2 + src/tir/schedule/transform.cc | 25 +- src/tir/schedule/transform.h | 9 +- src/tir/transforms/inject_rolling_buffer.cc | 6 +- src/tir/transforms/inject_software_pipeline.cc | 67 +- src/tir/transforms/lower_async_dma.cc | 7 +- src/tir/transforms/make_packed_api.cc | 111 +-- src/tir/transforms/make_unpacked_api.cc | 9 +- src/tir/transforms/narrow_datatype.cc | 154 +--- .../remove_weight_layout_rewrite_block.cc | 213 ++++- src/tir/transforms/simplify.cc | 43 +- src/tir/transforms/texture_flatten.cc | 3 +- src/tir/usmp/analysis/extract_buffer_info.cc | 20 +- .../convert_pool_allocations_to_offsets.cc | 10 + src/topi/einsum.cc | 353 ++++++++ src/topi/transform.cc | 4 - .../hexagon/hexagon_device_api_tests.cc | 44 +- .../cpp-runtime/hexagon/hexagon_user_dma_tests.cc | 110 ++- .../cpp-runtime/hexagon/hexagon_vtcm_pool_tests.cc | 202 +++++ tests/cpp-runtime/hexagon/ring_buffer_tests.cc | 30 +- tests/cpp/data_type_rewriter_test.cc | 140 ++++ tests/cpp/name_transforms_test.cc | 1 + tests/cpp/ndarray_test.cc | 73 ++ .../backend/contrib/cmsisnn/buffer_size_test.cc | 86 +- tests/crt/contrib/stm32/Makefile | 2 +- tests/lint/check_file_type.py | 2 + tests/lint/docker-format.sh | 2 +- tests/lint/pylint.sh | 1 - tests/lint/trailing_newlines.py | 51 ++ .../whitespace.sh} | 33 +- tests/micro/arduino/README.md | 4 +- tests/micro/arduino/test_arduino_rpc_server.py | 2 +- tests/micro/arduino/test_utils.py | 2 +- tests/micro/common/test_tvmc.py | 14 +- .../test_arduino_microtvm_api_server.py | 26 +- tests/micro/project_api/test_project_api.py | 92 +++ tests/micro/zephyr/README.md | 6 +- tests/micro/zephyr/test_utils.py | 2 +- tests/micro/zephyr/test_zephyr.py | 6 +- tests/micro/zephyr/test_zephyr_aot_exec.py | 2 +- tests/python/ci/sample_prs/pr10786.json | 2 +- .../python/contrib/test_clml/conftest.py | 10 +- tests/python/contrib/test_clml/infrastructure.py | 43 +- tests/python/contrib/test_clml/test_network.py | 52 +- tests/python/contrib/test_clml/test_ops.py | 74 +- tests/python/contrib/test_cmsisnn/test_conv2d.py | 142 ++-- tests/python/contrib/test_cmsisnn/utils.py | 65 +- tests/python/contrib/test_ethosn/test_addition.py | 70 +- .../test_ethosn/test_convert_equivalents.py | 318 +++++++- .../contrib/test_ethosn/test_fullyconnected.py | 56 +- tests/python/contrib/test_ethosn/test_multiply.py | 102 ++- tests/python/contrib/test_ethosn/test_networks.py | 36 +- tests/python/contrib/test_ethosn/test_split.py | 13 - .../python/contrib/test_ethosn/test_topologies.py | 17 +- tests/python/contrib/test_ethosu/infra.py | 2 +- tests/python/contrib/test_ethosu/test_codegen.py | 2 +- .../contrib/test_ethosu/test_create_tiles.py | 14 +- .../contrib/test_ethosu/test_layout_optimizer.py | 2 +- .../contrib/test_ethosu/test_lut_optimizer.py | 2 +- .../test_ethosu/test_replace_depthwise_conv2d.py | 5 +- tests/python/contrib/test_hexagon/README.md | 2 +- tests/python/contrib/test_hexagon/conv2d/README.md | 2 +- .../test_hexagon/conv2d/test_conv2d_blocked.md | 38 +- .../test_hexagon/conv2d/test_conv2d_conv2d.md | 50 +- .../python/contrib/test_hexagon/infrastructure.py | 12 +- .../test_hexagon/test_2d_physical_buffers.py | 16 +- .../test_hexagon/test_async_dma_pipeline.py | 353 ++++++++ tests/python/contrib/test_hexagon/test_autotvm.py | 11 +- .../test_hexagon/test_benchmark_elemwise_add.py | 8 +- .../test_hexagon/test_benchmark_maxpool2d.py | 12 +- .../contrib/test_hexagon/test_cache_read_write.py | 5 +- .../test_hexagon/test_fixed_point_multiply.py | 8 +- tests/python/contrib/test_hexagon/test_launcher.py | 166 +++- .../contrib/test_hexagon/test_memory_alloc.py | 11 +- .../contrib/test_hexagon/test_meta_schedule.py | 304 ++++++- tests/python/contrib/test_hexagon/test_models.py | 9 +- .../contrib/test_hexagon/test_parallel_hvx.py | 13 +- .../test_hexagon/test_parallel_hvx_load_vtcm.py | 16 +- .../contrib/test_hexagon/test_parallel_scalar.py | 11 +- tests/python/contrib/test_hexagon/test_sigmoid.py | 113 +++ .../test_hexagon/test_software_pipeline_async.py | 187 ++++- .../contrib/test_hexagon/test_thread_pool.py | 10 +- .../contrib/test_hexagon/test_vtcm_bandwidth.py | 12 +- .../test_hexagon/test_wo_qnn_canonicalization.py | 185 +++++ .../topi/test_add_subtract_multiply.py | 10 +- .../contrib/test_hexagon/topi/test_argmax_slice.py | 6 +- .../test_hexagon/topi/test_avg_pool2d_slice.py | 12 +- .../contrib/test_hexagon/topi/test_batch_matmul.py | 11 +- .../contrib/test_hexagon/topi/test_cast_slice.py | 14 +- .../python/contrib/test_hexagon/topi/test_clip.py | 9 +- .../test_hexagon/topi/test_conv2d_fp16_intrin.py | 9 +- .../contrib/test_hexagon/topi/test_conv2d_nchw.py | 6 +- .../contrib/test_hexagon/topi/test_conv2d_nhwc.py | 10 +- .../contrib/test_hexagon/topi/test_conv2d_slice.py | 8 +- .../test_hexagon/topi/test_conv2d_transpose.py | 11 +- .../python/contrib/test_hexagon/topi/test_dense.py | 9 +- .../test_hexagon/topi/test_depth_to_space.py | 133 +++ .../test_hexagon/topi/test_depthwise_conv2d.py | 7 +- .../test_hexagon/topi/test_dequantize_slice.py | 11 +- .../test_hexagon/topi/test_dwconv2d_slice.py | 8 +- .../test_hexagon/topi/test_max_pool2d_slice.py | 114 +-- tests/python/contrib/test_hexagon/topi/test_pad.py | 7 +- .../contrib/test_hexagon/topi/test_pooling.py | 12 +- .../contrib/test_hexagon/topi/test_quantize.py | 124 +++ .../contrib/test_hexagon/topi/test_reduce.py | 8 +- .../contrib/test_hexagon/topi/test_relu_slice.py | 9 +- .../contrib/test_hexagon/topi/test_reshape.py | 10 +- .../contrib/test_hexagon/topi/test_resize2d.py | 5 +- .../contrib/test_hexagon/topi/test_softmax.py | 9 +- .../test_hexagon/topi/test_softmax_slice.py | 3 +- .../contrib/test_hexagon/topi/test_tanh_slice.py | 6 +- tests/python/frontend/onnx/test_forward.py | 20 +- tests/python/frontend/pytorch/test_forward.py | 29 + ...le_auto_tensorize.py => test_auto_tensorize.py} | 207 ++--- .../{test_tuning.py => test_legacy_tuning.py} | 0 tests/python/integration/test_tuning.py | 433 ++-------- tests/python/relay/aot/test_cpp_aot.py | 41 +- tests/python/relay/aot/test_crt_aot_usmp.py | 2 +- tests/python/relay/backend/test_pass_lower_te.py | 20 +- .../relay/collage/demo_collage_partitioner.py | 12 +- tests/python/relay/collage/menangerie.py | 4 +- tests/python/relay/collage/test_sub_graph.py | 36 +- .../opencl_texture/test_conv2d_nchw_texture.py | 75 +- .../opencl_texture/test_conv2d_nhwc_texture.py | 101 +++ .../relay/strategy/arm_cpu/test_conv2d_nhwc.py | 36 +- .../strategy/arm_cpu/test_depthwise_conv2d.py | 70 +- tests/python/relay/test_dataflow_pattern.py | 30 + tests/python/relay/test_ir_parser.py | 2 +- tests/python/relay/test_name_transforms.py | 4 +- tests/python/relay/test_op_level1.py | 2 +- tests/python/relay/test_op_level10.py | 2 +- tests/python/relay/test_op_level2.py | 65 +- tests/python/relay/test_op_level3.py | 19 + tests/python/relay/test_pass_collage_partition.py | 22 +- tests/python/relay/test_pass_convert_op_layout.py | 49 ++ .../relay/test_pass_dead_code_elimination.py | 2 +- tests/python/relay/test_pass_merge_composite.py | 14 +- tests/python/relay/test_pass_plan_devices.py | 6 +- tests/python/relay/test_pass_simplify_expr.py | 18 + tests/python/relay/test_target_hooks.py | 4 +- tests/python/relay/test_vm.py | 92 ++- .../transform/test_compiler_function_utils.py | 4 +- tests/python/topi/python/test_topi_einsum.py | 36 +- .../python/unittest/test_auto_scheduler_feature.py | 13 +- tests/python/unittest/test_index_map.py | 68 +- .../unittest/test_meta_schedule_cost_model.py | 109 ++- .../test_meta_schedule_custom_rule_winograd_cpu.py | 4 - ...test_meta_schedule_custom_rule_winograd_cuda.py | 3 - ...schedule_feature_extractor_per_store_feature.py | 62 +- .../unittest/test_meta_schedule_integration.py | 343 -------- .../test_meta_schedule_measure_callback.py | 18 +- ...eta_schedule_mutator_mutate_compute_location.py | 17 +- .../test_meta_schedule_mutator_mutate_parallel.py | 17 +- ..._meta_schedule_mutator_mutate_thread_binding.py | 17 +- .../test_meta_schedule_mutator_mutate_tile_size.py | 15 +- .../test_meta_schedule_mutator_mutate_unroll.py | 17 +- .../test_meta_schedule_post_order_apply.py | 50 +- ...meta_schedule_postproc_disallow_dynamic_loop.py | 21 +- ..._schedule_postproc_rewrite_cooperative_fetch.py | 22 +- .../test_meta_schedule_postproc_rewrite_layout.py | 137 +++- ...ta_schedule_postproc_rewrite_reduction_block.py | 21 +- ...est_meta_schedule_postproc_rewrite_tensorize.py | 26 +- ...meta_schedule_postproc_rewrite_unbound_block.py | 26 +- .../test_meta_schedule_postproc_verify_gpu_code.py | 23 +- .../test_meta_schedule_relay_integration.py | 631 +++++++++++++++ ...test_meta_schedule_schedule_rule_add_rfactor.py | 23 +- .../test_meta_schedule_schedule_rule_auto_bind.py | 33 +- ...test_meta_schedule_schedule_rule_auto_inline.py | 43 +- ...chedule_schedule_rule_cross_thread_reduction.py | 51 +- .../test_meta_schedule_schedule_rule_mlt.py | 61 +- .../test_meta_schedule_schedule_rule_mlt_intrin.py | 19 +- .../test_meta_schedule_schedule_rule_mlt_tc.py | 430 +++++----- ...dule_schedule_rule_parallel_vectorize_unroll.py | 21 +- ...hedule_schedule_rule_random_compute_location.py | 13 +- .../unittest/test_meta_schedule_search_strategy.py | 45 +- .../unittest/test_meta_schedule_space_cpu.py | 106 +-- .../unittest/test_meta_schedule_space_cuda.py | 116 +-- .../unittest/test_meta_schedule_space_generator.py | 7 +- .../unittest/test_meta_schedule_task_scheduler.py | 155 ++-- .../unittest/test_meta_schedule_tune_relay.py | 553 ------------- .../python/unittest/test_meta_schedule_tune_te.py | 52 -- .../python/unittest/test_meta_schedule_tune_tir.py | 115 ++- .../test_meta_schedule_vnni_integration.py | 250 ++++++ tests/python/unittest/test_micro_project_api.py | 35 + tests/python/unittest/test_roofline.py | 119 +-- tests/python/unittest/test_runtime_profiling.py | 2 +- tests/python/unittest/test_slice_tir.py | 216 +++++ tests/python/unittest/test_target_codegen_llvm.py | 7 +- tests/python/unittest/test_target_codegen_x86.py | 74 +- tests/python/unittest/test_target_target.py | 4 +- tests/python/unittest/test_te_create_primfunc.py | 2 + .../test_tir_analysis_estimate_tir_flops.py | 15 + tests/python/unittest/test_tir_constructor.py | 11 +- .../python/unittest/test_tir_lower_match_buffer.py | 4 +- .../python/unittest/test_tir_schedule_analysis.py | 5 +- .../python/unittest/test_tir_schedule_blockize.py | 35 + .../unittest/test_tir_schedule_cache_read_write.py | 121 ++- tests/python/unittest/test_tir_schedule_reindex.py | 151 +++- .../python/unittest/test_tir_schedule_tensorize.py | 85 ++ tests/python/unittest/test_tir_schedule_trace.py | 1 - .../unittest/test_tir_schedule_transform_layout.py | 114 ++- .../python/unittest/test_tir_schedule_utilities.py | 52 ++ .../test_tir_transform_compact_buffer_region.py | 2 +- .../test_tir_transform_inject_rolling_buffer.py | 25 +- .../test_tir_transform_inject_software_pipeline.py | 67 ++ .../unittest/test_tir_transform_make_packed_api.py | 5 +- ...sform_plan_update_buffer_allocation_location.py | 109 ++- .../python/unittest/test_tir_transform_simplify.py | 271 ++++++- ...ransform_convert_pool_allocations_to_offsets.py | 93 +++ tests/python/unittest/test_transform_layout.py | 13 + .../unittest/test_tvm_testing_before_after.py | 49 +- .../unittest/test_tvmscript_ir_builder_tir.py | 15 + .../unittest/test_tvmscript_parser_evaluator.py | 63 ++ .../unittest/test_tvmscript_parser_source.py | 86 ++ tests/python/unittest/test_tvmscript_roundtrip.py | 46 ++ .../python/unittest/test_tvmscript_syntax_sugar.py | 51 ++ tests/scripts/ci.py | 34 +- tests/scripts/release/.gitignore | 1 - .../scripts/release/PRERELEASE_NOTES.md | 7 +- tests/scripts/release/README.md | 2 +- tests/scripts/request_hook/request_hook.py | 40 +- tests/scripts/task_build_adreno_bins.sh | 53 ++ ...ig_build_jvm.sh => task_config_build_adreno.sh} | 17 +- tests/scripts/task_config_build_arm.sh | 1 + tests/scripts/task_config_build_cortexm.sh | 2 +- tests/scripts/task_config_build_cpu.sh | 1 + tests/scripts/task_config_build_gpu.sh | 1 + tests/scripts/task_config_build_gpu_other.sh | 1 + tests/scripts/task_config_build_hexagon.sh | 1 + tests/scripts/task_config_build_i386.sh | 2 +- tests/scripts/task_config_build_jvm.sh | 2 +- tests/scripts/task_cpp_unittest.sh | 1 - tests/scripts/task_lint.sh | 3 + tests/scripts/task_python_adreno.sh | 65 ++ tests/scripts/task_python_hexagon.sh | 2 +- tests/scripts/task_python_integration.sh | 2 +- tests/scripts/task_python_microtvm.sh | 4 +- version.py | 2 +- vta/README.md | 2 +- vta/tutorials/autotvm/README.txt | 1 - web/Makefile | 2 +- web/package.json | 2 +- web/src/compact.ts | 2 +- web/src/environment.ts | 2 +- web/src/index.ts | 2 +- web/src/support.ts | 2 +- web/tests/node/test_ndarray.js | 1 - 726 files changed, 26690 insertions(+), 10442 deletions(-) delete mode 100644 .github/ISSUE_TEMPLATE/ci-image.md rename .github/{workflows => disabled_workflows}/pr_comment_bot.yml (100%) create mode 100644 apps/microtvm/arduino/template_project/Makefile.template create mode 100644 cmake/utils/CCache.cmake copy docker/{install/ubuntu_install_cmake_source.sh => Dockerfile.ci_adreno} (64%) mode change 100755 => 100644 create mode 100644 gallery/how_to/work_with_pytorch/using_as_torch.py create mode 100644 gallery/how_to/work_with_pytorch/using_optimized_torch.py copy src/relay/qnn/pass/legalize.cc => include/tvm/runtime/name_transforms.h (59%) delete mode 100644 python/tvm/meta_schedule/default_config.py create mode 100644 python/tvm/meta_schedule/logging.py delete mode 100644 python/tvm/meta_schedule/measure_callback/echo_statistics.py delete mode 100644 python/tvm/meta_schedule/testing/schedule_rule.py copy python/tvm/{contrib/cuda_graph => meta_schedule/testing/torchbench}/__init__.py (100%) create mode 100644 python/tvm/meta_schedule/testing/torchbench/run.py create mode 100644 python/tvm/meta_schedule/testing/torchbench/utils.py create mode 100644 python/tvm/meta_schedule/tir_integration.py copy docker/utils/apt-install-and-clear.sh => python/tvm/relay/qnn/strategy/__init__.py (83%) mode change 100755 => 100644 create mode 100644 python/tvm/relay/qnn/strategy/generic.py create mode 100644 python/tvm/relay/qnn/strategy/hexagon.py copy python/tvm/{meta_schedule/measure_callback/__init__.py => runtime/name_transforms.py} (70%) copy docker/utils/apt-install-and-clear.sh => python/tvm/script/_parser/__init__.py (91%) mode change 100755 => 100644 copy docker/utils/apt-install-and-clear.sh => python/tvm/script/_parser/_core.py (87%) mode change 100755 => 100644 copy docker/utils/apt-install-and-clear.sh => python/tvm/script/_parser/core/__init__.py (87%) mode change 100755 => 100644 create mode 100644 python/tvm/script/_parser/core/diagnostics.py create mode 100644 python/tvm/script/_parser/core/dispatch.py create mode 100644 python/tvm/script/_parser/core/doc.py rename python/tvm/script/{printer => _parser/core}/doc_core.py (100%) copy python/tvm/{topi/hexagon/dense.py => script/_parser/core/entry.py} (54%) create mode 100644 python/tvm/script/_parser/core/evaluator.py create mode 100644 python/tvm/script/_parser/core/parser.py create mode 100644 python/tvm/script/_parser/core/utils.py create mode 100644 python/tvm/topi/arm_cpu/mprofile/dsp/micro_kernel/multi_channel_convolve.py delete mode 100644 python/tvm/topi/arm_cpu/mprofile/dsp/micro_kernel/quad_channel_convolve.py create mode 100644 python/tvm/topi/arm_cpu/mprofile/dsp/micro_kernel/tensordot.py create mode 100644 python/tvm/topi/arm_cpu/mprofile/dsp/tensordot_conv2ds.py create mode 100644 python/tvm/topi/hexagon/conv2d_alter_op.py create mode 100644 python/tvm/topi/hexagon/dense_alter_op.py create mode 100644 python/tvm/topi/hexagon/qnn/nn.py create mode 100755 python/tvm/topi/hexagon/qnn/quantize.py create mode 100644 python/tvm/topi/hexagon/slice_ops/depth_to_space.py create mode 100644 src/arith/conjunctive_normal_form.cc copy src/{relay/qnn/pass/legalize.cc => arith/conjunctive_normal_form.h} (58%) create mode 100644 src/arith/transitive_comparison_analyzer.cc delete mode 100644 src/meta_schedule/measure_callback/echo_statistics.cc create mode 100644 src/meta_schedule/module_equality.cc create mode 100644 src/meta_schedule/module_equality.h create mode 100644 src/node/ndarray_hash_equal.h create mode 100644 src/relay/backend/contrib/constant_transforms.cc create mode 100644 src/relay/backend/contrib/constant_transforms.h create mode 100644 src/runtime/hexagon/hexagon_vtcm_pool.cc create mode 100644 src/runtime/hexagon/hexagon_vtcm_pool.h copy src/{relay/qnn/pass/legalize.cc => runtime/name_transforms.cc} (58%) create mode 100644 src/tir/ir/data_type_rewriter.cc create mode 100644 src/topi/einsum.cc create mode 100644 tests/cpp-runtime/hexagon/hexagon_vtcm_pool_tests.cc create mode 100644 tests/cpp/data_type_rewriter_test.cc create mode 100644 tests/cpp/ndarray_test.cc create mode 100755 tests/lint/trailing_newlines.py copy tests/{scripts/task_config_build_jvm.sh => lint/whitespace.sh} (60%) rename {apps/microtvm/arduino/template_project/tests => tests/micro/project_api}/test_arduino_microtvm_api_server.py (89%) create mode 100644 tests/micro/project_api/test_project_api.py copy docker/utils/apt-install-and-clear.sh => tests/python/contrib/test_clml/conftest.py (84%) mode change 100755 => 100644 mode change 100755 => 100644 tests/python/contrib/test_hexagon/test_2d_physical_buffers.py create mode 100644 tests/python/contrib/test_hexagon/test_async_dma_pipeline.py create mode 100644 tests/python/contrib/test_hexagon/test_sigmoid.py create mode 100644 tests/python/contrib/test_hexagon/test_wo_qnn_canonicalization.py create mode 100644 tests/python/contrib/test_hexagon/topi/test_depth_to_space.py create mode 100755 tests/python/contrib/test_hexagon/topi/test_quantize.py rename tests/python/integration/{test_meta_schedule_auto_tensorize.py => test_auto_tensorize.py} (70%) copy tests/python/integration/{test_tuning.py => test_legacy_tuning.py} (100%) delete mode 100644 tests/python/unittest/test_meta_schedule_integration.py create mode 100644 tests/python/unittest/test_meta_schedule_relay_integration.py delete mode 100644 tests/python/unittest/test_meta_schedule_tune_relay.py delete mode 100644 tests/python/unittest/test_meta_schedule_tune_te.py create mode 100644 tests/python/unittest/test_meta_schedule_vnni_integration.py create mode 100644 tests/python/unittest/test_slice_tir.py create mode 100644 tests/python/unittest/test_tvmscript_parser_evaluator.py create mode 100644 tests/python/unittest/test_tvmscript_parser_source.py copy apps/microtvm/zephyr/README.md => tests/scripts/release/PRERELEASE_NOTES.md (72%) create mode 100755 tests/scripts/task_build_adreno_bins.sh copy tests/scripts/{task_config_build_jvm.sh => task_config_build_adreno.sh} (75%) create mode 100755 tests/scripts/task_python_adreno.sh