Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
T
tic
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
wenyuanbo
tic
Commits
9089e196
Commit
9089e196
authored
5 years ago
by
Leyuan Wang
Committed by
Tianqi Chen
5 years ago
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
add more syncs (#3151)
parent
4332b0aa
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
0 deletions
+3
-0
topi/python/topi/cuda/nms.py
+3
-0
No files found.
topi/python/topi/cuda/nms.py
View file @
9089e196
...
...
@@ -133,6 +133,9 @@ def get_valid_counts_upsweep(data, idx_in, idx, partial):
idx
[
bx
*
num_anchors
+
tx
*
elem_per_thread
+
i
]
=
\
idx
[
bx
*
num_anchors
+
tx
*
elem_per_thread
+
i
-
1
]
+
\
idx_in
[
bx
*
num_anchors
+
tx
*
elem_per_thread
+
i
]
ib
.
emit
(
tvm
.
make
.
Call
(
None
,
'tvm_storage_sync'
,
tvm
.
convert
([
'shared'
]),
tvm
.
expr
.
Call
.
Intrinsic
,
None
,
0
))
return
ib
.
get
()
def
get_valid_counts_scan
(
data
,
partial_in
,
partial
):
...
...
This diff is collapsed.
Click to expand it.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment