hanhusheng / papertools
Commit 2c426667, authored May 08, 2025 by jiangdongchen
Parent: f5127432
Showing 2 changed files with 2 additions and 2 deletions
README.md +1 -1
psrc/stage1/citationProcess.py +1 -1
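README.md @ 2c426667
...
@@ -66,7 +66,7 @@
    4. Manual review
       1. Duplicate PDFs may exist, and they only become visible after standardized renaming. Two PDFs with the same index name indicate a duplicate match; manually delete the duplicate PDF and its Excel row, using the original spreadsheet to confirm the duplication.
       2. Institutions and countries may also be duplicated; check them manually.
-3. stage2: country and institution index, notable-researcher identification
+3. stage2: well-known companies, notable-researcher identification
 
 # Code structure
 1. The psrc folder contains the library functions.
...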
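The duplicate check in manual-review item 1 could be pre-screened with a short script. The following is a minimal sketch, assuming the index name is simply the file stem after standardized renaming; `find_duplicate_index_names` is a hypothetical helper and is not part of papertools. It only flags candidates, so deciding which PDF and Excel row to delete stays a manual step.

```python
from collections import defaultdict
from pathlib import PurePath


def find_duplicate_index_names(pdf_paths):
    """Group PDF paths by file stem and return only the stems seen more than once."""
    groups = defaultdict(list)
    for path in pdf_paths:
        # Assumption: after renaming, the file stem is the index name.
        groups[PurePath(path).stem].append(path)
    return {name: paths for name, paths in groups.items() if len(paths) > 1}
```

Each key in the returned dictionary is an index name shared by several files, and the value lists those files in input order for review against the original spreadsheet.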
psrc/stage1/citationProcess.py @ 2c426667
...
@@ -352,7 +352,7 @@ def citationProcess(config: dict):
         if cit_id is not None:
             sheet.cell(row=idx + config["content_start"], column=6, value=cit_id)  # column 6 holds cit_id
         else:
-            logging.warning(f"{RED}cit_id is None.{RESET}")
+            logging.warning(f"{RED}cit_id is None. Multiple PDF versions may exist; please find the version that was actually cited (T_T){RESET}")
         authors_list = result_dict.get("Authors", [])
         authors = ";".join(authors_list) if isinstance(authors_list, list) else ""
...
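The behaviour in this hunk can be exercised in isolation. The following is a rough sketch, not the project's code: `record_cit_id`, `join_authors`, and `RecordingSheet` are hypothetical names, and `RED`/`RESET` are assumed to be ANSI colour codes because the diff does not show their definitions. `RecordingSheet` stands in for the worksheet object and mimics only the `cell(row=..., column=..., value=...)` call used in the hunk.

```python
import logging

# Assumption: RED and RESET are ANSI colour codes; their real definitions are not in the diff.
RED = "\033[31m"
RESET = "\033[0m"

CIT_ID_COLUMN = 6  # the hunk writes cit_id to column 6


class RecordingSheet:
    """Hypothetical stand-in for the worksheet; stores writes in a dict."""

    def __init__(self):
        self.cells = {}

    def cell(self, row, column, value=None):
        self.cells[(row, column)] = value


def record_cit_id(sheet, row, cit_id):
    """Write cit_id to the sheet, or log a warning when it is missing."""
    if cit_id is not None:
        sheet.cell(row=row, column=CIT_ID_COLUMN, value=cit_id)
        return True
    logging.warning(
        f"{RED}cit_id is None. Multiple PDF versions may exist; "
        f"please find the version that was actually cited.{RESET}"
    )
    return False


def join_authors(result_dict):
    """Join the "Authors" list with ';', falling back to '' for non-list values."""
    authors_list = result_dict.get("Authors", [])
    return ";".join(authors_list) if isinstance(authors_list, list) else ""
```

Returning a boolean from `record_cit_id` is a choice made for this sketch so that callers can count unresolved rows; the original code only logs.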