Commit d6a62c92 by jiangdongchen

extract journal and session

parent 7e4247f9
# 环境配置 # 环境配置
- 确保执行py的cwd在papertools仓库文件夹下 - 确保执行py的cwd在papertools仓库文件夹下
- 路径和参数配置都在config.json文件中 - 路径和参数配置都在config.json文件中
- api_key
- 目前的密钥是东辰同学自己从知乎上打广告赚来的,只有100块的额度,请尽量使用自己的密钥
- 如果使用不同的API的密钥注意更改openAI的调用方式,这里推荐硅基流动,因为我就是用硅基流动跑通的
- base_url
- api接口url
- pdf_dir
- 放置论文pdf的文件夹
- result_dir
- 输出关键信息json文件的文件夹
- source_excel_path
- 放置需要check的excel表格
- 第4行开始实际表项
- 第一列索引
- 第三列论文标题
- 第七列论文作者
- target_excel_path
- 输出的格式化表格
- logLevel - logLevel
- 取10表示DEBUG级别 - 取10表示DEBUG级别
- 取20表示INFO级别 - 取20表示INFO级别
- tableNum 需要处理的工作表数量 - sheetNum 需要处理的工作表数量
- maxItem 每个工作表的最大条目数 - maxItem 每个工作表的最大条目数
- python3.12.10 - python3.12
- 无法import的库使用pip install逐个安装 - 无法import的库使用pip install逐个安装
- `openai`, `pypdf` - `openai`, `pypdf`
- `python-Levenshtein` - `python-Levenshtein`
- 目前的密钥是东辰同学自己从知乎上打广告赚来的,只有100块的额度,请尽量使用自己的密钥
- 如果使用不同的API的密钥注意更改openAI的调用方式,这里推荐硅基流动,因为我就是用硅基流动跑通的
# 使用方法 # 使用方法
- 查看config.json正确配置参数,让程序能够找到需要的文件位置和参数 - 查看config.json正确配置参数,让程序能够找到需要的文件位置和参数
- 默认配置
- 文章的pdf分sheet放置在Papers/sheetname文件夹下
- 待check的excel表格放在others文件夹中
- 输出的表格放在target文件夹中, pdf会原地标准化重命名
- python main.py 执行程序 - python main.py 执行程序
- 程序执行过程中,不要打开target excel文件,不然会争用权限发生错误 - 程序执行过程中,不要打开target excel文件,不然会争用权限发生错误
- 多模型交叉验证 - 多模型交叉验证
...@@ -28,15 +47,19 @@ ...@@ -28,15 +47,19 @@
2. 输出无法下载的条目 2. 输出无法下载的条目
2. 自动化提取信息和格式化 2. 自动化提取信息和格式化
1. 通过config.json读取配置对象 1. 通过config.json读取配置对象
2. 遍历excel的工作表 2. **遍历**excel的sheet
1. 读取excel表格中的论文名称和索引 1. **遍历**sheet中的论文名称和索引
2. 循环: 1. 用大模型读取pdf中第一页的论文名称和关键信息,存储到json文件夹下
1. 读取pdf中的论文名称和关键信息,存储到json文件夹下 2. **遍历**excel表格中的论文名称进行模糊匹配
2. 和excel表格中的论文名称进行模糊匹配 1. 匹配成功后
3. 匹配成功后
1. 用pdf文件中的论文名称和索引标准化重命名pdf文件和excel表格中的论文名称 1. 用pdf文件中的论文名称和索引标准化重命名pdf文件和excel表格中的论文名称
2. 将pdf文件中的关键信息写入excel表格中, 包括作者姓名、机构、国家 2. 将pdf文件中的关键信息写入excel表格中, 包括
4. 匹配失败后,输出无法匹配的条目 - 标题
- 会议名称
- 作者姓名
- 机构
- 国家
2. 匹配失败后,输出无法匹配的条目
1. 使用warning记录无法匹配的条目,方便后续处理 1. 使用warning记录无法匹配的条目,方便后续处理
# 代码结构说明 # 代码结构说明
......
# crawler
## update
更新陈老师的文章列表信息`yunjichen.json`,后续`main.py`依赖此文件检索引用文献。
## main
创建`papers.txt`并在其中逐行加入你需要检索的论文标题,不建议输入完整标题,输入部分标题即可,如下所示:
```
Reproducing Concurrency Bugs Using Local Clocks
binary translator with post-optimization
timing error mitigation for hardware neural networks
A Polyvalent Machine Learning Accelerator
```
已爬取过的链接在`urls.txt`中记录,避免重复爬取,`main.py`文件可重复运行,爬取失败建议检查网络连接。
### 2. **运行脚本**
确保你的环境已安装必要依赖(如`pandas`, `openpyxl`, `tqdm`等)。
命令行运行:
```bash
python main.py
```
**指定年份**(仅爬取该年份的引用):
```bash
python main.py --year 2023
```
### 3. **查看结果**
抓取后将在`results/`下生成`citations.xlsx`,每个论文一个表单,包含被引论文的Title、URL、作者信息等。
------
## **参数说明**
- `--year` 指定年份,仅抓取该年度的引用。不加该参数则会爬取**所有年份**的引用信息。
**用法示例:**
- 爬取全部(默认): `python main.py`
- 仅抓2022年的引用: `python main.py --year 2022`
\ No newline at end of file
import os
import re
import string
import json
import pandas as pd
from tqdm import tqdm
from urllib.parse import urlparse, parse_qs, urlencode, urlunparse
import time
import random
import argparse
from bs4 import BeautifulSoup
import requests
import re
import string
JSON_FILE = "yunjichen.json"
PUB_FILE = "papers.txt"
URL_FILE = "urls.txt"
DATA_FILE = "citations.xlsx"
def get_cited_url_list(citedby_url, year=None):
prefix, suffix = citedby_url.split("oi=bibs&hl=en")
prefix += "start="
if year:
# Use the year parameter for filtering
suffix = f"&hl=en&as_sdt=2005&sciodt=0,5{suffix}&scipsc=&as_ylo={year}&as_yhi={year}&scisbd="
else:
# No year filter, fetch all
suffix = f"&hl=en&as_sdt=2005&sciodt=0,5{suffix}&scipsc="
for i in range(0, 10000, 10):
yield prefix + str(i) + suffix
SYMBOL_MORE_AUTHORS = "…"
# 添加请求头
headers = {
"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3"
}
def parse_scholar_results(html_content):
"""
Parses HTML content to extract Google Scholar results.
Args:
html_content (str): The HTML content to parse.
Returns:
list: A list of dictionaries containing '标题', '作者', and '期刊信息'.
"""
soup = BeautifulSoup(html_content, "html.parser")
results = soup.find_all("div", class_="gs_r")
extracted_data = []
for result in results:
# Extract title
title_tag = result.find(["h2", "h3"], class_="gs_rt")
if title_tag:
a_tag = title_tag.find("a")
if a_tag:
title = a_tag.get_text()
else:
title = title_tag.get_text()
title = title.replace("[CITATION][C] ", "")
else:
title = "未找到标题"
# Extract authors and journal information
authors_journal_tag = result.find("div", class_="gs_a")
if authors_journal_tag:
authors_journal_text = authors_journal_tag.get_text()
parts = authors_journal_text.split("-")
if len(parts) >= 2:
authors = parts[0].strip()
journal_info = parts[1].split(",")[0].strip()
else:
authors = "格式不符合预期"
journal_info = "格式不符合预期"
else:
authors = "未找到作者和期刊信息"
journal_info = "未找到作者和期刊信息"
# Add extracted data to list
extracted_data.append(
{"标题": title, "作者": authors, "期刊信息": journal_info}
)
return extracted_data
def parse_html(url):
"""
return:
title: str
new_url: url to the paper
authors: str
more_authors: bool, if there are more authors than shown
"""
# print(f"Fetching {url}")
results = []
session = requests.Session()
response = session.get(url, headers=headers)
if response.status_code != 200:
# print("Failed to get the page")
return False, results
soup = BeautifulSoup(response.content, "html.parser")
papers = soup.find_all("div", class_="gs_r gs_or gs_scl")
if len(papers) == 0:
return False, results
for paper in papers:
div_title = paper.find("h3", class_="gs_rt")
try:
title = div_title.find("a").get_text()
new_url = div_title.find("a")["href"]
except:
# span blocks
title = div_title.find_all("span")[-1].get_text()
continue
authors = paper.find("div", class_="gs_a").get_text().split("-")[0].split(",")
authors[-1] = " ".join(authors[-1].split())
authors = ";".join(authors)
if SYMBOL_MORE_AUTHORS in authors[-1]:
results.append((title, new_url, authors, 1))
else:
results.append((title, new_url, authors, 0))
return True, results
def add_scisbd_sort(url):
"""确保url中有scisbd=1 按date排序"""
parsed_url = urlparse(url)
query = parse_qs(parsed_url.query)
query["scisbd"] = ["1"]
new_query = urlencode(query, doseq=True)
new_url = urlunparse(
(parsed_url.scheme, parsed_url.netloc, parsed_url.path, "", new_query, "")
)
return new_url
def main(publications, year):
if not os.path.exists(URL_FILE):
with open(URL_FILE, "w") as file:
file.write("")
publications = [pub.lower() for pub in publications]
with open(JSON_FILE, "r") as file:
author = json.load(file)
author_publications = [
pub for pub in author["publications"] if pub["container_type"] == "Publication"
]
titles = [pub["bib"]["title"] for pub in author_publications]
index_publications = []
for i, publication in enumerate(publications):
found = False
for idx, title in enumerate(titles):
if publication in title.lower():
found = True
index_publications.append(idx)
publications[i] = title
break
if not found:
index_publications.append(None)
for idx, publication in enumerate(publications):
with open(URL_FILE, "a+") as file:
file.seek(0)
know_urls = set([line.strip() for line in file.readlines()])
name = "_".join(publication.split())
name = re.sub(f"[{string.punctuation}]", "", name[:20])
print(f"Processing {publication}...to {name}")
columns = [
"paper idx",
"paper Title",
"Cite idx",
"Cite Title",
"URL",
"Authors",
"More Authors",
]
if os.path.exists(DATA_FILE) and name in pd.ExcelFile(DATA_FILE).sheet_names:
old_df = pd.read_excel(DATA_FILE, sheet_name=name)
data = (
old_df
if set(old_df.columns) == set(columns)
else pd.DataFrame(columns=columns)
)
current_start_idx = old_df["Cite idx"].max() if not old_df.empty else 0
else:
data = pd.DataFrame(columns=columns)
current_start_idx = 0
index_publication = index_publications[idx]
if index_publication is None:
continue
citations = author_publications[index_publication]["num_citations"]
citedby_url = author_publications[index_publication]["citedby_url"]
citation_count = current_start_idx
for i, url in tqdm(enumerate(get_cited_url_list(citedby_url, year))):
# url = add_scisbd_sort(url)
if url in know_urls:
continue
if i * 10 > citations:
break
# 随机延时
time.sleep(random.uniform(0.5, 2.0))
mark, results = parse_html(url)
if not mark:
break
for rec in results:
citation_count += 1
title = rec[0] if len(rec) > 0 else ""
cite_url = rec[1] if len(rec) > 1 else ""
authors = rec[2] if len(rec) > 2 else ""
more_authors = rec[3] if len(rec) > 3 else ""
row = {
"paper idx": idx + 1,
"paper Title": publication,
"Cite idx": citation_count,
"Cite Title": title,
"URL": cite_url,
"Authors": authors,
"More Authors": more_authors,
}
data = data._append(row, ignore_index=True)
know_urls.add(url)
if os.path.exists(DATA_FILE):
with pd.ExcelWriter(
DATA_FILE, mode="a", if_sheet_exists="replace", engine="openpyxl"
) as writer:
data.to_excel(writer, sheet_name=name, index=False)
else:
data.to_excel(DATA_FILE, sheet_name=name, index=False)
with open(URL_FILE, "w") as file:
file.writelines([url + "\n" for url in know_urls])
if __name__ == "__main__":
parser = argparse.ArgumentParser(
description="Crawl citations with optional year filtering"
)
parser.add_argument(
"--year", type=int, help="The year to filter citations by (optional)"
)
args = parser.parse_args()
# Load the publication file
with open(PUB_FILE, "r", encoding="utf-8") as file:
publications = file.readlines()
publications = [pub.strip().lower() for pub in publications]
main(publications, args.year)
anNao: A Machine-Learning Supercompute
from scholarly import scholarly
from scholarly import ProxyGenerator
import json
# Activates proxy because Google Scholar otherwise might block the IP address
pg = ProxyGenerator()
scholarly.use_proxy(pg, pg)
def main(name):
author = next(scholarly.search_author(name))
author = scholarly.fill((author),sections = ['publications'])
json.dump(author, open(f"{name}.json", "w"), indent=4)
if __name__=="__main__":
main("yunji chen")
\ No newline at end of file
This source diff could not be displayed because it is too large. You can view the blob instead.
...@@ -434,3 +434,339 @@ ...@@ -434,3 +434,339 @@
2025-05-08 01:07:56,576 - INFO - Change: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices- A Survey.pdf -> 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf 2025-05-08 01:07:56,576 - INFO - Change: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices- A Survey.pdf -> 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf
2025-05-08 01:07:56,576 - INFO - Processing ML_Processors_Are_Going_Multi-Core_A_performance_dream_or_a_scheduling_nightmare.pdf 2025-05-08 01:07:56,576 - INFO - Processing ML_Processors_Are_Going_Multi-Core_A_performance_dream_or_a_scheduling_nightmare.pdf
2025-05-08 01:08:07,998 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK" 2025-05-08 01:08:07,998 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:02:39,821 - INFO - 程序启动,日志文件保存在: C:\Users\17046\Documents\papertools\logs\citation_process.log
2025-05-08 14:02:40,300 - INFO - Processing sheet: j24-DianNao family
2025-05-08 14:02:40,302 - INFO - Processing 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf
2025-05-08 14:03:32,867 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:03:32,868 - INFO - Renamed: 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf -> 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf
2025-05-08 14:03:33,513 - INFO - Matched: 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf -> idx: 1, excel_name: In-datacenter performance analysis of a tensor processing unit
2025-05-08 14:03:33,514 - INFO - Change: 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf -> 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf
2025-05-08 14:03:33,514 - INFO - Processing 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf
2025-05-08 14:03:51,609 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:03:51,624 - INFO - Renamed: 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf -> 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf
2025-05-08 14:03:52,289 - INFO - Matched: 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf -> idx: 10, excel_name: A Configurable Cloud-Scale DNN Processor for Real-Time AI
2025-05-08 14:03:52,290 - INFO - Change: 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf -> 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf
2025-05-08 14:03:52,290 - INFO - Processing 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf
2025-05-08 14:04:05,852 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:04:05,867 - INFO - Renamed: 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf -> 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf
2025-05-08 14:04:06,493 - INFO - Matched: 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf -> idx: 11, excel_name: From Cloud Down to Things: An Overview of Machine Learning in Internet of Things
2025-05-08 14:04:06,493 - INFO - Change: 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf -> 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf
2025-05-08 14:04:06,494 - INFO - Processing 12-CASH_ Compiler Assisted Hardware Design for Improving DRAM Energy Efficiency in CNN Inference.pdf
2025-05-08 14:04:24,303 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:04:24,396 - INFO - Processing 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf
2025-05-08 14:04:45,574 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:04:45,598 - INFO - Renamed: 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf -> 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf
2025-05-08 14:04:46,165 - INFO - Matched: 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf -> idx: 13, excel_name: Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters
2025-05-08 14:04:46,165 - INFO - Change: 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf -> 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf
2025-05-08 14:04:46,165 - INFO - Processing 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf
2025-05-08 14:04:58,732 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:04:58,755 - INFO - Renamed: 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf -> 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf
2025-05-08 14:04:59,375 - INFO - Matched: 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf -> idx: 15, excel_name: Being-ahead: Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment
2025-05-08 14:04:59,375 - INFO - Change: 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf -> 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf
2025-05-08 14:04:59,375 - INFO - Processing 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf
2025-05-08 14:05:15,752 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:05:15,778 - INFO - Renamed: 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf -> 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf
2025-05-08 14:05:16,420 - INFO - Matched: 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf -> idx: 16, excel_name: Cloud-backed mobile cognition
2025-05-08 14:05:16,420 - INFO - Change: 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf -> 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf
2025-05-08 14:05:16,420 - INFO - Processing 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf
2025-05-08 14:05:26,857 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:05:26,876 - INFO - Renamed: 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf -> 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf
2025-05-08 14:05:27,460 - INFO - Matched: 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf -> idx: 17, excel_name: ASRPU: A Programmable Accelerator for Low-Power Automatic Speech Recognition
2025-05-08 14:05:27,461 - INFO - Change: 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf -> 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf
2025-05-08 14:05:27,461 - INFO - Processing 20-Energy-Efficient Machine Learning on the Edges.pdf
2025-05-08 14:05:39,124 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:05:39,142 - INFO - Renamed: 20-Energy-Efficient Machine Learning on the Edges.pdf -> 20-Energy-Efficient Machine Learning on the Edges.pdf
2025-05-08 14:05:39,720 - INFO - Matched: 20-Energy-Efficient Machine Learning on the Edges.pdf -> idx: 20, excel_name: Energy-Efficient Machine Learning on the Edges
2025-05-08 14:05:39,720 - INFO - Change: 20-Energy-Efficient Machine Learning on the Edges.pdf -> 20-Energy-Efficient Machine Learning on the Edges.pdf
2025-05-08 14:05:39,720 - INFO - Processing 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf
2025-05-08 14:06:01,734 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:06:01,769 - INFO - Renamed: 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf -> 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf
2025-05-08 14:06:02,332 - INFO - Matched: 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf -> idx: 23, excel_name: Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Codesign
2025-05-08 14:06:02,332 - INFO - Change: 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf -> 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf
2025-05-08 14:06:02,332 - INFO - Processing 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf
2025-05-08 14:06:11,040 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:06:11,069 - INFO - Renamed: 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf -> 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf
2025-05-08 14:06:11,664 - INFO - Matched: 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf -> idx: 26, excel_name: Adversarial Deep Learning and Security with a Hardware Perspective
2025-05-08 14:06:11,664 - INFO - Change: 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf -> 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf
2025-05-08 14:06:11,664 - INFO - Processing 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf
2025-05-08 14:06:23,307 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:06:23,345 - INFO - Renamed: 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf -> 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf
2025-05-08 14:06:23,913 - INFO - Matched: 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf -> idx: 27, excel_name: Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors
2025-05-08 14:06:23,914 - INFO - Change: 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf -> 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf
2025-05-08 14:06:23,914 - INFO - Processing 3-A Domain-Specific Architecture for Deep Neural Networks.pdf
2025-05-08 14:06:37,247 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:06:37,254 - INFO - Renamed: 3-A Domain-Specific Architecture for Deep Neural Networks.pdf -> 3-A Domain-Specific Architecture for Deep Neural Networks.pdf
2025-05-08 14:06:37,812 - INFO - Matched: 3-A Domain-Specific Architecture for Deep Neural Networks.pdf -> idx: 3, excel_name: A domain-specific architecture for deep neural networks
2025-05-08 14:06:37,812 - INFO - Change: 3-A Domain-Specific Architecture for Deep Neural Networks.pdf -> 3-A Domain-Specific Architecture for Deep Neural Networks.pdf
2025-05-08 14:06:37,812 - INFO - Processing 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf
2025-05-08 14:07:29,941 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:07:29,985 - INFO - Renamed: 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf -> 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf
2025-05-08 14:07:30,598 - INFO - Matched: 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf -> idx: 31, excel_name: A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms
2025-05-08 14:07:30,598 - INFO - Change: 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf -> 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf
2025-05-08 14:07:30,599 - INFO - Processing 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf
2025-05-08 14:07:46,101 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:07:46,147 - INFO - Renamed: 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf -> 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf
2025-05-08 14:07:46,744 - INFO - Matched: 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf -> idx: 32, excel_name: A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks
2025-05-08 14:07:46,744 - INFO - Change: 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf -> 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf
2025-05-08 14:07:46,745 - INFO - Processing 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf
2025-05-08 14:07:56,382 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:07:56,433 - INFO - Renamed: 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf -> 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf
2025-05-08 14:07:57,027 - INFO - Matched: 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf -> idx: 33, excel_name: MG3MConv: Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor
2025-05-08 14:07:57,027 - INFO - Change: 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf -> 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf
2025-05-08 14:07:57,027 - INFO - Processing 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf
2025-05-08 14:08:30,266 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:08:30,320 - INFO - Renamed: 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf -> 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf
2025-05-08 14:08:30,924 - INFO - Matched: 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf -> idx: 34, excel_name: Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators
2025-05-08 14:08:30,924 - INFO - Change: 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf -> 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf
2025-05-08 14:08:30,925 - INFO - Processing 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf
2025-05-08 14:08:46,401 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:08:46,467 - INFO - Renamed: 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf -> 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf
2025-05-08 14:08:46,998 - INFO - Matched: 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf -> idx: 38, excel_name: Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support
2025-05-08 14:08:46,998 - INFO - Change: 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf -> 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf
2025-05-08 14:08:46,998 - INFO - Processing 42-Energy-efficient application programming for green cloud computing.pdf
2025-05-08 14:09:02,277 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:09:02,322 - INFO - Renamed: 42-Energy-efficient application programming for green cloud computing.pdf -> 42-Energy-efficient application programming for green cloud computing.pdf
2025-05-08 14:09:02,908 - INFO - Matched: 42-Energy-efficient application programming for green cloud computing.pdf -> idx: 42, excel_name: Energy-Efficient Application Programming for Green Cloud Computing
2025-05-08 14:09:02,908 - INFO - Change: 42-Energy-efficient application programming for green cloud computing.pdf -> 42-Energy-efficient application programming for green cloud computing.pdf
2025-05-08 14:09:02,909 - INFO - Processing 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators.pdf
2025-05-08 14:09:20,321 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:09:20,391 - INFO - Renamed: 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators.pdf -> 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain- Specific Accelerators.pdf
2025-05-08 14:09:20,988 - INFO - Matched: 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators.pdf -> idx: 43, excel_name: Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their {Domain-Specific} Accelerators
2025-05-08 14:09:20,988 - INFO - Change: 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain-Specific Accelerators.pdf -> 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain- Specific Accelerators.pdf
2025-05-08 14:09:20,988 - INFO - Processing 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf
2025-05-08 14:09:33,487 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:09:33,538 - INFO - Renamed: 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf -> 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf
2025-05-08 14:09:34,125 - INFO - Matched: 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf -> idx: 44, excel_name: Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis
2025-05-08 14:09:34,126 - INFO - Change: 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf -> 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf
2025-05-08 14:09:34,126 - INFO - Processing 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 14:09:46,256 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:09:46,301 - INFO - Renamed: 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 14:09:46,871 - INFO - Matched: 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf -> idx: 47, excel_name: Brain Inspired Computing: A Systematic Survey and Future Trends
2025-05-08 14:09:46,871 - INFO - Change: 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 14:09:46,872 - INFO - Processing 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 14:10:12,309 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:10:12,354 - INFO - Renamed: 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 14:10:12,948 - INFO - Matched: 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf -> idx: 47, excel_name: Brain Inspired Computing: A Systematic Survey and Future Trends
2025-05-08 14:10:12,948 - INFO - Change: 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 14:10:12,949 - INFO - Processing 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf
2025-05-08 14:10:35,940 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:10:36,021 - INFO - Renamed: 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf -> 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf
2025-05-08 14:10:36,614 - INFO - Matched: 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf -> idx: 49, excel_name: Efficient acceleration of deep learning inference on resource-constrained edge devices: A review
2025-05-08 14:10:36,614 - INFO - Change: 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf -> 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf
2025-05-08 14:10:36,614 - INFO - Processing 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf
2025-05-08 14:11:00,829 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:11:00,892 - INFO - Renamed: 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf -> 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf
2025-05-08 14:11:01,454 - INFO - Matched: 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf -> idx: 52, excel_name: FUSION OF AI WITH IOT (AI2OT): PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS
2025-05-08 14:11:01,454 - INFO - Change: 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf -> 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf
2025-05-08 14:11:01,454 - INFO - Processing 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf
2025-05-08 14:11:14,569 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:11:14,663 - INFO - Renamed: 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf -> 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf
2025-05-08 14:11:15,257 - INFO - Matched: 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf -> idx: 53, excel_name: An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring
2025-05-08 14:11:15,257 - INFO - Change: 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf -> 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf
2025-05-08 14:11:15,257 - INFO - Processing 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf
2025-05-08 14:11:30,374 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:11:30,468 - INFO - Renamed: 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf -> 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf
2025-05-08 14:11:31,062 - INFO - Matched: 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf -> idx: 54, excel_name: Floating Gate Transistor‐Based Accurate Digital In‐Memory Computing for Deep Neural Networks
2025-05-08 14:11:31,065 - INFO - Change: 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf -> 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf
2025-05-08 14:11:31,065 - INFO - Processing 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf
2025-05-08 14:11:45,744 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:11:45,809 - INFO - Renamed: 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf -> 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf
2025-05-08 14:11:46,408 - INFO - Matched: 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf -> idx: 56, excel_name: Design Methodologies and Tools for Energy-aware IoT-based Applications
2025-05-08 14:11:46,409 - INFO - Change: 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf -> 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf
2025-05-08 14:11:46,409 - INFO - Processing 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf
2025-05-08 14:12:01,871 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:12:01,951 - INFO - Renamed: 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf -> 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf
2025-05-08 14:12:02,556 - INFO - Matched: 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf -> idx: 57, excel_name: Arax: a runtime framework for decoupling applications from heterogeneous accelerators
2025-05-08 14:12:02,556 - INFO - Change: 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf -> 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf
2025-05-08 14:12:02,556 - INFO - Processing 61-A carbon-nanotube-based tensor processing unit.pdf
2025-05-08 14:12:14,626 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:12:14,669 - INFO - Renamed: 61-A carbon-nanotube-based tensor processing unit.pdf -> 61-A carbon-nanotube-based tensor processing unit.pdf
2025-05-08 14:12:15,286 - INFO - Matched: 61-A carbon-nanotube-based tensor processing unit.pdf -> idx: 61, excel_name: A carbon-nanotube-based tensor processing unit
2025-05-08 14:12:15,286 - INFO - Change: 61-A carbon-nanotube-based tensor processing unit.pdf -> 61-A carbon-nanotube-based tensor processing unit.pdf
2025-05-08 14:12:15,286 - INFO - Processing 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf
2025-05-08 14:12:36,181 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 14:12:36,268 - INFO - Renamed: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf -> 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf
2025-05-08 14:12:36,878 - INFO - Matched: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf -> idx: 62, excel_name: Advancements in accelerating deep neural network inference on aiot devices: A survey
2025-05-08 14:12:36,879 - INFO - Change: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf -> 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf
2025-05-08 14:12:36,879 - INFO - Processing ML_Processors_Are_Going_Multi-Core_A_performance_dream_or_a_scheduling_nightmare.pdf
2025-05-08 14:12:45,090 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:06:50,066 - INFO - 程序启动,日志文件保存在: C:\Users\17046\Documents\papertools\logs\citation_process.log
2025-05-08 15:06:50,546 - INFO - Processing sheet: j24-DianNao family
2025-05-08 15:06:50,549 - INFO - Processing 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf
2025-05-08 15:07:43,813 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:07:43,817 - INFO - Renamed: 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf -> 1-In-Datacenter_Performance_Analysis_of_a_Tensor_Processing_Unit.pdf
2025-05-08 15:07:44,482 - INFO - Matched: 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf -> idx: 1, excel_name: In-datacenter performance analysis of a tensor processing unit
2025-05-08 15:07:44,482 - INFO - Change: 1-In-Datacenter Performance Analysis of a Tensor Processing Unit.pdf -> 1-In-Datacenter_Performance_Analysis_of_a_Tensor_Processing_Unit.pdf
2025-05-08 15:07:44,482 - INFO - Processing 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf
2025-05-08 15:08:03,241 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:08:03,254 - INFO - Renamed: 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf -> 10-A_Configurable_Cloud-Scale_DNN_Processor_for_Real-Time_AI.pdf
2025-05-08 15:08:03,832 - INFO - Matched: 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf -> idx: 10, excel_name: A Configurable Cloud-Scale DNN Processor for Real-Time AI
2025-05-08 15:08:03,832 - INFO - Change: 10-A Configurable Cloud-Scale DNN Processor for Real-Time AI.pdf -> 10-A_Configurable_Cloud-Scale_DNN_Processor_for_Real-Time_AI.pdf
2025-05-08 15:08:03,832 - INFO - Processing 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf
2025-05-08 15:08:18,001 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:08:18,018 - INFO - Renamed: 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf -> 11-From_Cloud_Down_to_Things__An_Overview_of_Machine_Learning_in_Internet_of_Things.pdf
2025-05-08 15:08:18,560 - INFO - Matched: 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf -> idx: 11, excel_name: From Cloud Down to Things: An Overview of Machine Learning in Internet of Things
2025-05-08 15:08:18,560 - INFO - Change: 11-From Cloud Down to Things_ An Overview of Machine Learning in Internet of Things.pdf -> 11-From_Cloud_Down_to_Things__An_Overview_of_Machine_Learning_in_Internet_of_Things.pdf
2025-05-08 15:08:18,560 - INFO - Processing 12-CASH_ Compiler Assisted Hardware Design for Improving DRAM Energy Efficiency in CNN Inference.pdf
2025-05-08 15:08:33,532 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:08:33,632 - INFO - Processing 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf
2025-05-08 15:08:56,044 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:08:56,063 - INFO - Renamed: 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf -> 13-Laius__Towards_Latency_Awareness_and_Improved_Utilization_of_Spatial_Multitasking_Accelerators_in_Datacenters.pdf
2025-05-08 15:08:56,608 - INFO - Matched: 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf -> idx: 13, excel_name: Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters
2025-05-08 15:08:56,609 - INFO - Change: 13-Laius_ Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters.pdf -> 13-Laius__Towards_Latency_Awareness_and_Improved_Utilization_of_Spatial_Multitasking_Accelerators_in_Datacenters.pdf
2025-05-08 15:08:56,609 - INFO - Processing 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf
2025-05-08 15:09:06,991 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:09:07,024 - INFO - Renamed: 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf -> 15-Being-ahead__Benchmarking_and_Exploring_Accelerators_for_Hardware-Efficient_AI_Deployment.pdf
2025-05-08 15:09:07,635 - INFO - Matched: 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf -> idx: 15, excel_name: Being-ahead: Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment
2025-05-08 15:09:07,636 - INFO - Change: 15-Being-ahead_ Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.pdf -> 15-Being-ahead__Benchmarking_and_Exploring_Accelerators_for_Hardware-Efficient_AI_Deployment.pdf
2025-05-08 15:09:07,636 - INFO - Processing 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf
2025-05-08 15:09:24,032 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:09:24,054 - INFO - Renamed: 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf -> 16-Cloud-backed_mobile_cognition_Power-efficient_deep_learning_in_the_autonomous_vehicle_era.pdf
2025-05-08 15:09:24,604 - INFO - Matched: 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf -> idx: 16, excel_name: Cloud-backed mobile cognition
2025-05-08 15:09:24,604 - INFO - Change: 16-Cloud-backed mobile cognition Power-efficient deep learning in the autonomous vehicle era.pdf -> 16-Cloud-backed_mobile_cognition_Power-efficient_deep_learning_in_the_autonomous_vehicle_era.pdf
2025-05-08 15:09:24,605 - INFO - Processing 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf
2025-05-08 15:09:34,935 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:09:34,957 - INFO - Renamed: 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf -> 17-ASRPU__A_Programmable_Accelerator_for_Low-Power_Automatic_Speech_Recognition.pdf
2025-05-08 15:09:35,539 - INFO - Matched: 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf -> idx: 17, excel_name: ASRPU: A Programmable Accelerator for Low-Power Automatic Speech Recognition
2025-05-08 15:09:35,540 - INFO - Change: 17-ASRPU_ A Programmable Accelerator for Low-Power Automatic Speech Recognition.pdf -> 17-ASRPU__A_Programmable_Accelerator_for_Low-Power_Automatic_Speech_Recognition.pdf
2025-05-08 15:09:35,540 - INFO - Processing 20-Energy-Efficient Machine Learning on the Edges.pdf
2025-05-08 15:09:48,112 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:09:48,127 - INFO - Renamed: 20-Energy-Efficient Machine Learning on the Edges.pdf -> 20-Energy-Efficient_Machine_Learning_on_the_Edges.pdf
2025-05-08 15:09:48,721 - INFO - Matched: 20-Energy-Efficient Machine Learning on the Edges.pdf -> idx: 20, excel_name: Energy-Efficient Machine Learning on the Edges
2025-05-08 15:09:48,722 - INFO - Change: 20-Energy-Efficient Machine Learning on the Edges.pdf -> 20-Energy-Efficient_Machine_Learning_on_the_Edges.pdf
2025-05-08 15:09:48,722 - INFO - Processing 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf
2025-05-08 15:10:11,745 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:10:11,777 - INFO - Renamed: 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf -> 23-Enabling_Design_Methodologies_and_Future_Trends_for_Edge_AI__Specialization_and_Codesign.pdf
2025-05-08 15:10:12,325 - INFO - Matched: 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf -> idx: 23, excel_name: Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Codesign
2025-05-08 15:10:12,325 - INFO - Change: 23-Enabling Design Methodologies and Future Trends for Edge AI_ Specialization and Codesign.pdf -> 23-Enabling_Design_Methodologies_and_Future_Trends_for_Edge_AI__Specialization_and_Codesign.pdf
2025-05-08 15:10:12,326 - INFO - Processing 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf
2025-05-08 15:10:17,312 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:10:17,341 - INFO - Renamed: 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf -> 26-Adversarial_Deep_Learning_and_Security_with_a_Hardware_Perspective.pdf
2025-05-08 15:10:17,935 - INFO - Matched: 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf -> idx: 26, excel_name: Adversarial Deep Learning and Security with a Hardware Perspective
2025-05-08 15:10:17,935 - INFO - Change: 26-Adversarial Deep Learning and Security with a Hardware Perspective.pdf -> 26-Adversarial_Deep_Learning_and_Security_with_a_Hardware_Perspective.pdf
2025-05-08 15:10:17,935 - INFO - Processing 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf
2025-05-08 15:10:35,921 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:10:35,961 - INFO - Renamed: 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf -> 27-Analysis_and_Optimization_of_Direct_Convolution_Execution_on_Multi-Core_Processors.pdf
2025-05-08 15:10:36,528 - INFO - Matched: 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf -> idx: 27, excel_name: Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors
2025-05-08 15:10:36,528 - INFO - Change: 27-Analysis and Optimization of Direct Convolution Execution on Multi-Core Processors.pdf -> 27-Analysis_and_Optimization_of_Direct_Convolution_Execution_on_Multi-Core_Processors.pdf
2025-05-08 15:10:36,529 - INFO - Processing 3-A Domain-Specific Architecture for Deep Neural Networks.pdf
2025-05-08 15:10:46,116 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:10:46,119 - INFO - Renamed: 3-A Domain-Specific Architecture for Deep Neural Networks.pdf -> 3-A_Domain-Specific_Architecture_for_Deep_Neural_Networks.pdf
2025-05-08 15:10:46,650 - INFO - Matched: 3-A Domain-Specific Architecture for Deep Neural Networks.pdf -> idx: 3, excel_name: A domain-specific architecture for deep neural networks
2025-05-08 15:10:46,650 - INFO - Change: 3-A Domain-Specific Architecture for Deep Neural Networks.pdf -> 3-A_Domain-Specific_Architecture_for_Deep_Neural_Networks.pdf
2025-05-08 15:10:46,650 - INFO - Processing 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf
2025-05-08 15:11:38,275 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:11:38,319 - INFO - Renamed: 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf -> 31-A_Survey_on_Deep_Learning_Hardware_Accelerators_for_Heterogeneous_HPC_Platforms.pdf
2025-05-08 15:11:38,904 - INFO - Matched: 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf -> idx: 31, excel_name: A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms
2025-05-08 15:11:38,904 - INFO - Change: 31-A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms.pdf -> 31-A_Survey_on_Deep_Learning_Hardware_Accelerators_for_Heterogeneous_HPC_Platforms.pdf
2025-05-08 15:11:38,904 - INFO - Processing 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf
2025-05-08 15:11:56,497 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:11:56,536 - INFO - Renamed: 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf -> 32-A_Systolic_Array_with_Activation_Stationary_Dataflow_for_Deep_Fully-Connected_Networks.pdf
2025-05-08 15:11:57,074 - INFO - Matched: 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf -> idx: 32, excel_name: A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks
2025-05-08 15:11:57,074 - INFO - Change: 32-A Systolic Array with Activation Stationary Dataflow for Deep Fully-Connected Networks.pdf -> 32-A_Systolic_Array_with_Activation_Stationary_Dataflow_for_Deep_Fully-Connected_Networks.pdf
2025-05-08 15:11:57,074 - INFO - Processing 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf
2025-05-08 15:12:06,123 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:12:06,171 - INFO - Renamed: 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf -> 33-MG3MConv__Multi-Grained_Matrix-Multiplication-Mapping_Convolution_Algorithm_toward_the_SW26010_Processor.pdf
2025-05-08 15:12:06,759 - INFO - Matched: 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf -> idx: 33, excel_name: MG3MConv: Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor
2025-05-08 15:12:06,759 - INFO - Change: 33-MG3MConv_ Multi-Grained Matrix-Multiplication-Mapping Convolution Algorithm toward the SW26010 Processor.pdf -> 33-MG3MConv__Multi-Grained_Matrix-Multiplication-Mapping_Convolution_Algorithm_toward_the_SW26010_Processor.pdf
2025-05-08 15:12:06,759 - INFO - Processing 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf
2025-05-08 15:12:37,048 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:12:37,100 - INFO - Renamed: 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf -> 34-Heron__Automatically_Constrained_High-Performance_Library_Generation_for_Deep_Learning_Accelerators.pdf
2025-05-08 15:12:37,789 - INFO - Matched: 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf -> idx: 34, excel_name: Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators
2025-05-08 15:12:37,789 - INFO - Change: 34-Heron_ Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.pdf -> 34-Heron__Automatically_Constrained_High-Performance_Library_Generation_for_Deep_Learning_Accelerators.pdf
2025-05-08 15:12:37,790 - INFO - Processing 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf
2025-05-08 15:12:52,927 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:12:52,991 - INFO - Renamed: 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf -> 38-Energy_and_Performance_Improvements_for_Convolutional_Accelerators_Using_Lightweight_Address_Translation_Support.pdf
2025-05-08 15:12:53,584 - INFO - Matched: 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf -> idx: 38, excel_name: Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support
2025-05-08 15:12:53,584 - INFO - Change: 38-Energy and Performance Improvements for Convolutional Accelerators Using Lightweight Address Translation Support.pdf -> 38-Energy_and_Performance_Improvements_for_Convolutional_Accelerators_Using_Lightweight_Address_Translation_Support.pdf
2025-05-08 15:12:53,585 - INFO - Processing 42-Energy-efficient application programming for green cloud computing.pdf
2025-05-08 15:13:09,921 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:13:09,965 - INFO - Renamed: 42-Energy-efficient application programming for green cloud computing.pdf -> 42-Energy-efficient_application_programming_for_green_cloud_computing.pdf
2025-05-08 15:13:10,534 - INFO - Matched: 42-Energy-efficient application programming for green cloud computing.pdf -> idx: 42, excel_name: Energy-Efficient Application Programming for Green Cloud Computing
2025-05-08 15:13:10,535 - INFO - Change: 42-Energy-efficient application programming for green cloud computing.pdf -> 42-Energy-efficient_application_programming_for_green_cloud_computing.pdf
2025-05-08 15:13:10,535 - INFO - Processing 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain- Specific Accelerators.pdf
2025-05-08 15:13:26,811 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:13:26,882 - INFO - Renamed: 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain- Specific Accelerators.pdf -> 43-Effectively_Scheduling_Computational_Graphs_of_Deep_Neural_Networks_toward_Their_Domain-_Specific_Accelerators.pdf
2025-05-08 15:13:27,494 - INFO - Matched: 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain- Specific Accelerators.pdf -> idx: 43, excel_name: Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their {Domain-Specific} Accelerators
2025-05-08 15:13:27,495 - INFO - Change: 43-Effectively Scheduling Computational Graphs of Deep Neural Networks toward Their Domain- Specific Accelerators.pdf -> 43-Effectively_Scheduling_Computational_Graphs_of_Deep_Neural_Networks_toward_Their_Domain-_Specific_Accelerators.pdf
2025-05-08 15:13:27,495 - INFO - Processing 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf
2025-05-08 15:13:38,730 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:13:38,777 - INFO - Renamed: 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf -> 44-Generator-Based_Design_of_Custom_Systems-on-Chip_for_Numerical_Data_Analysis.pdf
2025-05-08 15:13:39,311 - INFO - Matched: 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf -> idx: 44, excel_name: Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis
2025-05-08 15:13:39,311 - INFO - Change: 44-Generator-Based Design of Custom Systems-on-Chip for Numerical Data Analysis.pdf -> 44-Generator-Based_Design_of_Custom_Systems-on-Chip_for_Numerical_Data_Analysis.pdf
2025-05-08 15:13:39,311 - INFO - Processing 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 15:13:54,272 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:13:54,320 - INFO - Renamed: 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain_Inspired_Computing__A_Systematic_Survey_and_Future_Trends.pdf
2025-05-08 15:13:54,946 - INFO - Matched: 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf -> idx: 47, excel_name: Brain Inspired Computing: A Systematic Survey and Future Trends
2025-05-08 15:13:54,946 - INFO - Change: 47-Brain Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain_Inspired_Computing__A_Systematic_Survey_and_Future_Trends.pdf
2025-05-08 15:13:54,946 - INFO - Processing 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf
2025-05-08 15:14:20,236 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:14:20,280 - INFO - Renamed: 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain-Inspired_Computing__A_Systematic_Survey_and_Future_Trends.pdf
2025-05-08 15:14:20,889 - INFO - Matched: 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf -> idx: 47, excel_name: Brain Inspired Computing: A Systematic Survey and Future Trends
2025-05-08 15:14:20,889 - INFO - Change: 47-Brain-Inspired Computing_ A Systematic Survey and Future Trends.pdf -> 47-Brain-Inspired_Computing__A_Systematic_Survey_and_Future_Trends.pdf
2025-05-08 15:14:20,890 - INFO - Processing 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf
2025-05-08 15:14:43,252 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:14:43,325 - INFO - Renamed: 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf -> 49-Efficient_Acceleration_of_Deep_Learning_Inference_on_Resource-Constrained_Edge_Devices__A_Review.pdf
2025-05-08 15:14:43,913 - INFO - Matched: 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf -> idx: 49, excel_name: Efficient acceleration of deep learning inference on resource-constrained edge devices: A review
2025-05-08 15:14:43,913 - INFO - Change: 49-Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices_ A Review.pdf -> 49-Efficient_Acceleration_of_Deep_Learning_Inference_on_Resource-Constrained_Edge_Devices__A_Review.pdf
2025-05-08 15:14:43,913 - INFO - Processing 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf
2025-05-08 15:15:04,932 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:15:04,990 - INFO - Renamed: 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf -> 52-FUSION_OF_AI_WITH_IOT_(AI2OT)__PARADIGM,_CURRENT_TRENDS,_FUTURE_DIRECTIONS.pdf
2025-05-08 15:15:05,569 - INFO - Matched: 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf -> idx: 52, excel_name: FUSION OF AI WITH IOT (AI2OT): PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS
2025-05-08 15:15:05,569 - INFO - Change: 52-FUSION OF AI WITH IOT (AI2OT)_ PARADIGM, CURRENT TRENDS, FUTURE DIRECTIONS.pdf -> 52-FUSION_OF_AI_WITH_IOT_(AI2OT)__PARADIGM,_CURRENT_TRENDS,_FUTURE_DIRECTIONS.pdf
2025-05-08 15:15:05,600 - INFO - Processing 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf
2025-05-08 15:15:19,423 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:15:19,515 - INFO - Renamed: 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf -> 53-An_Analog_Nearest_Class_with_Multiple_Centroids_Classifier_Implementation,_for_Depth_of_Anesthesia_Monitoring.pdf
2025-05-08 15:15:20,158 - INFO - Matched: 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf -> idx: 53, excel_name: An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring
2025-05-08 15:15:20,158 - INFO - Change: 53-An Analog Nearest Class with Multiple Centroids Classifier Implementation, for Depth of Anesthesia Monitoring.pdf -> 53-An_Analog_Nearest_Class_with_Multiple_Centroids_Classifier_Implementation,_for_Depth_of_Anesthesia_Monitoring.pdf
2025-05-08 15:15:20,159 - INFO - Processing 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf
2025-05-08 15:15:36,303 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:15:36,399 - INFO - Renamed: 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf -> 54-Floating_Gate_Transistor-Based_Accurate_Digital_In-Memory_Computing_for_Deep_Neural_Networks.pdf
2025-05-08 15:15:37,095 - INFO - Matched: 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf -> idx: 54, excel_name: Floating Gate Transistor‐Based Accurate Digital In‐Memory Computing for Deep Neural Networks
2025-05-08 15:15:37,095 - INFO - Change: 54-Floating Gate Transistor-Based Accurate Digital In-Memory Computing for Deep Neural Networks.pdf -> 54-Floating_Gate_Transistor-Based_Accurate_Digital_In-Memory_Computing_for_Deep_Neural_Networks.pdf
2025-05-08 15:15:37,095 - INFO - Processing 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf
2025-05-08 15:15:47,215 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:15:47,294 - INFO - Renamed: 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf -> 56-Design_Methodologies_and_Tools_for_Energy-aware_IoT-based_Applications.pdf
2025-05-08 15:15:47,872 - INFO - Matched: 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf -> idx: 56, excel_name: Design Methodologies and Tools for Energy-aware IoT-based Applications
2025-05-08 15:15:47,872 - INFO - Change: 56-Design Methodologies and Tools for Energy-aware IoT-based Applications.pdf -> 56-Design_Methodologies_and_Tools_for_Energy-aware_IoT-based_Applications.pdf
2025-05-08 15:15:47,872 - INFO - Processing 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf
2025-05-08 15:16:03,160 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:16:03,242 - INFO - Renamed: 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf -> 57-Arax__A_Runtime_Framework_for_Decoupling_Applications_from_Heterogeneous_Accelerators.pdf
2025-05-08 15:16:03,800 - INFO - Matched: 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf -> idx: 57, excel_name: Arax: a runtime framework for decoupling applications from heterogeneous accelerators
2025-05-08 15:16:03,801 - INFO - Change: 57-Arax_ A Runtime Framework for Decoupling Applications from Heterogeneous Accelerators.pdf -> 57-Arax__A_Runtime_Framework_for_Decoupling_Applications_from_Heterogeneous_Accelerators.pdf
2025-05-08 15:16:03,801 - INFO - Processing 61-A carbon-nanotube-based tensor processing unit.pdf
2025-05-08 15:16:23,008 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:16:23,048 - INFO - Renamed: 61-A carbon-nanotube-based tensor processing unit.pdf -> 61-A_carbon-nanotube-based_tensor_processing_unit.pdf
2025-05-08 15:16:23,671 - INFO - Matched: 61-A carbon-nanotube-based tensor processing unit.pdf -> idx: 61, excel_name: A carbon-nanotube-based tensor processing unit
2025-05-08 15:16:23,671 - INFO - Change: 61-A carbon-nanotube-based tensor processing unit.pdf -> 61-A_carbon-nanotube-based_tensor_processing_unit.pdf
2025-05-08 15:16:23,671 - INFO - Processing 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf
2025-05-08 15:16:49,182 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:16:49,268 - INFO - Renamed: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf -> 62-Advancements_in_Accelerating_Deep_Neural_Network_Inference_on_AIoT_Devices__A_Survey.pdf
2025-05-08 15:16:49,830 - INFO - Matched: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf -> idx: 62, excel_name: Advancements in accelerating deep neural network inference on aiot devices: A survey
2025-05-08 15:16:49,830 - INFO - Change: 62-Advancements in Accelerating Deep Neural Network Inference on AIoT Devices_ A Survey.pdf -> 62-Advancements_in_Accelerating_Deep_Neural_Network_Inference_on_AIoT_Devices__A_Survey.pdf
2025-05-08 15:16:49,830 - INFO - Processing ML_Processors_Are_Going_Multi-Core_A_performance_dream_or_a_scheduling_nightmare.pdf
2025-05-08 15:16:57,893 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:16:57,983 - INFO - Renamed: ML_Processors_Are_Going_Multi-Core_A_performance_dream_or_a_scheduling_nightmare.pdf -> 59-ML_Processors_Are_Going_Multi-Core__A_performance_dream_or_a_scheduling_nightmare_.pdf
2025-05-08 15:16:58,633 - INFO - Matched: ML_Processors_Are_Going_Multi-Core_A_performance_dream_or_a_scheduling_nightmare.pdf -> idx: 59, excel_name: ML Processors Are Going Multi-Core: A performance dream or a scheduling nightmare?
2025-05-08 15:16:58,633 - INFO - Change: ML_Processors_Are_Going_Multi-Core_A_performance_dream_or_a_scheduling_nightmare.pdf -> 59-ML_Processors_Are_Going_Multi-Core__A_performance_dream_or_a_scheduling_nightmare_.pdf
2025-05-08 15:16:58,633 - INFO - Processing MotionDeltaCNN_Sparse CNN Inference of Frame Differences in Moving Camera.pdf
2025-05-08 15:17:14,210 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:17:14,254 - INFO - Renamed: MotionDeltaCNN_Sparse CNN Inference of Frame Differences in Moving Camera.pdf -> 30-MotionDeltaCNN__Sparse_CNN_Inference_of_Frame_Differences_in_Moving_Camera_Videos_with_Spherical_Buffers_and_Padded_Convolutions.pdf
2025-05-08 15:17:14,856 - INFO - Matched: MotionDeltaCNN_Sparse CNN Inference of Frame Differences in Moving Camera.pdf -> idx: 30, excel_name: MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos with Spherical Buffers and Padded Convolutions
2025-05-08 15:17:14,856 - INFO - Change: MotionDeltaCNN_Sparse CNN Inference of Frame Differences in Moving Camera.pdf -> 30-MotionDeltaCNN__Sparse_CNN_Inference_of_Frame_Differences_in_Moving_Camera_Videos_with_Spherical_Buffers_and_Padded_Convolutions.pdf
2025-05-08 15:17:14,856 - INFO - Processing Motivation_for_and_Evaluation_of_the_First_Tensor_Processing_Unit.pdf
2025-05-08 15:17:23,101 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:17:23,109 - INFO - Renamed: Motivation_for_and_Evaluation_of_the_First_Tensor_Processing_Unit.pdf -> 4-Top_Picks_Motivation_for_and_Evaluation_of_the_First_Tensor_Processing_Unit.pdf
2025-05-08 15:17:23,740 - INFO - Matched: Motivation_for_and_Evaluation_of_the_First_Tensor_Processing_Unit.pdf -> idx: 4, excel_name: Motivation for and evaluation of the first tensor processing unit
2025-05-08 15:17:23,740 - INFO - Change: Motivation_for_and_Evaluation_of_the_First_Tensor_Processing_Unit.pdf -> 4-Top_Picks_Motivation_for_and_Evaluation_of_the_First_Tensor_Processing_Unit.pdf
2025-05-08 15:17:23,740 - INFO - Processing Multi-ANN_embedded_system_based_on_a_custom_3D-DRAM.pdf
2025-05-08 15:17:39,193 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:17:39,206 - INFO - Renamed: Multi-ANN_embedded_system_based_on_a_custom_3D-DRAM.pdf -> 14-Multi-ANN_embedded_system_based_on_a_custom_3D-DRAM.pdf
2025-05-08 15:17:39,762 - INFO - Matched: Multi-ANN_embedded_system_based_on_a_custom_3D-DRAM.pdf -> idx: 14, excel_name: Multi-ANN embedded system based on a custom 3D-DRAM
2025-05-08 15:17:39,762 - INFO - Change: Multi-ANN_embedded_system_based_on_a_custom_3D-DRAM.pdf -> 14-Multi-ANN_embedded_system_based_on_a_custom_3D-DRAM.pdf
2025-05-08 15:17:39,762 - INFO - Processing Neuromorphic_and_In_Memory_Computing_Bas.pdf
2025-05-08 15:17:59,096 - INFO - HTTP Request: POST https://api.siliconflow.cn/v1/chat/completions "HTTP/1.1 200 OK"
2025-05-08 15:17:59,152 - INFO - Renamed: Neuromorphic_and_In_Memory_Computing_Bas.pdf -> 24-Neuromorphic_and_In-Memory_Computing_Based_on_Memristive_Circuits_for_Predictive_Maintenance_and_Supply-Chain_Management_and_Security.pdf
2025-05-08 15:17:59,714 - INFO - Matched: Neuromorphic_and_In_Memory_Computing_Bas.pdf -> idx: 24, excel_name: Neuromorphic and In-Memory Computing Based on Memristive Circuits for Predictive Maintenance and Supply-Chain Security
2025-05-08 15:17:59,715 - INFO - Change: Neuromorphic_and_In_Memory_Computing_Bas.pdf -> 24-Neuromorphic_and_In-Memory_Computing_Based_on_Memristive_Circuits_for_Predictive_Maintenance_and_Supply-Chain_Management_and_Security.pdf
2025-05-08 15:17:59,715 - INFO - Processing OD-SGD_One-Step Delay Stochastic Gradient Descent.pdf
...@@ -15,8 +15,6 @@ if __name__ == "__main__": ...@@ -15,8 +15,6 @@ if __name__ == "__main__":
# Path对象后跟/用于连接地址 # Path对象后跟/用于连接地址
# print(excel_path)
# 创建日志目录 # 创建日志目录
log_dir = cwd_dir / "logs" log_dir = cwd_dir / "logs"
log_dir.mkdir(exist_ok=True) log_dir.mkdir(exist_ok=True)
......
...@@ -6,7 +6,7 @@ import openpyxl ...@@ -6,7 +6,7 @@ import openpyxl
from fuzzywuzzy import fuzz from fuzzywuzzy import fuzz
import json import json
def get_authors( content, configModel, client): def get_key_info( content, configModel, client):
system_prompt = """ system_prompt = """
Act as an expert metadata extraction assistant. Act as an expert metadata extraction assistant.
Analyze the following text, which is extracted from the first page of a document (likely a scientific paper or report). Analyze the following text, which is extracted from the first page of a document (likely a scientific paper or report).
...@@ -21,9 +21,37 @@ def get_authors( content, configModel, client): ...@@ -21,9 +21,37 @@ def get_authors( content, configModel, client):
- Extract all associated institutions of authors. - Extract all associated institutions of authors.
- **Countrys:** - **Countrys:**
- Extract all associated countrys of authors. - Extract all associated countrys of authors.
- **ISSUE:**
- Extract where the paper is published like journal or session.
- Title, authors, institutions and countrys should be four separate keys, not nested together. - Title, authors, institutions and countrys should be four separate keys, not nested together.
- Use highcase for first letter of key. - Use highcase for first letter of key.
- **Handling Missing Data:** If no data of a field can be identified in the text, the field in the JSON should be an empty list `[]`. - **Handling Missing Data:** If no data of a field can be identified in the text, the field in the JSON should be an empty list `[]`.
Example Output:
{
"Title": "Laius: Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters",
"Authors": [
"Quan Chen",
"Daniel Edward Mawhirter",
"Bo Wu",
"Chao Li",
],
"Institutions": [
"Shanghai Jiao Tong University",
"Colorado School of Mines",
"Colorado School of Mines",
"Shanghai Jiao Tong University",
],
"Countrys": [
"China",
"United States",
"United States",
"China",
],
"ISSURE": [
"IEEE Transactions on Computers"
]
}
""" """
response = client.chat.completions.create( response = client.chat.completions.create(
...@@ -130,7 +158,7 @@ def citationProcess(config: dict): ...@@ -130,7 +158,7 @@ def citationProcess(config: dict):
configModel = config["model"] configModel = config["model"]
# 提取关键信息 # 提取关键信息
result = get_authors(first_page_text, configModel, client) result = get_key_info(first_page_text, configModel, client)
if result is not None: if result is not None:
# 解析JSON结果, 提取论文标题 # 解析JSON结果, 提取论文标题
...@@ -149,7 +177,7 @@ def citationProcess(config: dict): ...@@ -149,7 +177,7 @@ def citationProcess(config: dict):
if similarity >= 85: if similarity >= 85:
# 重命名PDF文件 # 重命名PDF文件
new_pdf_name = f"{idx}-{pdf_title.replace(':', '_')}.pdf" # 将冒号替换为连字符 new_pdf_name = f"{idx}-{pdf_title.replace(':', '_').replace(' ', '_').replace('?', '_')}.pdf" # 将冒号替换为连字符
new_pdf_path = file.parent / new_pdf_name new_pdf_path = file.parent / new_pdf_name
try: try:
file.rename(new_pdf_path) file.rename(new_pdf_path)
......
abbr,fullname
PPoPP,ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming
FAST,USENIX Conference on File and Storage Technologies
DAC,Design Automation Conference
HPCA,IEEE International Symposium on High Performance Computer Architecture
MICRO,IEEE/ACM International Symposium on Microarchitecture
SC,"International Conference for High Performance Computing, Networking, Storage, and Analysis"
ASPLOS,International Conference on Architectural Support for Programming Languages and Operating Systems
ISCA,International Symposium on Computer Architecture
USENIX ATC,USENIX Annual Technical Conference
EuroSys,European Conference on Computer Systems
SIGCOMM,"ACM International Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication"
MobiCom,ACM International Conference on Mobile Computing and Networking
INFOCOM,IEEE International Conference on Computer Communications
NSDI,Symposium on Network System Design and Implementation
CCS,ACM Conference on Computer and Communications Security
EUROCRYPT,International Conference on the Theory and Applications of Cryptographic Techniques
S&P,IEEE Symposium on Security and Privacy
CRYPTO,International Cryptology Conference
USENIX Security,USENIX Security Symposium
NDSS,Network and Distributed System Security Symposium
PLDI,ACM SIGPLAN Conference on Programming Language Design and Implementation
POPL,ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages
FSE,ACM International Conference on the Foundations of Software Engineering
SOSP,ACM Symposium on Operating Systems Principles
OOPSLA,"Conference on Object-Oriented Programming Systems, Languages,and Applications"
ASE,International Conference on Automated Software Engineering
ICSE,International Conference on Software Engineering
ISSTA,International Symposium on Software Testing and Analysis
OSDI,USENIX Symposium on Operating Systems Design and Implementations
FM,International Symposium on Formal Methods
SIGMOD,ACM SIGMOD Conference
SIGKDD,ACM SIGKDD Conference on Knowledge Discovery and Data Mining
ICDE,IEEE International Conference on Data Engineering
SIGIR,International ACM SIGIR Conference on Research and Development in Information Retrieval
VLDB,International Conference on Very Large Data Bases
STOC,ACM Symposium on Theory of Computing
SODA,ACM-SIAM Symposium on Discrete Algorithms
CAV,International Conference on Computer Aided Verification
FOCS,IEEE Annual Symposium on Foundations of Computer Science
LICS,ACM/IEEE Symposium on Logic in Computer Science
ACM MM,ACM International Conference on Multimedia
SIGGRAPH,ACM Special Interest Group on Computer Graphics
VR,IEEE Virtual Reality
IEEE VIS,IEEE Visualization Conference
AAAI,AAAI Conference on Artificial Intelligence
NeurIPS,Conference on Neural Information Processing Systems
ACL,Annual Meeting of the Association for Computational Linguistics
CVPR,IEEE/CVF Computer Vision and Pattern Recognition Conference
ICCV,International Conference on Computer Vision
ICML,International Conference on Machine Learning
IJCAI,International Joint Conference on Artificial Intelligence
CSCW,ACM Conference on Computer Supported Cooperative Work and Social Computing
CHI,ACM Conference on Human Factors in Computing Systems
UbiComp/IMWUT,"ACM international joint conference on Pervasive and Ubiquitous Computing/ Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies"
UIST,ACM Symposium on User Interface Software and Technology
WWW,International World Wide Web Conference
RTSS,IEEE Real-Time Systems Symposium
WINE,Conference on Web and Internet Economics
TOCS,ACM Transactions on Computer Systems
TOS,ACM Transactions on Storage
TCAD,IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
TC,IEEE Transactions on Computers
TPDS,IEEE Transactions on Parallel and Distributed Systems
TACO,ACM Transactions on Architecture and Code Optimization
JSAC,IEEE Journal on Selected Areas in Communications
TMC,IEEE Transactions on Mobile Computing
TON,IEEE/ACM Transactions on Networking
TDSC,IEEE Transactions on Dependable and Secure Computing
TIFS,IEEE Transactions on Information Forensics and Security
,Journal of Cryptology
TOPLAS,ACM Transactions on Programming Languages and Systems
TOSEM,ACM Transactions on Software Engineering and Methodology
TSE,IEEE Transactions on Software Engineering
TSC,IEEE Transactions on Services Computing
TODS,ACM Transactions on Database Systems
TOIS,ACM Transactions on Information Systems
TKDE,IEEE Transactions on Knowledge and Data Engineering
VLDBJ,The VLDB Journal
TIT,IEEE Transactions on Information Theory
IANDC,Information and Computation
SICOMP,SIAM Journal on Computing
TOG,ACM Transactions on Graphics
TIP,IEEE Transactions on Image Processing
TVCG,IEEE Transactions on Visualization and Computer Graphics
AI,Artificial Intelligence
TPAMI,IEEE Transactions on Pattern Analysis and Machine Intelligence
IJCV,International Journal of Computer Vision
JMLR,Journal of Machine Learning Research
TOCHI,ACM Transactions on Computer-Human Interaction
IJHCS,International Journal of Human-Computer Studies
JACM,Journal of the ACM
Proc. IEEE,Proceedings of the IEEE
SCIS,Science China Information Sciences
title,venue
Learning to Generalize With Object-Centric Agents in the Open World Survival Game Crafter,"IEEE Transactions on Games ( Volume: 16, Issue: 2, June 2024)"
Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning,"Proceedings of the AAAI Conference on Artificial Intelligence, 3"
"Advancing DRL Agents in Commercial Fighting Games: Training, Integration,
and Agent-Human Alignment",Proceedings of the 41th International Conference on Machine Learning (ICML 2024)
Discovering and Using Structure in Autonomous Machine Learning,ETH Zurich thesis
Jose Luis Flores Campana,博士论文
"""
功能:判断论文是否属于CCF-A类会议
输入文件:CCF_A_list.csv,包含两列:abbr,fullname
输入文件夹:data/title_venue,包含若干csv文件,每个文件的标题栏是title,venue
输出文件夹:data/is_ccfa,对用title_venue文件夹下的每个csv文件进行处理,输出文件名相同,标题栏是title,venue,is_ccf_a
"""
import os
import pandas as pd
import re
from rapidfuzz import fuzz, process
from collections import defaultdict
# 路径参数
input_dir = "data/title_venue"
output_dir = "data/is_ccfa"
ccf_a_file = "CCF_A_list.csv" # 请根据实际路径调整
os.makedirs(output_dir, exist_ok=True)
# 读取CCF-A简称和全称
ccf_a = pd.read_csv(ccf_a_file)
# 缩写全部去空格
ccf_a_abbrs = set(ccf_a['abbr'].dropna().str.lower().str.replace(' ','').str.strip().tolist())
ccf_a_fullnames = set(ccf_a['fullname'].dropna().str.lower().str.strip().tolist())
ccf_a_names = ccf_a_abbrs | ccf_a_fullnames
def clean_venue(venue):
if pd.isnull(venue) or str(venue).strip() == "":
return ""
text = str(venue)
text = re.sub(r'[\(\(][^)\)]*?volume[^)\)]*?[\)\)]', '', text, flags=re.IGNORECASE)
text = re.sub(r"[0-9'\",\.\!\/\\\?\;\-\_]", "", text)
return text.lower().strip()
def extract_brackets(text):
match = re.search(r'[\(\(](.*?)[\)\)]', text)
if match:
return match.group(1).strip().lower()
return None
def extract_before_colon(text):
parts = re.split(r'[::]', text)
if len(parts) > 1:
return parts[0].strip().lower()
return None
def is_ccf_a(venue, threshold=70):
if pd.isnull(venue) or str(venue).strip() == "":
return 0
venue_clean = venue.lower().strip()
if 'south pacific design automation conference' in venue_clean:
return 0
if 'International Journal' in venue_clean and ('Computer Vision' not in venue_clean or 'Human-Computer Studies' not in venue_clean):
return 0
if 'ieee micro' in venue_clean or 'design automation conference' in venue_clean or 'ieee transactions on parallel & distributed systems' in venue_clean or 'ieee transactions on computers' in venue_clean or 'transactions on architecture and code optimization' in venue_clean or '信息科学' in venue_clean or 'pldi' in venue_clean or 'transactions on computer systems' in venue_clean or 'conference on computer vision and pattern recognition' in venue_clean or 'transactions on computeraided design of integrated circuits and systems' in venue_clean or 'aaai conference on artificial intelligence' in venue_clean or 'mm ’' in venue_clean or 'transactions on parallel and distributed systems' in venue_clean:
return 1
if 'ijcnn' in venue_clean or 'iccd' in venue_clean or 'icpr' in venue_clean or 'aicas' in venue_clean or 'acm transactions on embedded computing systems' in venue_clean or 'iccad' in venue_clean or 'artificial intelligence review' in venue_clean or 'ieee journal of selected topics in quantum electronics' in venue_clean or 'future generation computer systems' in venue_clean or 'ieee transactions on services computing' in venue_clean or 'ipdps' in venue_clean or 'isocc' in venue_clean or 'microprocessors and microsystems' in venue_clean or 'ccet' in venue_clean or 'microelectronics' in venue_clean or 'asid' in venue_clean or 'caai' in venue_clean or 'euromicro conference on' in venue_clean or 'asicon' in venue_clean or 'transactions on system and lsi design methodology' in venue_clean or 'selected areas in communications' in venue_clean or 'high performance computing data and analytics' in venue_clean or 'innovation communication and engineering' in venue_clean or 'sysml conference' in venue_clean or 'international conference on digital signal processing' in venue_clean or 'international conference on computer and communication systems' in venue_clean or 'ieee transactions on services computing' in venue_clean or 'international symposium on performance analysis of systems and software' in venue_clean or 'wireless communications and mobile computing' in venue_clean or 'sigops operating systems review' in venue_clean or 'siggraphasia' in venue_clean or 'international journal of computational intelligence systems' in venue_clean or 'journal of ikeee' in venue_clean or 'artificial intelligence and security' in venue_clean or 'journal of software' in venue_clean or 'advances in artificial intelligence' in venue_clean or 'international symposium on networksonchip' in venue_clean or 'international journal of web information systems' in venue_clean or 'journal of timecritical computing systems' in venue_clean or 'journal of advanced computer science and applications' in venue_clean or 'conference on computing and informatics' in venue_clean or 'journal of big data' in venue_clean or 'journal of parallel programming' in venue_clean or 'international conference on computer and communications' in venue_clean or 'journal of supercomputing' in venue_clean or 'conference on computing frontiers' in venue_clean or 'international conference on industry applications' in venue_clean or 'artificial intelligence advances' in venue_clean or 'pattern recognition and artificial intelligence' in venue_clean or 'journal of electronics and communications' in venue_clean or 'journal of data science and analytics' in venue_clean or 'journal of parallel emergent and distributed systems' in venue_clean or 'conference on advances in electrical engineering' in venue_clean or 'international conference on frontiers in computing and systems' in venue_clean or 'conference on sentiment analysis and deep learning' in venue_clean or 'acm symposium on cloud computing' in venue_clean or 'conference on tools with artificial intelligence' in venue_clean or 'conference on supercomputing' in venue_clean or 'frontiers in artificial intelligence ' in venue_clean or 'conference on communication technology' in venue_clean or 'big data information and computer network' in venue_clean or 'conference on software engineering education and training' in venue_clean or 'journal of computers' in venue_clean or 'washington' in venue_clean or 'international conference on cloud computing' in venue_clean or 'acm on programming languages' in venue_clean or 'conference on high performance computing' in venue_clean or 'ubm designcon conference' in venue_clean or 'scientia sinica informationis' in venue_clean or 'symposium on computing and networking' in venue_clean or 'journal of mathematics' in venue_clean or 'iet image processing' in venue_clean or 'journal of computing and digital systems' in venue_clean or 'conference on very large scale integration' in venue_clean or 'journal of networking and computing' in venue_clean or 'transactions on cryptographic hardware and embedded systems' in venue_clean or 'International Conference on Hybrid Intelligent Systems' in venue_clean or 'international conference on asic' in venue_clean or 'international conference on nexgen technologies' in venue_clean or 'international conference on computer and information technology' in venue_clean or 'international journal of high performance systems architecture' in venue_clean or 'international conference on power and energy engineering' in venue_clean or 'international symposium on smart electronic systems' in venue_clean or 'international symposium on smart electronic systems' in venue_clean:
return 0
# 特判 IEEE Transactions on 开头
if venue_clean.startswith("ieee transactions on") or venue_clean.startswith("ieee international conference on") or venue_clean.startswith("ccf transactions on") or venue_clean.startswith("engineering applications of") or venue_clean.startswith("acm transactions on") or venue_clean.startswith("international conference on") or venue_clean.startswith("ieice transactions on"):
ccf_a_fullnames_clean = {clean_venue(name) for name in ccf_a_fullnames}
if venue_clean in ccf_a_fullnames_clean:
return 1
else:
return 0
# 1. 括号内简称匹配(只做去空格完全匹配)
bracket_content = extract_brackets(venue_clean)
if bracket_content:
bracket_content_no_space = bracket_content.replace(' ', '')
if bracket_content_no_space in ccf_a_abbrs:
return 1
# 2. 冒号前内容匹配(只做去空格完全匹配)
colon_content = extract_before_colon(venue_clean)
if colon_content:
colon_content_no_space = colon_content.replace(' ', '')
if colon_content_no_space in ccf_a_abbrs:
return 1
# 3. venue_clean整体匹配
if len(venue_clean) <= 6:
# 长度小于等于6,只做精确匹配
if venue_clean in ccf_a_names:
return 1
else:
return 0
else:
# 正常模糊匹配
if venue_clean in ccf_a_names:
return 1
result = process.extractOne(
venue_clean,
ccf_a_names,
scorer=fuzz.ratio
)
if result:
match, score, _ = result
if score >= threshold:
return 1
else:
return 0
else:
return 0
processed_files = [
# 'c48_DaDiannao.csv',
# 'j20-DaDianNao.csv',
# 'j24-DianNao family.csv',
# 'c43-DianNao.csv',
# 'c35-Cambricon-X.csv',
# 'c34-Cambricon.csv',
# '',
# 'c40-shidiannao.csv',
# 'c32.csv',
# 'c53-BenchNN.csv',
# 'c66-Fast complete memory.csv',
# 'j53-System Architecture.csv',
# 'z45.csv',
# 'j29.csv',
# 'c64.csv',
# 'c41.csv',
# 'j35-A small-footprint.csv',
# 'j21-An Accelerator for High.csv',
# 'c51-Statistical....csv',
# 'c47.csv',
# 'c30.csv',
# 'c57.csv',
# 'c28.csv',
# 'j16.csv',
# 'c36-PuDianNao.csv'
]
file_list = [f for f in os.listdir(input_dir) if f.endswith('.csv')]
count = 0
for fname in file_list:
count += 1
# if count <= 79:
# continue
print(count)
if fname in processed_files:
print(f"Skipping {fname} ...")
continue
input_path = os.path.join(input_dir, fname)
output_path = os.path.join(output_dir, fname)
print(f"Processing {input_path} ...")
# breakpoint()
papers = pd.read_csv(input_path)
if 'title' not in papers.columns or 'venue' not in papers.columns:
print(f"Warning: {fname} does not have required columns.")
continue
papers['venue_clean'] = papers['venue'].apply(clean_venue)
venue_map = defaultdict(list)
for idx, row in papers.iterrows():
venue_map[row['venue_clean']].append(idx)
venue_ccfa_dict = {}
cnt = 0
for v_clean in venue_map:
venue_ccfa_dict[v_clean] = is_ccf_a(v_clean)
print(f'{v_clean} -> {venue_ccfa_dict[v_clean]}')
print(f'still have {len(venue_map) - len(venue_ccfa_dict)} left')
cnt += 1
# if cnt % 10 == 0:
# breakpoint()
is_ccf_a_list = [venue_ccfa_dict[vc] for vc in papers['venue_clean']]
papers['is_ccf_a'] = is_ccf_a_list
papers[['title', 'venue', 'is_ccf_a']].to_csv(output_path, index=False, encoding='utf-8-sig')
print(f"Saved to {output_path}")
print("全部处理完成!")
\ No newline at end of file
# !注意:这个脚本可能判断不准,请一定人工检查!
## 用法:python is_ccfa.py
## 功能:判断论文是否属于CCF-A类会议
- 输入文件:`CCF_A_list.csv`,包含两列:`abbr,fullname`
- 输入文件夹:`data/title_venue`,包含若干csv文件,每个文件的标题栏是`title,venue`
- 输出文件夹:`data/is_ccfa`,对用`title_venue`文件夹下的每个csv文件进行处理,输出文件名相同,标题栏是`title,venue,is_ccf_a`
\ No newline at end of file
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment