


default search action
CCF Transactions on High Performance Computing, Volume 7
Volume 7, Number 1, February 2025
- Hanzheng Liang, Chencheng Deng, Peng Zhang, Jianbin Fang, Tao Tang

, Chun Huang:
An empirical performance evaluation of SYCL on ARM multi-core processors. 1-16 - Youxuan Xu, Tong Wu, Shigang Li

, Xueying Wang, Jingjing Wang:
SparkAttention: high-performance multi-head attention for large models on Volta GPU architecture. 17-28 - Tao Huang, Yonggui Liang, Shubao Yu, Kexin Chen:

TxCocket: an innovative solution for efficient cross-node data transmission enabled by CXL-based shared memory. 29-42 - Wenhao Dai, Ziyi Jia, Yuesi Bai, Qingxiao Sun

:
Convergence-aware operator-wise mixed-precision training. 43-57 - Jin Zhang, Jincheng Zhou, Xiang Zhang, Di Ma, Chunye Gong:

Fine-grained vectorized merge sorting on RISC-V: from register to cache. 58-71 - Muchun Peng, Qinglin Wang, Yuechao Liang, Weihao Guo, Shun Yang, Yaling Liang, Yongzhen Shi, Ligang Cao, Jie Liu:

GreenB+Tree: an energy-efficient B+tree for MIMD architectures. 72-84
Volume 7, Number 2, April 2025
- Pin Chen

, Qing Mo, Zexin Xu, Xianwei Zhang, Yutong Lu:
Star-gen: an HPC-AI framework for constructing large-scale computational materials database. 85-99 - Wentao Feng, Shizhe Shang, Pengfei Li, Hailong Yang, Zhongzhi Luan

, Depei Qian:
SyncNOVA: an end-to-end fine-grained profiling tool oN lOck behaVior detection and critical section diAgnosis. 100-113 - Ningxi Tian, Silu Huang, Xiaowen Xu

:
Mixed precision block-Jacobi preconditioner: algorithms, performance evaluation and feature analysis. 114-128 - Jianfei Xu, Lianhua He, Zhong Jin:

Mixed precision SpMV on GPUs for irregular data with hierarchical precision selection. 129-141 - Wenlong Fan, Haobo Hua

, Jiandong Shang
, Zhuxin Wen, Hengliang Guo, Litao Zhang:
Optimizing 2D convolution for DCUs. 142-154 - Xiangyu Meng, Xun Wang

, Mingzhen Li, Guangming Tan, Weile Jia:
An interpretable DeePMD-kit performance model for emerging supercomputers. 155-168 - Heming Zhong, Xiaojian Pan, Zengquang He, Haoling Wang, Dan Huang, Zhiguang Chen:

GPU acceleration for DNA sequence alignment algorithm and its application. 169-177
Volume 7, Number 3, June 2025
- Zhao Mao, Xingjun Zhang

, Longxiang Wang
:
KANETAS: an elastic scheduler for heterogeneous many-core systems. 179-193 - Dongting Chen, Jie Shen, Chun Huang, Xin Yi:

An empirical study of error-free transformations for enhancing mathematical function precision. 194-210 - Hengzhong Liang, Han Huang, Xianwei Zhang

:
SuCL: supply unified communication layer to improve SYCL-based heterogeneous computing. 211-225 - Zhangjie Tan, Jinfang Jia, Zhengsheng Ning, Jianqiang Huang, Xiaoying Wang:

Research on GPU transplantation optimization of PRM scalar advection scheme in GRAPES global forecast system. 226-244 - Yalin Zhu, Youquan Chang, Jiapeng Zhang

, Yingjie Song, Zhuo Tang:
An optimized hierarchical MapReduce framework in supercomputing Internet environment. 245-259 - Da Huo, Xin You, Zhibo Xuan, Hailong Yang

, Zhongzhi Luan, Depei Qian:
Hotspy: identifying performance hotspot with graph neural network based static analysis. 260-274 - Yunkun Liao, Jingya Wu

, Wenyan Lu, Huawei Li
, Xiaowei Li, Guihai Yan:
FUS: FPGA-based Universal Sketch with homogeneous and heterogeneous memory architectures. 275-290 - Ronghui Cao, Peng Zhang, Yiming Wu, Jun Liu, Haibin Su:

Adaptive container scheduling based on reinforcement learning in kubernetes. 291-304
Volume 7, Number 5, October 2025
- Kai Di

, Pan Li, Tienyu Zuo, Fulin Chen, Yuanshuang Jiang, Lei Kong, Yichuan Jiang, Dan Chen:
Optimizing data interaction strategies for unreliable agents in multiplex networked industrial environments. 379-402 - Xiaoyong Tang, Xiaotian Li, Ronghui Cao:

An online resource-aware leader election algorithm based on Kubernetes load balancing. 403-412 - Edward Chuah

, Arshad Jhumka, Sai Narasimhamurthy, Aladdin Ayesh
:
Deep learning-based prediction of major page faults in cluster systems. 413-430 - Xiaoyong Fan, Yuan Zhuang, Yunhui Zeng

:
A barotropic solver capable of reducing global synchronization latency in parallel ocean program. 431-446 - Xiaoning Wang, Yining Zhao, Shasha Lu, Haili Xiao:

Practice and observation: live migration for MPI workload. 447-464 - Mengsi He, Zhongming Fu

, Wenlong Tian:
Optimization of fault tolerance for iterative graph algorithm in spark GraphX based on high performance computing cluster. 465-477
Volume 7, Number 6, December 2025
- Lin Zhu, Qiang-Sheng Hua

, Hai Jin:
A parallel all-pairs shortest paths algorithm for dynamic graphs. 479-493 - Chunru Dong, Junyuan Liu, Qiang Hua, Jiahong Tang, Feng Zhang:

IEPT: input-enhanced prompt tuning for visual-language models. 494-508 - Deyou Tang

, Jialang Liang, Pingjian Zhang, Qingwen Deng:
ScalableAligner: a fast NGS mapping tool for shared-memory system. 509-522 - Jiaming Hu, Chuangbo Hao, Mengzhen Li, Yuhao Wang, Maowen Lu, Dachuan Xu:

Defeating decoys: deletion-robust submodular optimization for UAV swarm target assignment problem. 523-536 - Yaxin Li, Xuebin Chi

, Jinrong Jiang
, Run Guo
, Lian Zhao
, Chen Li
, Yidi Bai, Junlin Wei
, Xiang Han, Guangqing Zhou:
A parallel algorithm for an Ocean General Circulation Model based on a unified dynamics framework. 537-555 - Shiqiang Nie

, Tingshen Ruan, Ruijia Chen, Bo Song, Song Liu, Weiguo Wu:
ZeroCopy: file system assisted container buffer migration in cloud computing system. 556-573 - Song Shi, Jinfang Jia, Wandong Xue, Jianqiang Huang:

Collaborative pseudo-label transfer for few-shot unsupervised domain adaptation. 574-588 - Jiashu Yao, Junmin Xiao

, Baokang Xie, Shilong Xu, Xi Chen, Yunfei Pang, Mingyi Li, Hui Ma, Yun Song, Guangming Tan:
Hiperti: high performance system for cross-platform code generation of transformer model inference based on MLIR. 589-622 - Yinling Wang, Yuping Ge, Yubai Zhang, Hui Tian:

A new approximation algorithm for two-machine flow shop scheduling with transporter coordinate. 623-631 - Zhicheng Yao, Wenguo Yang

:
Reinforcement learning for airline multi-class continuous dynamic pricing. 632-642 - Yunjie Bai, Xuezhi Wu, Aimin Yang:

HPC-optimized hybrid XGBoost-MLP model for large-scale pellet metallurgical performance prediction. 643-651 - Bin Deng, Guangqin Hu, Weidong Li

, Jin Xu:
Multi-resource any price share fair allocation with placement constraints and an external resource in cloud-edge collaboration systems. 652-670 - Kaijia Luo, Haibin Zhu, Dongning Liu:

Solving the allocation problem of reentrant production via group role assignment. 671-688

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














