Keyword : GPU


A Novel Procedure for Implementing a Turbo Decoder on a GPU with Coalesced Memory Access
Heungseop AHN Seungwon CHOI 
Publication:   
Publication Date: 2017/05/01
Vol. E100-A  No. 5 ; pp. 1188-1196
Type of Manuscript:  PAPER
Category: Communication Theory and Signals
Keyword: 
GPUCUDAturbo decodercoalesced memory accessSDR
 Summary | Full Text:PDF(837.5KB)

Cache-Aware, In-Place Rotation Method for Texture-Based Volume Rendering
Yuji MISAKI Fumihiko INO Kenichi HAGIHARA 
Publication:   
Publication Date: 2017/03/01
Vol. E100-D  No. 3 ; pp. 452-461
Type of Manuscript:  PAPER
Category: Fundamentals of Information Systems
Keyword: 
cache optimizationvolume renderingin-place algorithmGPUCUDA
 Summary | Full Text:PDF(1.1MB)

Geometry Clipmaps Terrain Rendering Using Hardware Tessellation
Ge SONG Hongyu YANG Yulong JI 
Publication:   
Publication Date: 2017/02/01
Vol. E100-D  No. 2 ; pp. 401-404
Type of Manuscript:  LETTER
Category: Computer Graphics
Keyword: 
terrain renderingGPUtessellation shadergeometry clipmaps
 Summary | Full Text:PDF(1.5MB)

Fully Parallelized LZW Decompression for CUDA-Enabled GPUs
Shunji FUNASAKA Koji NAKANO Yasuaki ITO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/12/01
Vol. E99-D  No. 12 ; pp. 2986-2994
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: GPU computing
Keyword: 
data compressionbig dataparallel algorithmGPUCUDA
 Summary | Full Text:PDF(428KB)

Cache-Aware GPU Optimization for Out-of-Core Cone Beam CT Reconstruction of High-Resolution Volumes
Yuechao LU Fumihiko INO Kenichi HAGIHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/12/01
Vol. E99-D  No. 12 ; pp. 3060-3071
Type of Manuscript:  PAPER
Category: Computer System
Keyword: 
cone beam reconstructionGPUCUDAcache optimization
 Summary | Full Text:PDF(1.4MB)

GPU-Accelerated Bulk Execution of Multiple-Length Multiplication with Warp-Synchronous Programming Technique
Takumi HONDA Yasuaki ITO Koji NAKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/12/01
Vol. E99-D  No. 12 ; pp. 3004-3012
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: GPU computing
Keyword: 
multiple-length multiplicationGPUGPGPUparallel processingwarp-synchronous
 Summary | Full Text:PDF(523KB)

A Memory-Access-Efficient Implementation for Computing the Approximate String Matching Algorithm on GPUs
Lucas Saad Nogueira NUNES Jacir Luiz BORDIM Yasuaki ITO Koji NAKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/12/01
Vol. E99-D  No. 12 ; pp. 2995-3003
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: GPU computing
Keyword: 
approximate string matchingedit distanceGPUCUDAshuffle instructions
 Summary | Full Text:PDF(2MB)

BLM-Rank: A Bayesian Linear Method for Learning to Rank and Its GPU Implementation
Huifeng GUO Dianhui CHU Yunming YE Xutao LI Xixian FAN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2016/04/01
Vol. E99-D  No. 4 ; pp. 896-905
Type of Manuscript:  Special Section PAPER (Special Section on Data Engineering and Information Management)
Category: 
Keyword: 
rankingBayesian Personalized Rankingstochastic gradient methodGPU
 Summary | Full Text:PDF(806.3KB)

Implementation of Viterbi Decoder toward GPU-Based SDR Receiver
Kosuke TOMITA Masahide HATANAKA Takao ONOYE 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2015/11/01
Vol. E98-A  No. 11 ; pp. 2246-2253
Type of Manuscript:  Special Section PAPER (Special Section on Smart Multimedia & Communication Systems)
Category: 
Keyword: 
Viterbi decoderTVDAGPUCUDASDR
 Summary | Full Text:PDF(1.2MB)

Offline Permutation on the CUDA-enabled GPU
Akihiko KASAGI Koji NAKANO Yasuaki ITO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/12/01
Vol. E97-D  No. 12 ; pp. 3052-3062
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: GPU
Keyword: 
memory machine modelsoffline permutationGPUCUDA
 Summary | Full Text:PDF(1.2MB)

An Optimal Implementation of the Approximate String Matching on the Hierarchical Memory Machine, with Performance Evaluation on the GPU
Duhu MAN Koji NAKANO Yasuaki ITO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/12/01
Vol. E97-D  No. 12 ; pp. 3063-3071
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: GPU
Keyword: 
memory machine modelsapproximate string matchingedit distanceGPUCUDA
 Summary | Full Text:PDF(584.8KB)

Toward Concurrent Lock-Free Queues on GPUs
Xiangyu ZHANG Yangdong DENG Shuai MU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/07/01
Vol. E97-D  No. 7 ; pp. 1901-1904
Type of Manuscript:  LETTER
Category: Fundamentals of Information Systems
Keyword: 
FIFO queueconcurrentlock-freeGPU
 Summary | Full Text:PDF(831.2KB)

Throughput and Power Efficiency Evaluation of Block Ciphers on Kepler and GCN GPUs Using Micro-Benchmark Analysis
Naoki NISHIKAWA Keisuke IWAI Hidema TANAKA Takakazu KUROKAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/06/01
Vol. E97-D  No. 6 ; pp. 1506-1515
Type of Manuscript:  PAPER
Category: Fundamentals of Information Systems
Keyword: 
throughputpower efficiencyGPUOpenCLAESCamelliaSC2000KeplerGraphics Core Next
 Summary | Full Text:PDF(2MB)

Efficient Parallel Interference Cancellation MIMO Detector for Software Defined Radio on GPUs
Rongchun LI Yong DOU Jie ZHOU Chen CHEN 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2014/06/01
Vol. E97-A  No. 6 ; pp. 1388-1395
Type of Manuscript:  PAPER
Category: Digital Signal Processing
Keyword: 
GPUSDRMIMO detectorparallel interference cancellation (PIC)
 Summary | Full Text:PDF(1.5MB)

Accelerating Extended Hamming Code Decoders on Graphic Processing Units for High Speed Communication
Md Shohidul ISLAM Jong-Myon KIM 
Publication:   IEICE TRANSACTIONS on Communications
Publication Date: 2014/05/01
Vol. E97-B  No. 5 ; pp. 1050-1058
Type of Manuscript:  PAPER
Category: Wireless Communication Technologies
Keyword: 
error codingextended Hamming code decoderGPU
 Summary | Full Text:PDF(2.9MB)

An Efficient Parallel SOVA-Based Turbo Decoder for Software Defined Radio on GPU
Rongchun LI Yong DOU Jiaqing XU Xin NIU Shice NI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2014/05/01
Vol. E97-A  No. 5 ; pp. 1027-1036
Type of Manuscript:  PAPER
Category: Digital Signal Processing
Keyword: 
GPUCUDASDRTurbo decoderSOVA
 Summary | Full Text:PDF(2.2MB)

Probabilistic Frequent Itemset Mining on a GPU Cluster
Yusuke KOZAWA Toshiyuki AMAGASA Hiroyuki KITAGAWA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/04/01
Vol. E97-D  No. 4 ; pp. 779-789
Type of Manuscript:  Special Section PAPER (Special Section on Data Engineering and Information Management)
Category: 
Keyword: 
GPUuncertain databasesprobabilistic frequent itemsets
 Summary | Full Text:PDF(1.4MB)

Asynchronous Memory Machine Models with Barrier Synchronization
Koji NAKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2014/03/01
Vol. E97-D  No. 3 ; pp. 431-441
Type of Manuscript:  Special Section PAPER (Special Section on Foundations of Computer Science —New Trends in Theory of Computation and Algorithm—)
Category: Parallel and Distributed Computing
Keyword: 
memory machine modelsparallel algorithmscontiguous memory accessasynchronous modelsGPUCUDA
 Summary | Full Text:PDF(649.6KB)

Window Memory Layout Scheme for Alternate Row-Wise/Column-Wise Matrix Access
Lei GUO Yuhua TANG Yong DOU Yuanwu LEI Meng MA Jie ZHOU 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/12/01
Vol. E96-D  No. 12 ; pp. 2765-2775
Type of Manuscript:  PAPER
Category: Computer System
Keyword: 
window memory layout scheme (WMLS)alternate row-wise/column-wise matrix accessSDRAMGPUFPGA
 Summary | Full Text:PDF(1.4MB)

Optimal Parallel Algorithms for Computing the Sum, the Prefix-Sums, and the Summed Area Table on the Memory Machine Models
Koji NAKANO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/12/01
Vol. E96-D  No. 12 ; pp. 2626-2634
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: 
Keyword: 
memory machine modelsprefix-sums computationparallel algorithmGPUCUDA
 Summary | Full Text:PDF(631.2KB)

Offline Permutation Algorithms on the Discrete Memory Machine with Performance Evaluation on the GPU
Akihiko KASAGI Koji NAKANO Yasuaki ITO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/12/01
Vol. E96-D  No. 12 ; pp. 2617-2625
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: 
Keyword: 
memory machine modelsdata movementbank conflictshared memoryGPUCUDA
 Summary | Full Text:PDF(487.9KB)

Auto-Tuning of Thread Assignment for Matrix-Vector Multiplication on GPUs
Jinwei WANG Xirong MA Yuanping ZHU Jizhou SUN 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/11/01
Vol. E96-D  No. 11 ; pp. 2319-2326
Type of Manuscript:  PAPER
Category: Fundamentals of Information Systems
Keyword: 
GPUmatrix-vector multiplicationperformance tuningdense linear algebra
 Summary | Full Text:PDF(1.3MB)

Exploiting the Task-Pipelined Parallelism of Stream Programs on Many-Core GPUs
Shuai MU Dongdong LI Yubei CHEN Yangdong DENG Zhihua WANG 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/10/01
Vol. E96-D  No. 10 ; pp. 2194-2207
Type of Manuscript:  PAPER
Category: Computer System
Keyword: 
GPUtask-pipelinedynamic schedulingload balanceL2 cache
 Summary | Full Text:PDF(2.2MB)

High Throughput Parallelization of AES-CTR Algorithm
Nhat-Phuong TRAN Myungho LEE Sugwon HONG Seung-Jae LEE 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/08/01
Vol. E96-D  No. 8 ; pp. 1685-1695
Type of Manuscript:  PAPER
Category: Fundamentals of Information Systems
Keyword: 
AESmulti-coreGPUparallelization
 Summary | Full Text:PDF(1.9MB)

Fast and Robust 3D Correspondence Matching and Its Application to Volume Registration
Yuichiro TAJIMA Kinya FUDANO Koichi ITO Takafumi AOKI 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2013/04/01
Vol. E96-D  No. 4 ; pp. 826-835
Type of Manuscript:  Special Section PAPER (Special Section on Medical Imaging)
Category: Medical Image Processing
Keyword: 
CTMRIregistrationphase-only correlationGPU
 Summary | Full Text:PDF(1.7MB)

Parallel Sparse Cholesky Factorization on a Heterogeneous Platform
Dan ZOU Yong DOU Rongchun LI 
Publication:   IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2013/04/01
Vol. E96-A  No. 4 ; pp. 833-834
Type of Manuscript:  LETTER
Category: Algorithms and Data Structures
Keyword: 
sparse Cholesky factorizationsupernodal methodGPU
 Summary | Full Text:PDF(290.5KB)

Implementation of a GPU-Oriented Absorbing Boundary Condition for 3D-FDTD Electromagnetic Simulation
Keisuke DOHI Yuichiro SHIBATA Kiyoshi OGURI Takafumi FUJIMOTO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/12/01
Vol. E95-D  No. 12 ; pp. 2787-2795
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: Parallel and Distributed Computing
Keyword: 
absorbing boundary conditionperfectly matched layerFDTDGPU
 Summary | Full Text:PDF(700KB)

A Parallel Implementation of the Gustafson-Kessel Clustering Algorithm with CUDA
Jeong Bong SEO Dae-Won KIM 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/04/01
Vol. E95-D  No. 4 ; pp. 1162-1165
Type of Manuscript:  LETTER
Category: Artificial Intelligence, Data Mining
Keyword: 
clusteringGustafson-KesselCUDAGPU
 Summary | Full Text:PDF(505.9KB)

Evaluation of GPU-Based Empirical Mode Decomposition for Off-Line Analysis
Pulung WASKITO Shinobu MIWA Yasue MITSUKURA Hironori NAKAJO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/12/01
Vol. E94-D  No. 12 ; pp. 2328-2337
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: 
Keyword: 
Empirical Mode Decomposition (EMD)Hilbert-Huang Transform (HHT)GPUCUDA
 Summary | Full Text:PDF(909.5KB)

Computation-Communication Overlap of Linpack on a GPU-Accelerated PC Cluster
Junichi OHMURA Takefumi MIYOSHI Hidetsugu IRIE Tsutomu YOSHINAGA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/12/01
Vol. E94-D  No. 12 ; pp. 2319-2327
Type of Manuscript:  Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category: 
Keyword: 
parallel processingmulti-core processorGPUcomputation-communication overlap
 Summary | Full Text:PDF(636.5KB)

A Parallel Framework for Fast Photomosaics
Dongwann KANG Sang-Hyun SEO Seung-Taek RYOO Kyung-Hyun YOON 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/10/01
Vol. E94-D  No. 10 ; pp. 2036-2042
Type of Manuscript:  PAPER
Category: Computer Graphics
Keyword: 
non-photorealistic renderingphotomosaicsGPU
 Summary | Full Text:PDF(3.3MB)

Real-Time Object Detection Using Adaptive Background Model and Margined Sign Correlation
Ayaka YAMAMOTO Yoshio IWAI Hiroshi ISHIGURO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/02/01
Vol. E94-D  No. 2 ; pp. 325-335
Type of Manuscript:  PAPER
Category: Image Recognition, Computer Vision
Keyword: 
object detectionadaptive background modelmargined sign correlationreal-time systemGPU
 Summary | Full Text:PDF(2.6MB)

Acceleration of Computing the Kleene Star in Max-Plus Algebra Using CUDA GPUs
Hiroyuki GOTO 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2011/02/01
Vol. E94-D  No. 2 ; pp. 371-374
Type of Manuscript:  LETTER
Category: Fundamentals of Information Systems
Keyword: 
Kleene starmax-plus algebraadjacency matrixDAGGPUCUDA
 Summary | Full Text:PDF(399.1KB)

Accelerating Smith-Waterman Algorithm for Biological Database Search on CUDA-Compatible GPUs
Yuma MUNEKAWA Fumihiko INO Kenichi HAGIHARA 
Publication:   IEICE TRANSACTIONS on Information and Systems
Publication Date: 2010/06/01
Vol. E93-D  No. 6 ; pp. 1479-1488
Type of Manuscript:  Special Section PAPER (Special Section on Info-Plosion)
Category: Parallel and Distributed Architecture
Keyword: 
Smith-Waterman algorithmsequence alignmentaccelerationGPUCUDA
 Summary | Full Text:PDF(815.4KB)