Dynamic Multi-bit Parallel Computing Method Based on Reconfigurable Structure

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14488))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

329 Accesses

Abstract

Reconfigurable architecture has great potential in computation-intensive and memory-intensive applications due to its flexible information configuration. Aiming at the problem of low computing efficiency caused by the inconsistency between different granularity data and the underlying hardware structure in applications such as communication baseband signal processing, a parallel computing method supporting multi-bit data is proposed, and a dynamic granularity configuration structure used this method is designed based on reconfigurable array processors. The structure divides the calculation granularity into 8 bits, 16 bits, and 32 bits, and realizes four functions: data-combination, data-splitting, parallel-addition, and parallel-multiplication. These features increase the parallelism and flexibility of array structures. The experimental results show that the speedup ratio can reach 1.5 within a certain error range, the running time is reduced by about 20%, and the code complexity is also significantly reduced. In addition, the maximum operating frequency of the dynamic configuration circuit is 133.5 MHz by FPGA comprehensive implementation, which can realize the dynamic configuration of different granularity data in the calculation and achieve parallel computing of multi-bit data.

Supported by National Key R &D Program of China

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Reconfigurable Arithmetic for High-Performance Computing

A Reconfigurable Functional Unit for Modular Operations

Dynamic Scheduling of Pipelined Functional Units in Coarse-Grained Reconfigurable Array Elements

References

Lu, Y., Liu, L., Zhu, J., et al.: Architecture, challenges and applications of dynamic reconfigurable computing. J. Semicond. 41(2), 4–13 (2020)
Article Google Scholar
Chiu, J.-C., Yan, Z.-Y., Liu, Y.-C.: Design and implementation of the CNN accelator based on multi-streaming SIMD mechanisms. In: Hsieh, S.-Y., Hung, L.-J., Klasing, R., Lee, C.-W., Peng, S.-L. (eds.) New Trends in Computer Technologies and Applications: 25th International Computer Symposium, ICS 2022, Taoyuan, Taiwan, December 15–17, 2022, Proceedings, pp. 460–473. Springer Nature Singapore, Singapore (2022). https://doi.org/10.1007/978-981-19-9582-8_40
Chapter Google Scholar
Sharma, H., Park, J., Suda, N.: Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks. In: ACM/IEEE 45th annual international symposium on computer architecture (ISCA). IEEE 2018, 764–775 (2018)
Google Scholar
Moss, D.J., Krishnan, S., Nurvitadhi, E., et al.: A customizable matrix multiplication framework for the intel harpv2 xeon+fpga platform: a deep learning case study. In: 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays. ACM, pp. 107–116 (2018)
Google Scholar
Faraone, J. Kumm, M., Hardieck, M., et al.: AddNet: deep neural networks using fpga-optimized multipliers. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 28(1), 115–128 (2020)
Google Scholar
Tang, S.N.: Area-efficient parallel multiplication units for CNN accelerators with output channel parallelization. IEEE Trans. Very Large Scale Integr. (VLSI) Systems. 31(3), 406–410 (2023)
Google Scholar
Sun, M., Li, Z., Lu, A., et al.: FILM-QNN: efficient FPGA acceleration of deep neural networks with intra-layer, mixed-precision quantization. In: Proceedings of the 28th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA), pp. 134–145 (2022)
Google Scholar
Wang, N., Nia, J., Li, J., et al.: A compression strategy to accelerate LSTM meta-learning on FPGA. ICT Express 8(3), 322–327 (2022)
Article Google Scholar
Nataraj Urs, H.D., Venkata Siva Reddy, R., Gudodagi, R., et al.: A novel algorithm for reconfigurable architecture for software-defined radio receiver on baseband processor for demodulation. Sustainable Computing. Springer, Cham, pp. 187–206 (2023). https://doi.org/10.1007/978-3-031-13577-4_11
Umuroglu, Y., Conficconi, D., Rasnayake, L., et al.: Optimizing bit-serial matrix multiplication for reconfigurable computing. ACM Trans. Reconfigurable Technol. Syst. (TRETS) 12(3), 1–24 (2019)
Article Google Scholar
Liu, K., Tian, Z., Li, Z., et al.: RfLoc: a reflector-assisted indoor localization system using a single-antenna AP. IEEE Trans. Instrum. Meas. 70(3), 1–16 (2021)
Google Scholar
Wang, A., Xu, W., Sun, H., et al.: Arrhythmia classifier using binarized convolutional neural network for resource-constrained devices. In: 2022 4th International Conference on Communications, Information System and Computer Engineering (CISCE), Shenzhen, China, 2022, pp. 213–220 (2022)
Google Scholar
Stepchenkov, Y.A., Khilko, D.V., Shikunov, Y.I.: Filter kernels preliminary benchmarking, DSP, for recurrent data-flow architecture. In: IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (ElConRus). IEEE 2021, pp. 2040–2044 (2021)
Google Scholar
Deng, J., Jiang, L., Zhu, Y., et al.: HRM: H-tree based reconfiguration mechanism in reconfigurable homogeneous PE array. J. Semiconductors. 41(2), 1–9 (2020)
Google Scholar
Shan, R., Jiang, L., Wu, H., He, F., Liu, X.: Dynamical self-reconfigurable mechanism for data-driven cell array. J. Shanghai Jiaotong Univ. (Science) 26(4), 511–521 (2021). https://doi.org/10.1007/s12204-021-2319-z
Article Google Scholar
Maki, A., Miyashita, D., Nakata, K., et al.: FPGA-based CNN processor with filter-wise-optimized bit precision. In: 2018 IEEE Asian Solid-State Circuits Conference (A-SSCC). IEEE, pp. 47–50 (2018)
Google Scholar
Chen, Y., Du, H., Chang, L.: A reconfigurable micro-processing element for mixed precision CNNs. In: 2022 14th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA). IEEE, pp. 1–5 (2022)
Google Scholar
Liu, W., Liao, Q., Qiao, F., et al.: Approximate designs for fast Fourier transform (FFT) with application to speech recognition. IEEE Trans. Circuits Syst. I Regul. Pap. 66(12), 4727–4739 (2019)
Article Google Scholar

Download references

Acknowledgements

This work was supported by National Key R &D Program of China (2022ZD0119001); Key projects of National Natural Science Foundation of China (61834005).

Author information

Authors and Affiliations

Xi’an University of Science and Technology, Xi’an, 710600, Shaanxi, China
Lin Jiang, Shuai Liu, Jiayang Zhu & Yuancheng Li
Xi’an University of Posts and Telecommunications, Xi’an, 710100, Shaanxi, China
Rui Shan

Authors

Lin Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Liu
View author publications
You can also search for this author in PubMed Google Scholar
Jiayang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Rui Shan
View author publications
You can also search for this author in PubMed Google Scholar
Yuancheng Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jiayang Zhu .

Editor information

Editors and Affiliations

Royal Melbourne Institute of Technology, Melbourne, VIC, Australia
Zahir Tari
Tianjin University, Tianjin, China
Keqiu Li
University of Arizona, Tucson, AZ, USA
Hongyi Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jiang, L., Liu, S., Zhu, J., Shan, R., Li, Y. (2024). Dynamic Multi-bit Parallel Computing Method Based on Reconfigurable Structure. In: Tari, Z., Li, K., Wu, H. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2023. Lecture Notes in Computer Science, vol 14488. Springer, Singapore. https://doi.org/10.1007/978-981-97-0801-7_20

Download citation

DOI: https://doi.org/10.1007/978-981-97-0801-7_20
Published: 01 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0800-0
Online ISBN: 978-981-97-0801-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Dynamic Multi-bit Parallel Computing Method Based on Reconfigurable Structure

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Reconfigurable Arithmetic for High-Performance Computing

A Reconfigurable Functional Unit for Modular Operations

Dynamic Scheduling of Pipelined Functional Units in Coarse-Grained Reconfigurable Array Elements

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Dynamic Multi-bit Parallel Computing Method Based on Reconfigurable Structure

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Reconfigurable Arithmetic for High-Performance Computing

A Reconfigurable Functional Unit for Modular Operations

Dynamic Scheduling of Pipelined Functional Units in Coarse-Grained Reconfigurable Array Elements

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation