CN114116005B - Immediate data storage method based on AIGPU architecture - Google Patents
Immediate data storage method based on AIGPU architecture Download PDFInfo
- Publication number
- CN114116005B CN114116005B CN202111432949.5A CN202111432949A CN114116005B CN 114116005 B CN114116005 B CN 114116005B CN 202111432949 A CN202111432949 A CN 202111432949A CN 114116005 B CN114116005 B CN 114116005B
- Authority
- CN
- China
- Prior art keywords
- instruction
- immediate
- temporary
- helper
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/30181—Instruction operation extension or modification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/38—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
- G06F7/48—Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
- G06F7/57—Arithmetic logic units [ALU], i.e. arrangements or devices for performing two or more of the operations covered by groups G06F7/483 – G06F7/556 or for performing logical operations
- G06F7/575—Basic arithmetic logic units, i.e. devices selectable to perform either addition, subtraction or one of several logical operations, using, at least partially, the same circuitry
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/30—Arrangements for executing machine instructions, e.g. instruction decode
- G06F9/38—Concurrent instruction execution, e.g. pipeline or look ahead
- G06F9/3885—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units
- G06F9/3887—Concurrent instruction execution, e.g. pipeline or look ahead using a plurality of independent parallel functional units controlled by a single instruction for multiple data lanes [SIMD]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Computing Systems (AREA)
- Executing Machine-Instructions (AREA)
Abstract
Description
| Instructions | Types of | Operation code | Operand A | Operand B | Result register |
| Compound instruction | 2 | > | R A | R B | Temp&& |
| Compound instruction | 2 | <= | R C | R D | Temp&& |
| End instruction | 0 | == | R E | R F | R G |
Claims (5)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111432949.5A CN114116005B (en) | 2021-11-29 | 2021-11-29 | Immediate data storage method based on AIGPU architecture |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111432949.5A CN114116005B (en) | 2021-11-29 | 2021-11-29 | Immediate data storage method based on AIGPU architecture |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN114116005A CN114116005A (en) | 2022-03-01 |
| CN114116005B true CN114116005B (en) | 2022-12-23 |
Family
ID=80371331
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202111432949.5A Active CN114116005B (en) | 2021-11-29 | 2021-11-29 | Immediate data storage method based on AIGPU architecture |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN114116005B (en) |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5201056A (en) * | 1990-05-02 | 1993-04-06 | Motorola, Inc. | RISC microprocessor architecture with multi-bit tag extended instructions for selectively attaching tag from either instruction or input data to arithmetic operation output |
| US5925122A (en) * | 1996-08-30 | 1999-07-20 | Nec Corporation | Data processing unit which pre-fetches instructions of different lengths to conduct processing |
| US6539470B1 (en) * | 1999-11-16 | 2003-03-25 | Advanced Micro Devices, Inc. | Instruction decode unit producing instruction operand information in the order in which the operands are identified, and systems including same |
| CN1477520A (en) * | 2002-08-21 | 2004-02-25 | 先进数字芯片株式会社 | Central processor with extended instruction |
| CN1625731A (en) * | 2002-01-31 | 2005-06-08 | Arc国际公司 | Configurable data processor with multiple length instruction set architectures |
| US7047396B1 (en) * | 2000-06-22 | 2006-05-16 | Ubicom, Inc. | Fixed length memory to memory arithmetic and architecture for a communications embedded processor system |
| US7415599B1 (en) * | 2005-11-01 | 2008-08-19 | Zilog, Inc. | Instruction operation and operand memory location determined based on preceding instruction operation and operand memory location |
| JP2014160393A (en) * | 2013-02-20 | 2014-09-04 | Casio Comput Co Ltd | Microprocessor and arithmetic processing method |
| CN105677298A (en) * | 2015-12-30 | 2016-06-15 | 李朝波 | Method and device for extending immediate operand in computer instruction |
| CN108304217A (en) * | 2018-03-09 | 2018-07-20 | 中国科学院计算技术研究所 | The method that the instruction of long bit wide operands is converted into short bit wide operands instruction |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040193844A1 (en) * | 2003-03-24 | 2004-09-30 | Sun Microsystems, Inc. | Load and/or store queue emptying technique to facilitate atomicity in processor execution of helper set |
| US20110314263A1 (en) * | 2010-06-22 | 2011-12-22 | International Business Machines Corporation | Instructions for performing an operation on two operands and subsequently storing an original value of operand |
| US9110802B2 (en) * | 2010-11-05 | 2015-08-18 | Advanced Micro Devices, Inc. | Processor and method implemented by a processor to implement mask load and store instructions |
| CN102221987B (en) * | 2011-05-11 | 2014-10-01 | 西安电子科技大学 | Instruction set encoding method based on embedded special instruction set processor |
| US10061580B2 (en) * | 2016-02-25 | 2018-08-28 | International Business Machines Corporation | Implementing a received add program counter immediate shift (ADDPCIS) instruction using a micro-coded or cracked sequence |
| CN113656071B (en) * | 2021-10-18 | 2022-02-08 | 深圳市智想科技有限公司 | RISC architecture based CPU instruction set system and CPU system |
-
2021
- 2021-11-29 CN CN202111432949.5A patent/CN114116005B/en active Active
Patent Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5201056A (en) * | 1990-05-02 | 1993-04-06 | Motorola, Inc. | RISC microprocessor architecture with multi-bit tag extended instructions for selectively attaching tag from either instruction or input data to arithmetic operation output |
| US5925122A (en) * | 1996-08-30 | 1999-07-20 | Nec Corporation | Data processing unit which pre-fetches instructions of different lengths to conduct processing |
| US6539470B1 (en) * | 1999-11-16 | 2003-03-25 | Advanced Micro Devices, Inc. | Instruction decode unit producing instruction operand information in the order in which the operands are identified, and systems including same |
| US7047396B1 (en) * | 2000-06-22 | 2006-05-16 | Ubicom, Inc. | Fixed length memory to memory arithmetic and architecture for a communications embedded processor system |
| CN1625731A (en) * | 2002-01-31 | 2005-06-08 | Arc国际公司 | Configurable data processor with multiple length instruction set architectures |
| CN1477520A (en) * | 2002-08-21 | 2004-02-25 | 先进数字芯片株式会社 | Central processor with extended instruction |
| US7415599B1 (en) * | 2005-11-01 | 2008-08-19 | Zilog, Inc. | Instruction operation and operand memory location determined based on preceding instruction operation and operand memory location |
| JP2014160393A (en) * | 2013-02-20 | 2014-09-04 | Casio Comput Co Ltd | Microprocessor and arithmetic processing method |
| CN105677298A (en) * | 2015-12-30 | 2016-06-15 | 李朝波 | Method and device for extending immediate operand in computer instruction |
| CN108304217A (en) * | 2018-03-09 | 2018-07-20 | 中国科学院计算技术研究所 | The method that the instruction of long bit wide operands is converted into short bit wide operands instruction |
Non-Patent Citations (3)
| Title |
|---|
| ARM指令中合法立即数的判断方法;杜俊;《甘肃科技》;20141030(第20期);全文 * |
| GPU的发展历程、未来趋势及研制实践;熊庭刚;《微电子与智能制造》;20200630;第2卷(第2期);第36-40页 * |
| High-performance extendable instruction set computing;Heui Lee 等;《Proceedings 6th Australasian Computer Systems Architecture Conference. ACSAC 2001》;20020807;第89-94页 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN114116005A (en) | 2022-03-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9830156B2 (en) | Temporal SIMT execution optimization through elimination of redundant operations | |
| EP2483787B1 (en) | Efficient predicated execution for parallel processors | |
| CN109213522B (en) | Remote atomic operation instructions | |
| US20220004391A1 (en) | Processor having multiple cores, shared core extension logic, and shared core extension utilization instructions | |
| US8069340B2 (en) | Microprocessor with microarchitecture for efficiently executing read/modify/write memory operand instructions | |
| US8458677B2 (en) | Generating code adapted for interlinking legacy scalar code and extended vector code | |
| CN104205042A (en) | Context switching mechanism for processing cores with general purpose CPU cores and tightly coupled accelerators | |
| KR101772299B1 (en) | Instruction to reduce elements in a vector register with strided access pattern | |
| TWI659357B (en) | Manage instruction order in the processor pipeline | |
| Lazo et al. | Adaptable register file organization for vector processors | |
| CN115910207A (en) | Implementation of dedicated instructions for accelerating Smith-Waterman sequence alignment | |
| Wang et al. | Architecture and compiler support for gpus using energy-efficient affine register files | |
| JP5193624B2 (en) | Data processor | |
| CN114116005B (en) | Immediate data storage method based on AIGPU architecture | |
| HK40069733B (en) | Immediate data storage method based on aigpu architecture | |
| US12572357B2 (en) | Vector mask buffers in a vector instruction execution pipeline | |
| HK40069733A (en) | Immediate data storage method based on aigpu architecture | |
| JP4444305B2 (en) | Semiconductor device | |
| CN112130970A (en) | Hardware support for dual memory atomic operations | |
| CN115905786A (en) | Techniques for Accelerated Smith-Wortman Sequence Alignment | |
| CN117136357A (en) | Inter-cluster shared data management in sub-NUMA clusters | |
| CN114281414B (en) | Data writing method of URF register in AIGPU architecture | |
| JP5610551B2 (en) | Data processor | |
| US20260086718A1 (en) | Systems and methods for providing extended memory access in a parallel processor | |
| HK40070897A (en) | Data writing method of urf register in aigpu architecture |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40069733 Country of ref document: HK |
|
| CB03 | Change of inventor or designer information | ||
| CB03 | Change of inventor or designer information |
Inventor after: Hou Hongchao Inventor after: Ge Jianming Inventor after: Sang Yongqi Inventor after: Zhao Peng Inventor after: Wang Donghui Inventor after: Yao Fei Inventor after: Qiu Xiaogang Inventor before: Hou Hongchao Inventor before: Ge Jianming Inventor before: Sang Yongqi Inventor before: Zhao Peng Inventor before: Wang Donghui Inventor before: Yao Fei Inventor before: Qiu Xiaogang |
|
| GR01 | Patent grant | ||
| GR01 | Patent grant |