Crusoe Processors

  • Uploaded by: api-3760105
  • 0
  • 0
  • November 2019
  • PDF

This document was uploaded by user and they confirmed that they have the permission to share it. If you are author or own the copyright of this book, please report to us by using this DMCA report form. Report DMCA


Overview

Download & View Crusoe Processors as PDF for free.

More details

  • Words: 707
  • Pages: 27
THE TECHNOLOGY BEHIND CRUSOE PROCESSORS CHETAN D

introduction 

The Crusoe™ processors are an x86compatible family of solutions



It combines

• strong performance • remarkably low power consumption



It is fundamentally software based



Power savings come from replacing large numbers of transistors with software.

Fundamentals  The

• •

Crusoe processor consists of

a hardware engine logically surrounded by a software layer

• The engine is a very long instruction word (VLIW) CPU capable of executing up to four operations in each clock cycle

Instruction Molecule

 it

has been designed purely for fast lowpower implementation

 It

uses conventional CMOS fabrication.

 The

surrounding software layer gives x86 programs the impression that they are running on x86 hardware.

Code morphing software 

The software layer Morphing™ software.

is

called

Code



It dynamically “morphs” x86 instructions into VLIW instructions



Code Morphing software includes a number of advanced features to achieve good system-level performance.

 The

Code Morphing software is fundamentally a dynamic translation system  A program that compiles instructions for one instruction set architecture (in this case, the x86 target ISA) into instructions for another ISA (the VLIW host ISA).

 The

Code Morphing software resides in a ROM and is the first program to start executing when the processor boots.  The Code Morphing Software supports ISA, and is the only thing x86 code sees; the only program written directly for the VLIW engine is the Code Morphing software itself.

Code morphing software

Mobile PII Mobile PII Mobile PIII

TM3120

TM5400

Process

.25m

.25m shrink

.18m

.22m

.18m

On-chip L1 Cache

32KB

32KB

32KB

96KB

128KB

On-chip L1 Cache

0

256KB

256KB

0

256KB

Die Size

130mm

180mm

106mm

77mm

73mm

Deco din g an d S chedul ing 1.

Conventional x86 superscalar processors fetch x86 binary instructions from memory

3.

Decode them into micro-operations

5.

These are reordered by out-of-order dispatch hardware

7.

Fed to the functional units for parallel execution.

TRANSLATION 

The software translates instructions once,



Then saves the resulting translation in a translation cache



The next time the (now translated) x86 code is executed, the system skips the translation step and directly executes the existing optimized translation.

TRANSLATION 

The translation cache, along with the Code Morphing code, resides in a separate memory space that is inaccessible to x86 code.



The size of this memory space can be

• •

Set at boot time, or The operating system adjustable.

can

make

the

size

TRA NSL ATIO N  Molecules

are executed in-order by the

hardware  But

the work of the original x86 instructions is performed out of order.

TRA NSL ATIO N 

Molecules explicitly encode instruction-level parallelism

the



Hence they can be executed by a simple, fast and low-power VLIW engine



The hardware need not perform any complex instruction reordering itself.

Features of CRUSOE  CRUSOE

translation does not have unconditional JMP like x86

 The

path selector simply “follows” the branch and continues translation at the target of the JMP.



The translator has replaced the two internal conditional branches with “select” instructions (which conditionally pick one of two results)



In effect, the Code Morphing system is speculatively executing both legs of a branch and picking the correct result later.



This greatly increases efficiency

 Registers

have been aggressively renamed in software

 There

is no need for a complex (and power consuming) register renamer in hardware.

 The

scheduler has rearranged the instructions to execute out of order relative to the original x86 “source” code.



The Crusoe alias hardware has been used in the translation (in molecules 5 through 8) to hoist loads above stores and thus pack the code more effectively

Application The Crusoe processor solutions have been designed for lightweight (two to four pound) mobile computers and Internet access devices such as handhelds and web pads.  They can give these devices PC capabilities and unplugged running times of up to a day. 

Advantages  Saves

millions of logic transistors and cuts power consumption on the order of 60–70% over conventional approaches

 Achieves

low power consumption without sacrificing high performance for realworld applications.

QUESTIONS ?? ?

TH AN K YOU

Related Documents

Crusoe Processors
November 2019 16
Processors
June 2020 8
Robinson Crusoe
November 2019 26
Crusoe Processor
July 2020 7
Robison Crusoe
November 2019 9
Robinson Crusoe
July 2020 10