United States Patent
7243372
Catherwood
July 10, 2007
Title
Modified Harvard architecture processor having data memory space mapped to program memory space with erroneous execution protection
Abstract
A processor has an architecture that provides the processing speed advantages of the Harvard architecture, but does not require two separate external memories in order to expand both data memory and program instruction memory. The processor has separate program memory space and data memory space, but provides the capability to map at least a portion of the program memory space to the data memory space. This allows most program instructions that are processed to obtain the speed advantages of simultaneous program instruction and data access. It also allows program memory space and data memory space to be expanded externally to the processor using only one external memory device that includes both program instructions and data. The processor includes a program memory space operable to store program instructions and data, a data memory space operable to store data, and mapping circuitry operable to map at least a portion of the program memory space to the data memory space. The program memory space may be internal to the processor. The processor may further comprise a page register operable to specify a location of the program memory space that is mapped to the data memory space.
Inventors:
Catherwood; Michael (Pepperell, MA)
Assignee:
Microchip Technology Incorporated (Chandler, AZ)
Appl. No.:
11/135,527
Filed:
May 23, 2005
PCT Pub Date:
July 10, 2007
Current U.S. Class:
726/23
714/35
714/38
Current International Class:
H04L 9/00 (20060101) G06F 11/00 (20060101) G06F 12/00 (20060101)
U.S. Patent Documents
20020194466
December 2002
Catherwood et al.
20030093656
May 2003
Masse et al.
3771146
November 1973
Cotton et al.
3781810
December 1973
Downing
3886524
May 1975
Appelt
3930253
December 1975
Maida
4025771
May 1977
Lynch, Jr. et al.
4074353
February 1978
Woods et al.
4090250
May 1978
Carlson et al.
4323981
April 1982
Nakamura
4379338
April 1983
Nishitani et al.
4398244
August 1983
Chu et al.
4408274
October 1983
Wheatley et al.
4451885
May 1984
Gerson et al.
4472788
September 1984
Yamazaki
4481576
November 1984
Bicknell
4488252
December 1984
Vassar
4511990
April 1985
Hagiwara et al.
4556938
December 1985
Parker et al.
4615005
September 1986
Maejima et al.
4626988
December 1986
George
4709324
November 1987
Kloker
4730248
March 1988
Watanabe et al.
4742479
May 1988
Kloker et al.
4768149
August 1988
Konopik et al.
4779191
October 1988
Greenblatt
4782457
November 1988
Cline
4800524
January 1989
Roesgen
4807172
February 1989
Nukiyama
4829420
May 1989
Stahle
4829460
May 1989
Ito
4839846
June 1989
Hirose et al.
4841468
June 1989
Miller et al.
4872128
October 1989
Shimizu
4882701
November 1989
Ishii
4926371
May 1990
Vassilliadis et al.
4941120
July 1990
Brown et al.
4943940
July 1990
New
4945507
July 1990
Ishida et al.
4959776
September 1990
Deerfield et al.
4977533
December 1990
Miyabayashi et al.
4984213
January 1991
Abdoo et al.
5007020
April 1991
Inskeep
5012441
April 1991
Retter
5032986
July 1991
Pathak et al.
5034887
July 1991
Yasui et al.
5038310
August 1991
Akagiri et al.
5040178
August 1991
Lindsay et al.
5056004
October 1991
Ohde et al.
5099445
March 1992
Studor et al.
5101484
March 1992
Kohn
5117498
May 1992
Miller et al.
5121431
June 1992
Wiener
5122981
June 1992
Taniguchi
5155823
October 1992
Tsue
5177373
January 1993
Nakamura
5197023
March 1993
Nakayama
5197140
March 1993
Balmer
5206940
April 1993
Murakami et al.
5212662
May 1993
Cocanougher et al.
5218239
June 1993
Boomer
5239654
August 1993
Ing-Simmons et al.
5276634
January 1994
Suzuki et al.
5282153
January 1994
Bartkowiak et al.
5327543
July 1994
Miura et al.
5327566
July 1994
Forsyth
5375080
December 1994
Davies
5379240
January 1995
Byrne
5386563
January 1995
Thomas
5392435
February 1995
Masui et al.
5418976
May 1995
Iida
5422805
June 1995
McIntyre et al.
5432943
July 1995
Mitsuishi
5448703
September 1995
Amini et al.
5448706
September 1995
Fleming et al.
5450027
September 1995
Gabara
5463749
October 1995
Wertheizer et al.
5469377
November 1995
Amano
5471600
November 1995
Nakamoto
5497340
March 1996
Uramoto et al.
5499380
March 1996
Iwata et al.
5504916
April 1996
Murakami et al.
5517436
May 1996
Andreas et al.
5525874
June 1996
Mallarapu et al.
5548544
August 1996
Matheny et al.
5561384
October 1996
Reents et al.
5561619
October 1996
Watanabe et al.
5564028
October 1996
Swoboda et al.
5568380
October 1996
Broadnax et al.
5568412
October 1996
Han et al.
5596760
January 1997
Ueda
5600813
February 1997
Nakagawa et al.
5611061
March 1997
Yasuda
5619711
April 1997
Anderson
5623646
April 1997
Clarke
5638524
June 1997
Kiuchi et al.
5642516
June 1997
Hedayat et al.
5649146
July 1997
Riou
5651121
July 1997
Davies
5657484
August 1997
Scarra
5659700
August 1997
Chen et al.
5682339
October 1997
Tam
5689693
November 1997
White
5694350
December 1997
Wolrich et al.
5696711
December 1997
Makineni
5701493
December 1997
Jaggar
5706460
January 1998
Craig et al.
5706466
January 1998
Dockser
5715470
February 1998
Asano et al.
5737570
April 1998
Koch
5740095
April 1998
Parant
5740419
April 1998
Potter
5740451
April 1998
Muraki et al.
5748516
May 1998
Goddard et al.
5748970
May 1998
Miyaji et al.
5764555
June 1998
McPherson et al.
5765216
June 1998
Weng et al.
5765218
June 1998
Ozawa et al.
5774711
June 1998
Henry et al.
5778237
July 1998
Yamamoto et al.
5778416
July 1998
Harrison et al.
5790443
August 1998
Shen et al.
5808926
September 1998
Gorshtein et al.
5812439
September 1998
Hansen
5812868
September 1998
Moyer et al.
5815693
September 1998
McDermott et al.
5825730
October 1998
Nishida et al.
5826072
October 1998
Knapp et al.
5826096
October 1998
Baxter
5828875
October 1998
Halvarsson et al.
5862065
January 1999
Muthusamy
5867726
February 1999
Ohsuga et al.
5875342
February 1999
Temple
5880984
March 1999
Burchfiel et al.
5892697
April 1999
Brakefield
5892699
April 1999
Duncan et al.
5894428
April 1999
Harada
5900683
May 1999
Rinehart et al.
5909385
June 1999
Nishiyama et al.
5917741
June 1999
Ng
5918252
June 1999
Chen et al.
5930159
July 1999
Wong
5930503
July 1999
Drees
5936870
August 1999
Im
5937199
August 1999
Temple
5938759
August 1999
Kamijo
5941940
August 1999
Prasad et al.
5943249
August 1999
Handlogten
5944816
August 1999
Dutton et al.
5951627
September 1999
Kiamilev et al.
5951679
September 1999
Anderson et al.
5974549
October 1999
Golan
5978825
November 1999
Divine et al.
5983333
November 1999
Kolagotla et al.
5991787
November 1999
Abel et al.
5991868
November 1999
Kamiyama et al.
5996067
November 1999
White
6009454
December 1999
Dummermuth
6014723
January 2000
Tremblay et al.
6018757
January 2000
Wong
6026489
February 2000
Wachi et al.
6044392
March 2000
Anderson et al.
6044434
March 2000
Oliver
6049858
April 2000
Kolagotla et al.
6055619
April 2000
North et al.
6058409
May 2000
Kozaki et al.
6058410
May 2000
Sharangpani
6058464
May 2000
Taylor
6061711
May 2000
Song et al.
6061780
May 2000
Shippy et al.
6061783
May 2000
Harriman
6076154
June 2000
Van Eijndhoven et al.
6084880
July 2000
Bailey et al.
6101521
August 2000
Kosiec
6101599
August 2000
Wright et al.
6115732
September 2000
Oberman et al.
6128728
October 2000
Dowling
6134574
October 2000
Oberman et al.
6144980
November 2000
Oberman
6145049
November 2000
Wong
6181151
January 2001
Wasson
6202163
March 2001
Gabzdyl et al.
6205467
March 2001
Lambrecht et al.
6209086
March 2001
Chi et al.
6243786
June 2001
Huang et al.
6243804
June 2001
Cheng
6260162
July 2001
Typaldos et al.
6282637
August 2001
Chan et al.
6292866
September 2001
Zaiki et al.
6295574
September 2001
MacDonald
6300800
October 2001
Schmitt et al.
6315200
November 2001
Silverbrook et al.
6356970
March 2002
Killian et al.
6377619
April 2002
Denk et al.
6397318
May 2002
Peh
6412081
June 2002
Koscal et al.
6487654
November 2002
Dowling
6523108
February 2003
James et al.
6564238
May 2003
Kim et al.
6633970
October 2003
Clift et al.
6658578
December 2003
Laurenti et al.
6681280
January 2004
Miyake et al.
6694398
February 2004
Zhao et al.
6728856
April 2004
Grosbach et al.
6751742
June 2004
Duhault et al.
6763478
July 2004
Bui
7069470
June 2006
Wilding et al.
Foreign Patent Documents
0 554 917
Aug., 1993
EP
0 855 643
Jul., 1998
EP
0 992 888
Dec., 2000
EP
0 992 889
Dec., 2000
EP
01037124
Feb., 1989
JP
96/11443
Apr., 1996
WO
Other References
Moon B I et al.: "A 32-bit RISC Microprocessor with DSP Functionality: Rapid Prototyping" IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, Institute of Electronics Information and Comm. Eng. Tokyo, JP, vol. E84-A No. 5, pp. 1339-1347, XP001060025 ISSN: 0916-8508, May 2001. cited by other .
Turley J: "Balancing Conflicting Requirements When Mixing RISC, DSPs" Computer Design, Pennwell Publ. Littleton, Massachusetts, IS, vol. 37, No. 10, pp. 46, 48, 50-53, XP000860706 ISSN:0010-4566, Oct. 1998. cited by other .
Levy M: "Microprocessor and DSP Technologies Unite for Embedded Applications" EDN Electrical Design News, Cahners Publishing Co., Newtown Massachusetts, US, No. Europe, pp. 73-74, 76, 78-80, XP000779113 ISSN: 0012-7515, Mar. 2, 1998. cited by other .
Intel, Pentium Processor Family Developer's Manual, vol. 3: Architecture and Programming Manual, pp. 3-1, 3-2, 3-15, 41-1 to 14-30, 18-7, and 25-289 to 25-292, 1995. cited by other .
Intel, Embedded Intel486 Processor Family Developer's Manual, pp. 2-2, 3-17, 3-37, 4-5, 4-6, 10-1 to 10-12, 12-1 to 12-10, Oct. 1997. cited by other .
Moore, M "Z80 Family Interrupt Structure". Barleywood (online), retrieved from the internet <URL: http://www.gaby.de/z80/1653.htm>, 1997. cited by other .
PCT Search Report based on PCT/US02/16706, 6 pages, mailed Sep. 27, 2002. cited by other .
PCT Search Report based on PCT/US02/16705, 7 pages, mailed Sep. 9, 2002. cited by other .
PCT Search Report based on PCT/US02/16921, 4 pages, mailed Oct. 18, 2002. cited by other .
SPARC, International, Inc., "The SPARC Architecture Manual", Version 8, pp. 1-303, 1992. cited by other .
Weaver, et al. SPARC International, Inc. "The SPARC Architecture Manual", Version 9, pp. xiv, 137, 146-147, 200-204, 221-222, 234-236, 299, 1994-2000. cited by other .
Free On-Line Dictionary of Computing (FOLDOC), htp://wombat.doc.ic.ac.uk/foldoc/Search term: program counter, 1995. cited by other .
Primary Examiner:
Revak; Christopher
Attorney, Agent or Firm:
Baker Botts L.L.P.
Parent Case Text
CROSS REFERENCE TO RELATED APPLICATION
This application is a continuation of U.S. patent application Ser. No. 09/870,460, which was filed on Jun. 1, 2001, by Michael Catherwood, entitled "Modified Harvard Architecture Processor Having Data Memory Space Mapped To Program Memory Space With Erroneous Execution Protection," which is now U.S. Pat. No. 7,007,172.
Claims
What is claimed is:
1. A method of operating a processor comprising the steps of: mapping at least a portion of a program memory space to a data memory space; storing an entry into the program memory space that is mapped to the data memory space, the entry comprising data and a protection opcode; fetching an entry from the program memory space; attempting to execute the fetched entry; trapping the protection opcode; vectoring to a trap handler; and executing the trap handler.
2. The method of claim 1, wherein the trap handler is an illegal instruction trap handler and the step of executing the trap handler comprises the steps of: determining that the opcode is a protection opcode; and executing a software routine to handle the trap.
3. The method of claim 1, wherein the trap handler is a protection trap handler.
4. The method of claim 3, wherein the program memory space is internal to the processor.
5. The method of claim 4, wherein the processor is operably connected to an external memory device operable to store program instructions and data, the external memory device comprising program memory space.
6. A processor comprising circuitry operable to: map at least a portion of a program memory space to a data memory space; store an entry into the program memory space that is mapped to the data memory space, the entry comprising data and a protection opcode; fetch an entry from the program memory space; attempt to execute the fetched entry; trap the protection opcode; vector to a trap handler; and execute the trap handler.
7. The processor of claim 6, wherein the trap handler is an illegal instruction trap handler and the execution of the trap handler comprises: determining that the opcode is a protection opcode; and executing a software routine to handle the trap.
8. The processor of claim 6, wherein the trap handler is a protection trap handler.
9. The processor of claim 8, wherein the program memory space is internal to the processor.
10. The processor of claim 9, wherein the processor is operably connected to an external memory device operable to store program instructions and data, the external memory device comprising program memory space.
Description
FIELD OF THE INVENTION
The present invention relates to a modified Harvard architecture processor having data memory space mapped to program memory space and having protection for erroneous execution of data entries in the program memory space.
BACKGROUND OF THE INVENTION
Processors, including microprocessors, digital signal processors and microcontrollers, operate by running software programs that are embodied in one or more series of program instructions stored in a memory. The processors run the software by fetching the program instructions from the series of program instructions, decoding the program instructions and executing them. In addition to program instructions, data is also stored in memory that is accessible by the processor. Generally, the program instructions process data by accessing data in memory, modifying the data and storing the modified data into memory.
One well-known architecture for processors is known as the Harvard architecture. In this architecture, data and program instructions are stored in separate memories that can be accessed simultaneously. Because of this simultaneous access, the Harvard architecture provides significant processing speed advantages over other architectures. A typical Harvard architecture processor that includes internal memory includes two separate memories, one for data, and one for program instructions. In order to expand the memory capacity of such a processor, memory external to the processor must be added. However, since a Harvard architecture processor has two separate memories, in order to expand both data memory and program instruction memory, two separate external memories must be added. This is a significant disadvantage when low-cost systems are being built.
A need arises for a processor having an architecture that provides the processing speed advantages of the Harvard architecture, but does not require two separate external memories in order to expand both data memory and program instruction memory. One solution to this problem is described in U.S. Pat. No. 6,728,856. The described processor has separate program memory space and data memory space, but provides the capability to map at least a portion of the program memory space to the data memory space. This allows most program instructions that are processed to obtain the speed advantages of simultaneous program instruction and data access. It also allows program memory space and data memory space to be expanded externally to the processor using only one external memory device that includes both program instructions and data.
However, a problem arises with this solution. Under some circumstances, the processor may fetch and attempt to execute an entry in the program memory space that has been mapped to the data memory space and which contains data, not a program instruction. Such a situation may occur, for example, as a result of a bug in the software that is being executed. Attempted execution of data that is not a program instruction may cause unpredictable results. A need arises for a technique by which attempted execution of data that is not a program instruction may be detected and recovered from.
SUMMARY OF THE INVENTION
The present invention is a method, and a processor implementing the method, that provides the capability to detect and recover from attempted execution of data that is not a program instruction in a processor in which at least a portion of a program memory space is mapped to a data memory space. This allows the processor to provide the speed advantages and expansion advantages without the risk of unpredictable program execution behavior.
According to the present invention, a method of operating a processor comprises the steps of: mapping at least a portion of a program memory space to a data memory space, storing an entry into the program memory space that is mapped to the data memory space, the entry comprising data and a protection opcode, fetching an entry from the program memory space, attempting to execute the fetched entry, trapping the protection opcode, vectoring to a trap handler, and executing the trap handler.
In one aspect of the present invention, the trap handler is an illegal instruction trap handler and the step of executing the trap handler comprises the steps of determining that the opcode is a protection opcode, and executing a software routine to handle the trap. The program memory space may be internal to the processor. The processor may be operably connected to an external memory device operable to store program instructions and data, the external memory device comprising program memory space.
In one aspect of the present invention, the trap handler is a protection trap handler. The program memory space may be internal to the processor. The processor may be operably connected to an external memory device operable to store program instructions and data, the external memory device comprising program memory space.
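The method summarized above can be illustrated with a minimal software sketch. It assumes the 24-bit entry layout given later in the detailed description (16 bits of data plus an 8-bit protection opcode). The opcode values, the placement of the opcode in the upper 8 bits, and every function name below are hypothetical and chosen only for illustration; the source does not specify them.

```python
# Hypothetical values: the source fixes neither the opcode value nor its bit position.
PROTECTION_OPCODE = 0xFF   # assumed 8-bit opcode reserved for data entries
NOP_OPCODE = 0x00          # assumed opcode of an ordinary instruction, for the demo


class ProtectionTrap(Exception):
    """Raised when the decoder meets the protection opcode."""


def make_data_entry(data16):
    """Pack 16 bits of data and the protection opcode into one 24-bit entry."""
    if not 0 <= data16 <= 0xFFFF:
        raise ValueError("data must fit in 16 bits")
    return (PROTECTION_OPCODE << 16) | data16


def read_as_data(entry24):
    """A data-space read sees only the 16 data bits of the entry."""
    return entry24 & 0xFFFF


def execute(entry24):
    """Attempt to execute an entry; trap if it carries the protection opcode."""
    opcode = (entry24 >> 16) & 0xFF
    if opcode == PROTECTION_OPCODE:
        raise ProtectionTrap(entry24)
    return "executed"


def trap_handler(pc):
    """Stand-in for the software routine that handles the trap."""
    return f"trap: attempted execution of data at 0x{pc:04X}"


def run(program_memory, pc):
    """Fetch the entry at pc, attempt to execute it, and vector to the handler on a trap."""
    entry = program_memory[pc]
    try:
        return execute(entry)
    except ProtectionTrap:
        return trap_handler(pc)
```

For example, with a program memory holding one ordinary instruction at 0x0100 and one data entry at 0x0200, `run` returns "executed" for the first address and the trap message for the second, while `read_as_data` still recovers the stored 16-bit value.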
BRIEF DESCRIPTION OF THE FIGURES
The above described features and advantages of the present invention will be more fully appreciated with reference to the detailed description and appended figures in which:
FIG. 1 depicts a functional block diagram of an embodiment of a processor chip within which the present invention may find application.
FIG. 2 depicts a functional block diagram of a data busing scheme for use in a processor 100, such as that shown in FIG. 1.
FIG. 3 depicts an exemplary memory map of a data space memory, which may be implemented in the processor shown in FIG. 2.
FIG. 4 depicts an exemplary block diagram of program memory space to data memory space mapping which may be implemented in the processor shown in FIG. 2, according to the present invention.
FIG. 5 depicts a block diagram of a data execution protection scheme, which may be implemented in the processor shown in FIG. 2, according to the present invention.
FIG. 6 depicts a processing flow diagram of a process for detection and handling of erroneous execution of a data entry, which may be implemented in the processor shown in FIG. 2, according to the present invention.
DETAILED DESCRIPTION
Overview of Processor Elements
FIG. 1 depicts a functional block diagram of an embodiment of a processor chip within which the present invention may find application. Referring to FIG. 1, a processor 100 is coupled to external devices/systems 140. The processor 100 may be any type of processor including, for example, a digital signal processor (DSP), a microprocessor, a microcontroller, or combinations thereof. The external devices 140 may be any type of systems or devices including input/output devices such as keyboards, displays, speakers, microphones, memory, or other systems which may or may not include processors. Moreover, the processor 100 and the external devices 140 may together comprise a stand alone system.
The processor 100 includes a program memory 105, an instruction fetch/decode unit 110, instruction execution units 115, data memory and registers 120, peripherals 125, data I/O 130, and a program counter and loop control unit 135. The bus 150, which may include one or more common buses, communicates data between the units as shown.
The program memory 105 stores software embodied in program instructions for execution by the processor 100. The program memory 105 may comprise any type of nonvolatile memory such as a read only memory (ROM), a programmable read only memory (PROM), an electrically programmable or an electrically programmable and erasable read only memory (EPROM or EEPROM) or flash memory. In addition, the program memory 105 may be supplemented with external nonvolatile memory 145 as shown to increase the complexity of software available to the processor 100. Alternatively, the program memory may be volatile memory, which receives program instructions from, for example, an external non-volatile memory 145. When the program memory 105 is nonvolatile memory, the program memory may be programmed at the time of manufacturing the processor 100 or prior to or during implementation of the processor 100 within a system. In the latter scenario, the processor 100 may be programmed through a process called in-line serial programming.
The instruction fetch/decode unit 110 is coupled to the program memory 105, the instruction execution units 115, and the data memory 120. Coupled to the program memory 105 and the bus 150 is the program counter and loop control unit 135. The instruction fetch/decode unit 110 fetches the instructions from the program memory 105 specified by the address value contained in the program counter 135. The instruction fetch/decode unit 110 then decodes the fetched instructions and sends the decoded instructions to the appropriate execution unit 115. The instruction fetch/decode unit 110 may also send operand information including addresses of data to the data memory 120 and to functional elements that access the registers.
The program counter and loop control unit 135 includes a program counter register (not shown) which stores an address of the next instruction to be fetched. During normal instruction processing, the program counter register may be incremented to cause sequential instructions to be fetched. Alternatively, the program counter value may be altered by loading a new value into it via the bus 150. The new value may be derived based on decoding and executing a flow control instruction such as, for example, a branch instruction. In addition, the loop control portion of the program counter and loop control unit 135 may be used to provide repeat instruction processing and repeat loop control as further described below.
The instruction execution units 115 receive the decoded instructions from the instruction fetch/decode unit 110 and thereafter execute the decoded instructions. As part of this process, the execution units may retrieve one or two operands via the bus 150 and store the result into a register or memory location within the data memory 120. The execution units may include an arithmetic logic unit (ALU) such as those typically found in a microcontroller. The execution units may also include a digital signal processing engine, a floating point processor, an integer processor, or any other convenient execution unit. A preferred embodiment of the execution units and their interaction with the bus 150, which may include one or more buses, is presented in more detail below with reference to FIG. 2.
The data memory and registers 120 are volatile memory and are used to store data used and generated by the execution units. The data memory 120 and program memory 105 are preferably separate memories for storing data and program instructions respectively. This format is known generally as a Harvard architecture. It is noted, however, that according to the present invention, the architecture may be a Von Neumann architecture or a modified Harvard architecture, which permits the use of some program space for data space. A dotted line is shown, for example, connecting the program memory 105 to the bus 150. This path may include logic for aligning data reads from program space such as, for example, during table reads from program space to data memory 120.
Referring again to FIG. 1, a plurality of peripherals 125 on the processor may be coupled to the bus 150. The peripherals may include, for example, analog to digital converters, timers, bus interfaces and protocols such as, for example, the controller area network (CAN) protocol or the Universal Serial Bus (USB) protocol and other peripherals. The peripherals exchange data over the bus 150 with the other units.
The data I/O unit 130 may include transceivers and other logic for interfacing with the external devices/systems 140. The data I/O unit 130 may further include functionality to permit in-circuit serial programming of the program memory through the data I/O unit 130.
FIG. 2 depicts a functional block diagram of a data busing scheme for use in a processor 100, such as that shown in FIG. 1, which has an integrated microcontroller arithmetic logic unit (ALU) 270 and a digital signal processing (DSP) engine 230. This configuration may be used to integrate DSP functionality to an existing microcontroller core. Referring to FIG. 2, the data memory 120 of FIG. 1 is implemented as two separate memories: an X-memory 210 and a Y-memory 220, each being respectively addressable by an X-address generator 250 and a Y-address generator 260. The X-address generator may also permit addressing the Y-memory space thus making the data space appear like a single contiguous memory space when addressed from the X address generator. The bus 150 may be implemented as two buses, one for each of the X and Y memory, to permit simultaneous fetching of data from the X and Y memories.
The W registers 240 are general purpose address and/or data registers. The DSP engine 230 is coupled to both the X and Y memory buses and to the W registers 240. Within a single processor cycle, the DSP engine 230 may simultaneously fetch data from each of the X and Y memories, execute instructions which operate on the simultaneously fetched data, write the result to an accumulator (not shown), and write a prior result to X or Y memory or to the W registers 240.
In one embodiment, the ALU 270 may be coupled only to the X memory bus and may only fetch data from the X bus. However, the X and Y memories 210 and 220 may be addressed as a single memory space by the X address generator in order to make the data memory segregation transparent to the ALU 270. The memory locations within the X and Y memories may be addressed by values stored in the W registers 240.
Any processor clocking scheme may be implemented for fetching and executing instructions. A specific example follows, however, to illustrate an embodiment of the present invention. Each instruction cycle is comprised of four Q clock cycles Q1-Q4. The four phase Q cycles provide timing signals to coordinate the decode, read, process data and write data portions of each instruction cycle.
According to one embodiment of the processor 100, the processor 100 concurrently performs two operations: it fetches the next instruction and executes the present instruction. Accordingly, the two processes occur simultaneously. The following sequence of events may comprise, for example, the fetch instruction cycle:
Q1: Fetch Instruction
Q2: Fetch Instruction
Q3: Fetch Instruction
Q4: Latch Instruction into prefetch register, Increment PC
The following sequence of events may comprise, for example, the execute instruction cycle for a single operand instruction:
Q1: latch instruction into IR, decode, and determine addresses of operand data
Q2: fetch operand
Q3: execute function specified by instruction and calculate destination address for data
Q4: write result to destination
The following sequence of events may comprise, for example, the execute instruction cycle for a dual operand instruction using a data pre-fetch mechanism. These instructions pre-fetch the dual operands simultaneously from the X and Y data memories and store them into registers specified in the instruction. They simultaneously allow instruction execution on the operands fetched during the previous cycle.
Q1: latch instruction into IR, decode, and determine addresses of operand data
Q2: pre-fetch operands into specified registers, execute operation in instruction
Q3: execute operation in instruction, calculate destination address for data
Q4: complete execution, write result to destination
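The overlap between the fetch cycle and the single operand execute cycle described above can be tabulated with a short sketch. The function name `timeline` and the instruction labels I0, I1, ... are hypothetical; the per-phase activities are abbreviated from the Q1-Q4 sequences in the text.

```python
# Per-phase activities, abbreviated from the Q1-Q4 sequences in the text.
FETCH_STEPS = {
    "Q1": "fetch instruction",
    "Q2": "fetch instruction",
    "Q3": "fetch instruction",
    "Q4": "latch into prefetch register, increment PC",
}
EXECUTE_STEPS = {
    "Q1": "latch into IR, decode, determine operand addresses",
    "Q2": "fetch operand",
    "Q3": "execute, calculate destination address",
    "Q4": "write result",
}


def timeline(num_instructions):
    """Return (cycle, q_phase, fetch_activity, execute_activity) rows.

    In instruction cycle c the processor fetches instruction c while it
    executes instruction c - 1, which was fetched one cycle earlier.
    """
    rows = []
    for cycle in range(num_instructions + 1):
        for q in ("Q1", "Q2", "Q3", "Q4"):
            fetch = f"I{cycle}: {FETCH_STEPS[q]}" if cycle < num_instructions else "-"
            execute = f"I{cycle - 1}: {EXECUTE_STEPS[q]}" if cycle >= 1 else "-"
            rows.append((cycle, q, fetch, execute))
    return rows
```

Calling `timeline(2)` yields twelve rows: the first cycle only fetches I0, the second fetches I1 while executing I0, and the third only executes I1.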
An exemplary memory map of data space memory 300 is shown in FIG. 3. Data space memory 300 includes a plurality of blocks of memory, divided into X address memory and Y address memory. Typically, data space memory 300 is implemented as random access read-write memory (RAM), so as to allow data to be read and written as necessary. However, read-only memory (ROM) may also be advantageously used for at least a portion of data space memory 300. For example, constant data values, look up tables, etc., may be usefully stored in ROM. In the example shown in FIG. 3, X address memory includes memory blocks 302, 304, 306, and 308, while Y address memory includes memory block 310. Data space memory 300 is split into two blocks, X address memory and Y address memory. A key element of this architecture is that the Y address memory space is a subset of the X address memory space, and is fully contained within the X address memory space. In order to provide an apparent linear addressing space, the X and Y address memory spaces would typically have contiguous addresses, although this is not an architectural necessity.
In the example shown in FIG. 3, memory block 302 includes a block of contiguous memory, starting at data memory location 0x0000. Memory block 302 is reserved in X address memory space and is directly addressable using memory direct instructions. The remaining X address memory and Y address memory spaces are indirectly addressable using other instructions. In the example shown in FIG. 3, Y address memory space 310 is located between two blocks of X address memory space, blocks 304 and 306. However, this is only an example, as the Y address memory space 310 may be located anywhere within the non-reserved X address memory space. The partition between the X and Y address spaces is arbitrary and is determined by the memory decode shown in FIG. 2. Both the X and Y address generators can generate any effective address (EA) within the range of data memory space 300. However, accesses to memory addresses that are in the other address space, or to memory addresses that are not implemented with physical memory, will return data of 0x0000 (all zeros).
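The read behavior described above can be sketched as a simple address decode. The partition boundaries, the amount of implemented memory, and all names below are assumptions made for illustration; the source states only that the Y block lies inside the non-reserved X space and that out-of-space or unimplemented reads return zero. The `linear` flag is likewise an assumed way to model the case, described earlier, in which the X address generator addresses X and Y memory as a single space.

```python
# Hypothetical partition; the source gives no concrete boundaries.
Y_START, Y_END = 0x1000, 0x1800   # assumed Y block, end exclusive
IMPLEMENTED_END = 0x2000          # assumed top of physically implemented memory

memory = [0] * IMPLEMENTED_END    # backing store for the implemented locations


def implemented(ea):
    """True if the effective address is backed by physical memory."""
    return 0 <= ea < IMPLEMENTED_END


def in_y_space(ea):
    """True if the effective address falls in the assumed Y block."""
    return Y_START <= ea < Y_END


def read_via_y(ea):
    """Read through the Y address generator; non-Y addresses read as zero."""
    if implemented(ea) and in_y_space(ea):
        return memory[ea]
    return 0x0000


def read_via_x(ea, linear=False):
    """Read through the X address generator.

    With linear=True the X generator treats X and Y memory as one
    contiguous space; otherwise Y addresses belong to the other
    address space and read as zero.
    """
    if not implemented(ea):
        return 0x0000
    if in_y_space(ea) and not linear:
        return 0x0000
    return memory[ea]
```

With a value stored at an X address and another at a Y address, each generator returns the stored value only for its own space, and any address above the implemented range reads as 0x0000.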
Memory block 308 is shown in FIG. 3 as being an X address memory block. Memory block 308, which includes at least a portion of data memory space 300, may be used as X address memory, Y address memory, or a mixture of X address memory and Y address memory. When used as X address memory, memory block 308 may be mapped into program memory space. This provides transparent access to constant data, such as stored constants, look up tables, etc., from the X address data memory space without the need to use special instructions. This feature allows the mapping of a portion of data memory space into an unused area of program memory, and since all unused internal addresses are mapped externally, to the external memory bus. This is shown in FIG.
4, which is an exemplary block diagram of the program memory space to data memory space mapping. Data memory space block 308, which is a portion of data memory space 300 is mapped to a data memory space page 402 in internal program memory space 404. The location of data memory space page 402 in internal program memory space 404 is specified by page register 406. Internal program memory space 404 is still used for program instruction access, as specified by program counter (PC) 408.
External memory device 410 is connected to the external memory bus 412 of the processor. External memory device 410 includes external program/data memory space 414. Since all unused internal addresses are mapped externally to the external memory bus, data memory space mapped page 402 is also mapped to external data memory space mapped page 416, which is located in external program/data memory space 414. If external memory device 410 is a RAM, then data may be read from and written to external data memory space mapped page 416. External program/data memory space 414 may also include external program memory space 418, which may be separate from external data memory space mapped page 416, or which may overlap with external data memory space mapped page 416.
Since the program memory space may include data that is used when a portion of the program memory space is mapped to the data memory space, there is some danger that the processor will erroneously fetch and attempt to execute that data. This may happen, for example, when there is a bug in a software program that is executing on the processor that sets the program counter (PC) to a memory location in the program memory space that happens to be storing data. This problem can arise when data is stored in internal program memory space and is even more likely to arise when data is stored in an external memory device. The present invention includes a mechanism for detecting such erroneous accesses and provides the capability to handle such errors.
A block diagram of the data execution protection scheme of the present invention is shown in FIG. 5. Data memory space 502, including a plurality of data entries 504, is mapped from a data memory block portion 506 of program memory space 508. Program memory space 508 also includes one or more blocks of program instructions, such as program instruction blocks 510 and 512. As shown, each data entry, such as data entry 514, in data memory space 502 includes 16 bits of data. Each program instruction entry, such as program instruction entry 516, in program memory space 508 includes 24 bits of program instruction. The entries in data memory block 506 of program memory space 508, such as entry 518, are likewise 24 bits. Since a data entry only requires 16 bits, such as data portion 520 of entry 518, 8 bits of each entry in data memory block 506 are not used by data and may be used for other functions. In the present invention, this other portion is used to contain a protection opcode 522, which allows erroneous execution of a data entry to be detected.
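By way of illustration only, the layout of a 24 bit entry holding a 16 bit data portion and an 8 bit protection opcode may be sketched in C as follows. The opcode value 0xFF and the names pack_entry, entry_data, and entry_opcode are assumptions made for this sketch; no particular opcode value is specified here.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical protection opcode value, chosen only for this sketch. */
#define PROTECTION_OPCODE 0xFFu

/* Build a 24-bit entry: protection opcode in bits 23..16, data in bits 15..0. */
uint32_t pack_entry(uint16_t data)
{
    return ((uint32_t)PROTECTION_OPCODE << 16) | data;
}

/* Data access sees only the lower 16 bits of the entry. */
uint16_t entry_data(uint32_t entry)
{
    return (uint16_t)(entry & 0xFFFFu);
}

/* Instruction fetch sees all 24 bits; the opcode is the upper 8 bits. */
uint8_t entry_opcode(uint32_t entry)
{
    assert(entry <= 0xFFFFFFu); /* entries are 24 bits wide */
    return (uint8_t)(entry >> 16);
}
```

For example, under these assumptions the data word 0x1234 would be stored as the entry 0xFF1234.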
A process 600 for detection and handling of erroneous execution of a data entry is shown in FIG. 6. The process begins with step 602, in which data is stored to the data memory space that was mapped from program memory space. This data is stored to the lower 16 bits of each entry that is used. In addition to the data that is stored, a protection opcode is stored to the upper 8 bits (byte) of each data entry that is used. Typically, the protection opcode will be stored when the data entry is stored. For example, since program memory is typically implemented using non-volatile memory, the program instructions stored in the program memory are stored to the program memory during the production process. The protection opcodes may easily be stored to the program memory by this step in the production process. This is true both for internal program memory and for non-volatile external memory.
In step 604, program memory space is mapped to data memory space by issuance of the proper program instructions. In step 606, the processor erroneously fetches and attempts to execute data that was stored in an entry in data memory space that was mapped from program memory space. Since the processor is fetching a program instruction, the processor treats the entry as a program instruction entry and fetches the entire 24 bits of the entry. The upper 8 bits of the entry are the protection opcode, while the lower 16 bits are the data in the entry. The processor attempts to execute the fetched entry, and in particular attempts to decode the protection opcode. In step 608, this attempted decode of the protection opcode causes a processor trap to occur. A trap can be considered to be a non-maskable, nestable interrupt. Traps provide a means by which erroneous operation can be corrected during software debug and during operation of the software.
Upon occurrence of a trap, the execution flow of the processor is vectored to a trap handler in step 610. That is, the program counter of the processor is loaded with a value that points to the trap handler. This value is stored in an exception vector table that includes vectors for a variety of exception conditions, such as reset, stack overflow, address error, illegal instruction trap, arithmetic error, etc. Each entry in the exception vector table points to an exception handler that takes the appropriate action upon occurrence of the corresponding exception. The trap handler is a software routine that takes the appropriate corrective action upon occurrence of the trapped condition.
In step 612, the trap handler deals with the error. Typically, the trap handler simply forces a reset of the processor. This would be done, for example, in an implementation in which a stand-alone application is executing in the processor. Since an attempt to execute a data entry is likely a result of a serious program error, performing a reset of the processor is often the best way of recovering from such an error. In an implementation in which there is an operating system controlling the processor, it may be possible to simply terminate the application program that caused the error and allow the operating system to recover from the error.
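By way of illustration only, steps 606 through 612 may be modeled in C as follows. The vector addresses, the opcode value 0xFF, the cpu_t structure, and the names fetch_and_decode and trap_handler_reset are assumptions made for this sketch, which models the processor in software rather than describing actual hardware.

```c
#include <assert.h>
#include <stdint.h>

#define PROTECTION_OPCODE 0xFFu /* hypothetical value for this sketch */

enum exception {
    EXC_RESET,
    EXC_STACK_OVERFLOW,
    EXC_ADDRESS_ERROR,
    EXC_ILLEGAL_INSTRUCTION,
    EXC_ARITHMETIC_ERROR,
    EXC_COUNT
};

/* Exception vector table; the handler addresses are invented. */
static const uint32_t vector_table[EXC_COUNT] = {
    [EXC_RESET]               = 0x000100u,
    [EXC_STACK_OVERFLOW]      = 0x000200u,
    [EXC_ADDRESS_ERROR]       = 0x000300u,
    [EXC_ILLEGAL_INSTRUCTION] = 0x000400u,
    [EXC_ARITHMETIC_ERROR]    = 0x000500u
};

typedef struct {
    uint32_t pc;      /* program counter */
    int      trapped; /* nonzero while a trap is being handled */
} cpu_t;

/* Steps 606-610: fetch the full 24-bit entry, decode its upper 8 bits,
 * and vector to the trap handler if they hold the protection opcode. */
void fetch_and_decode(cpu_t *cpu, const uint32_t *program_mem, uint32_t mem_len)
{
    assert(cpu->pc < mem_len);
    uint32_t entry  = program_mem[cpu->pc];
    uint8_t  opcode = (uint8_t)((entry >> 16) & 0xFFu);

    if (opcode == PROTECTION_OPCODE) {
        cpu->trapped = 1;
        cpu->pc = vector_table[EXC_ILLEGAL_INSTRUCTION];
        return;
    }
    cpu->pc += 1; /* stand-in for normal instruction execution */
}

/* Step 612: the typical handler simply forces a processor reset. */
void trap_handler_reset(cpu_t *cpu)
{
    cpu->trapped = 0;
    cpu->pc = vector_table[EXC_RESET];
}
```

In this model, a program counter that strays onto a data entry causes the next fetch to load the program counter from the exception vector table instead of executing the data.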
In a preferred embodiment, the illegal instruction trap vector is used to vector the processor to a routine that handles the attempted execution of a protection opcode. The protection opcode must be one of the possible 8 bit opcodes that is not used by any instruction of the processor. Attempted execution of this opcode will result in an illegal instruction trap. The illegal instruction trap handler must then examine the opcode that caused the illegal instruction trap, determine that the opcode is the protection opcode, and execute the appropriate software routines to handle the trap, which typically includes recovering from the error condition. Alternatively, there may be a defined protection trap that is separate from the illegal instruction trap. Attempted execution of the protection opcode will cause a protection trap to occur, rather than a general illegal instruction trap. Since the processor will have already determined the opcode that was attempted to be executed was the protection opcode, the protection trap then need only execute the appropriate software routines to handle the error condition.
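By way of illustration only, the two alternatives described above may be contrasted in C as follows. The opcode value 0xFF and the names select_vector and illegal_instruction_cause are assumptions made for this sketch. The sketch also assumes that the opcode passed to select_vector has already been found not to correspond to any instruction of the processor.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define PROTECTION_OPCODE 0xFFu /* hypothetical unused 8-bit opcode */

typedef enum { VEC_ILLEGAL_INSTRUCTION, VEC_PROTECTION } vector_t;
typedef enum { CAUSE_DATA_EXECUTION, CAUSE_OTHER_ILLEGAL_OPCODE } cause_t;

/* Processor side: choose the trap vector for an undecodable opcode.
 * With a dedicated protection trap, the processor itself recognizes
 * the protection opcode; otherwise every such opcode raises the
 * general illegal instruction trap. */
vector_t select_vector(uint8_t opcode, bool has_protection_trap)
{
    if (has_protection_trap && opcode == PROTECTION_OPCODE) {
        return VEC_PROTECTION;
    }
    return VEC_ILLEGAL_INSTRUCTION;
}

/* Software side (preferred embodiment): the illegal instruction trap
 * handler examines the faulting 24-bit entry to determine whether the
 * trap was caused by the protection opcode. */
cause_t illegal_instruction_cause(uint32_t faulting_entry)
{
    assert(faulting_entry <= 0xFFFFFFu); /* entries are 24 bits wide */
    uint8_t opcode = (uint8_t)(faulting_entry >> 16);
    return (opcode == PROTECTION_OPCODE) ? CAUSE_DATA_EXECUTION
                                         : CAUSE_OTHER_ILLEGAL_OPCODE;
}
```

The design difference is where the comparison against the protection opcode takes place: in the trap handler software in the preferred embodiment, or in the processor when a separate protection trap is defined.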
In the embodiment described above, internal program memory is organized as a plurality of 24 bit entries, each of which may contain a 16 bit data entry and an 8 bit protection opcode. The present invention also contemplates a number of additional and alternative embodiments. For example, an external memory may be used in which 24 bit entries are stored. In this embodiment, a 24 bit entry may contain a 16 bit data entry and an 8 bit protection opcode. If the external memory is a non-volatile memory, then the data entries and protection opcodes, along with any program instructions, may be stored in the external memory during the production process. If the external memory is a volatile memory, then the data entries and the protection opcodes must be stored to the external memory by the processor.
Alternatively, data entries may be stored in the external memory as 16 bit data entries, without protection opcodes. In this embodiment, the external memory may be connected to the processor using a memory bus configuration that is aware that the data entries are 16 bits. For example, the memory bus connected to the external memory may be 16 bits wide, rather than the 24 bits wide that would be needed for program instructions. As another example, the address range of the external memory that is mapped to data memory may be used by the processor to identify a portion of the external memory that is storing data entries rather than program entries. In either example, the processor can detect an attempted program instruction access of the external memory or the portion of external memory that is storing data entries. Upon detection of such an attempted access, the processor may directly perform a protection trap. Alternatively, the processor may simply force a protection opcode onto the top 8 bits of the program instruction bus, which will also cause a protection trap to be performed.
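By way of illustration only, the address range example, in which the processor forces a protection opcode onto the top 8 bits of the program instruction bus, may be sketched in C as follows. The address range bounds, the opcode value 0xFF, and the names is_data_region and instruction_bus_value are assumptions made for this sketch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define PROTECTION_OPCODE 0xFFu     /* hypothetical value for this sketch */
#define DATA_REGION_START 0x010000u /* assumed start of the data-only range */
#define DATA_REGION_END   0x017FFFu /* assumed end of the data-only range */

/* True if the program address falls in the range that is mapped to
 * data memory and therefore holds 16-bit data entries only. */
bool is_data_region(uint32_t program_addr)
{
    return program_addr >= DATA_REGION_START && program_addr <= DATA_REGION_END;
}

/* Value seen on the 24-bit program instruction bus for a fetch.
 * 'stored' is the word read from external memory at program_addr. */
uint32_t instruction_bus_value(uint32_t program_addr, uint32_t stored)
{
    assert(program_addr <= 0xFFFFFFu);
    if (is_data_region(program_addr)) {
        /* The memory holds no opcode byte, so force the protection
         * opcode onto the top 8 bits of the bus. */
        return ((uint32_t)PROTECTION_OPCODE << 16) | (stored & 0xFFFFu);
    }
    return stored & 0xFFFFFFu; /* ordinary 24-bit program instruction */
}
```

Under these assumptions, an instruction fetch from the data-only range always decodes as the protection opcode, even though the external memory itself stores only the 16 bit data entries.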
While specific embodiments of the present invention have been illustrated and described, it will be understood by those having ordinary skill in the art that changes may be made to those embodiments without departing from the spirit and scope of the invention. For example, the present invention has been described in terms of 16 bit data entries, 24 bit program instruction entries, and 8 bit opcodes. However, one of skill in the art will recognize that such specific values are only examples, and that other arrangements and numbers of bits may be used without departing from the spirit and scope of the invention. The present invention contemplates any and all such alternative arrangements and numbers of bits.
* * * * *