LLX > Neil Parker > Apple II > 6502 Instruction Set
Though the 6502 instruction set has a number of quirks and irregularities, large portions of it can be broken up into regular patterns. An understanding of these patterns can be beneficial to authors of assemblers or disassemblers for 6502 code--for example, the Apple II ROM uses the information described below to greatly reduce the size of the instruction tables used by the built-in machine language disassembler.
Note that the discussion below assumes a knowledge of 6502 programming. If you're looking for a tutorial or general programming reference for the 6502, I recommend starting at 6502.org. There are also some useful documents at Western Design Center.
Shown below are the instructions of the 6502, 65C02, and 65C816 processors. GREEN UPPERCASE indicates instructions found on all processors; Yellow Mixed Case indicates instructions introduced on the 65C02, and red lowercase indicates instructions found only on the 65C816. The bit manipulation instructions found only on the Rockwell and WDC versions of the 65C02 are not included in the table, nor are the "undocumented" instructions of the original 6502. (However, after noting the search engine strings commonly used to locate this page, I have added discussions of these points below.)
|0x||BRK b||ORA (d,X)||cop b||ora d,S||Tsb d||ORA d||ASL d||ora [d]||PHP||ORA #||ASL A||phd||Tsb a||ORA a||ASL a||ora al|
|1x||BPL r||ORA (d),Y||Ora (d)||ora (d,S),Y||Trb d||ORA d,X||ASL d,X||ora [d],Y||CLC||ORA a,Y||Inc A||tcs||Trb a||ORA a,X||ASL a,X||ora al,X|
|2x||JSR a||AND (d,X)||jsl al||and d,S||BIT d||AND d||ROL d||and [d]||PLP||AND #||ROL A||pld||BIT a||AND a||ROL a||and al|
|3x||BMI r||AND (d),Y||And (d)||and (d,S),Y||Bit d,X||AND d,X||ROL d,X||and [d],Y||SEC||AND a,Y||Dec A||tsc||Bit a,X||AND a,X||ROL a,X||and al,X|
|4x||RTI||EOR (d,X)||wdm||eor d,S||mvp s,d||EOR d||LSR d||eor [d]||PHA||EOR #||LSR A||phk||JMP a||EOR a||LSR a||eor al|
|5x||BVC r||EOR (d),Y||Eor (d)||eor (d,S),Y||mvn s,d||EOR d,X||LSR d,X||eor [d],Y||CLI||EOR a,Y||Phy||tcd||jmp al||EOR a,X||LSR a,X||eor al,X|
|6x||RTS||ADC (d,X)||per rl||adc d,S||Stz d||ADC d||ROR d||adc [d]||PLA||ADC #||ROR A||rtl||JMP (a)||ADC a||ROR a||adc al|
|7x||BVS r||ADC (d),Y||Adc (d)||adc (d,S),Y||Stz d,X||ADC d,X||ROR d,X||adc [d],Y||SEI||ADC a,Y||Ply||tdc||Jmp (a,X)||ADC a,X||ROR a,X||adc al,X|
|8x||Bra r||STA (d,X)||brl rl||sta d,S||STY d||STA d||STX d||sta [d]||DEY||Bit #||TXA||phb||STY a||STA a||STX a||sta al|
|9x||BCC r||STA (d),Y||Sta (d)||sta (d,S),Y||STY d,X||STA d,X||STX d,Y||sta [d],Y||TYA||STA a,Y||TXS||txy||Stz a||STA a,X||Stz a,X||sta al,X|
|Ax||LDY #||LDA (d,X)||LDX #||lda d,S||LDY d||LDA d||LDX d||lda [d]||TAY||LDA #||TAX||plb||LDY a||LDA a||LDX a||lda al|
|Bx||BCS r||LDA (d),Y||Lda (d)||lda (d,S),Y||LDY d,X||LDA d,X||LDX d,Y||lda [d],Y||CLV||LDA a,Y||TSX||tyx||LDY a,X||LDA a,X||LDX a,Y||lda al,X|
|Cx||CPY #||CMP (d,X)||rep #||cmp d,S||CPY d||CMP d||DEC d||cmp [d]||INY||CMP #||DEX||wai||CPY a||CMP a||DEC a||cmp al|
|Dx||BNE r||CMP (d),Y||Cmp (d)||cmp (d,S),Y||pei d||CMP d,X||DEC d,X||cmp [d],Y||CLD||CMP a,Y||Phx||stp||jml (a)||CMP a,X||DEC a,X||cmp al,X|
|Ex||CPX #||SBC (d,X)||sep #||sbc d,S||CPX d||SBC d||INC d||sbc [d]||INX||SBC #||NOP||xba||CPX a||SBC a||INC a||sbc al|
|Fx||BEQ r||SBC (d),Y||Sbc (d)||sbc (d,S),Y||pea a||SBC d,X||INC d,X||sbc [d],Y||SED||SBC a,Y||Plx||xce||jsr (a,X)||SBC a,X||INC a,X||sbc al,X|
Most instructions that explicitly reference memory locations have bit patterns of the form aaabbbcc. The aaa and cc bits determine the opcode, and the bbb bits determine the addressing mode.
Instructions with cc = 01 are the most regular, and are therefore considered first. The aaa bits determine the opcode as follows:
And the addressing mode (bbb) bits:
Putting it all together:
The only irregularity is the absence of the nonsensical immediate STA instruction.
Next we consider the cc = 10 instructions. These have a completely different set of opcodes:
The addressing modes are similar to the 01 case, but not quite the same:
Note that bbb = 100 and 110 are missing. Also, with STX and LDX, "zero page,X" addressing becomes "zero page,Y", and with LDX, "absolute,X" becomes "absolute,Y".
These fit together like this:
Most of the gaps in this table are easy to understand. Immediate mode makes no sense for any instruction other than LDX, and accumulator mode for DEC and INC didn't appear until the 65C02. The slots that "STX A" and "LDX A" would occupy are taken by TXA and TAX respectively, which is exactly what one would expect. The only inexplicable gap is the absence of a "STX abs,Y" instruction.
Next, the cc = 00 instructions. Again, the opcodes are different:
It's debatable whether the JMP instructions belong in this list...I've included them because they do seem to fit, provided one considers the indirect JMP a separate opcode rather than a different addressing mode of the absolute JMP.
The addressing modes are the same as the 10 case, except that accumulator mode is missing.
And here's how they fit together:
Some of the gaps in this table are understandable (e.g. the lack of an immediate mode for JMP, JMP(), and STY), but others are not (e.g. the absence of "zp,X" for CPY and CPX, and the absence of "abs,X" for STY, CPY, and CPX). Note that if accumulator mode (bbb = 010) were available, "LDY A" would be A8, which falls in the slot occupied by TAY, but the pattern breaks down elsewhere--TYA is 98, rather than 88, which we would expect it to be if it corresponded to the nonexistant "STY A".
No instructions have the form aaabbb11.
The conditional branch instructions all have the form xxy10000. The flag indicated by xx is compared with y, and the branch is taken if they are equal.
This gives the following branches:
The remaining instructions are probably best considered simply by listing them. Here are the interrupt and subroutine instructions:
(JSR is the only absolute-addressing instruction that doesn't fit the aaabbbcc pattern.)
Other single-byte instructions:
The time required for 6502 instruction to execute is regular and predictable. The primary rule is this:
Each byte read from or written to memory requires one clock cycle.
However, that simple rule doesn't account for everything--there are a number of cases when instructions require more clock cycles than the one-cycle-per-byte rule indicates. (Most of these exceptions arise because the next "real" memory access can't occur until some internal operation finishes first.)
The above-described instructions (the ones shown in GREEN UPPERCASE in the table at the top of this page) are the only ones documented in any manufacturer's official data sheets. The question often arises, "What do all those other leftover bytes do if you try to execute them as instructions?"
In general the behavior of instructions other than those listed above cannot be described exactly, as they tend to be somewhat unstable, and do not always behave the same way on chips made by different manufacturers, and some instructions don't even behave the same way twice on the same chip. Those looking for a precise listing of "undocumented" instruction behaviors will have to look elsewhere, and should beware that the behaviors described on other web pages may be specific to 6502s made by a particular (often unspecified) manufacturer.
However, there are some facts that seem to be common across all 6502s. The most insteresting case is the cc = 11 instructions: these execute the adjacent cc = 01 and cc = 10 instructions simultaneously. For example, AF executes AD ("LDA absolute") and AE ("LDX absolute") at the same time, putting the same value in both the accumulator and the X register.
In some cases the 01 and 10 instructions are incompatible. For example, 8F executes 8D ("STA absolute") and 8E ("STX absolute") at the same time. So which register actually gets written to memory? Usually some mixture of the two, in a manner that varies depending on who made the 6502, when it was made, the phase of the moon, and other unpredictable variables.
The behavior of the 11 instructions is especially problematic in those cases where the adjacent 01 or 10 instruction is also undocumented. Sometimes you can get a partial idea of what happens by looking at what the missing 01 or 10 instruction would be if that opcode/addressing mode combination weren't missing. Xxxx1011 instructions are also problematic--some of these seem to mix not only the adjacent 01 and 10 instructions, but also the immediate mode of the corresponding 10 instruction.
Most of the missing 00, 01, and 10 instructions seem to behave like NOPs, but using the addressing mode indicated by the bbb bits. But apparently this isn't always reliable--there are reports of some of these instructions occasionally locking up the processor.
Instructions of the form xxxx0010 usually lock up the processor, so that a reset is required to recover. The instructions 82, C2, and E2 (corresponding to the nonexistant immediate mode of STX, DEC, and INC) may sometimes behave as two-byte NOPs, but don't count on it.
The new instructions of the 65C02 are much less logical than those listed above. The designers of the 65C02 apparently chose to continue leaving the cc = 11 instructions empty, and this didn't leave much space for new instructions. Some instructions landed in logical places, but others had to be assigned wherever there was room, whether it made sense or not.
The new zero-page indirect addressing mode fills the previously-unused bbb = 100 slot of the cc = 10 instructions, but the opcodes are those of the cc = 01 instructions.
"JMP (abs,X)" is right where it ought to be (011 111 00), if one continues to regard the indirect JMP as a separate opcode from the absolute JMP:
"BIT zp,X" and "BIT abs,X" ended up exactly where one would expect them to be, but "BIT #" had to be moved because its slot was already taken by JSR:
TSB ended up in a reasonable place (000bbb00):
But the above assigments exhaust the logical possibilities for opcodes that explicity reference memory locations, so TRB and STZ had to be put wherever room could be found:
That leaves the relative branch instruction
and the single-byte instructions:
|INC A||DEC A||PHY||PLY||PHX||PLX|
Actually, I lied when I said above that the designers of the 65C02 chose to leave the cc = 11 instructions unused. On 65C02s made by Rockwell and by WDC, some of these instructions are used for additional bit setting, clearing, and testing instructions. These instructions are missing on 65C02s made by other manufacturers. (And since this page is part of a set of Apple II-related pages, I should point out that Apple never shipped any computers that used Rockwell or WDC 65C02s, so none of the instructions in this section are available on an unmodified Apple II.)
The bit set and clear instructions have the form xyyy0111, where x is 0 to clear a bit or 1 to set it, and yyy is which bit at the memory location to set or clear.
Similarly, the test-and-branch instructions are of the form xyyy1111, where x is 0 to test whether the bit is 0, or 1 to test whether it is 1, and yyy is which bit to test.
Additionally, the WDC version of the 65C02 includes the 65C816's STP and WAI instructions (see below).
There aren't really any undocumented instructions on the 65C02--any instructions not listed above are documented as performing no operation.
However, these alternate NOPs are not created equal. Some have one- or two-byte operands (which they don't do anything with), and they take different amounts of time to execute.
Actually, it's not quite correct to say that these instructions don't do anything with their operands. A memory read does occur, generally using the addressing mode you would expect from the bit patterns--which may be significant if there happens to be a memory-mapped hardware device at the target address.
A correspondent with appropriate testing hardware reports that the behavior of 01011100 (5C) is strange: "5C bb aa", after fetching its three bytes, accesses FFbb, and then spends four cycles accessing FFFF.
Generally, instruction timing on the 65C02 is the same as on the 6502. There are, however, a few exceptions:
Note that BRA is an ordinary branch instruction that always branches.
The 65C816 uses the cc = 11 instructions, but not for Rockwell bit-manipulation opcodes. Most of these are put to work supplying the new long addressing modes of the 65C816:
These combine with the 01 opcodes:
The missing 010 and 110 instructions are all single-byte instructions:
The remaining instructions are a grab-bag assigned to the few remaining unused positions:
The 65816 generally follows 6502 timing rather than 65C02 timing...indeed, in emulation mode when the direct page register is page-aligned, all 6502 instructions take the same number of cycles as on the original 6502. New 65C02 instructions take their 65C02 times. (The 6502 decimal mode and indirect jump bugs are fixed without requiring the extra cycle that the 65C02 requires.)
In native mode, the key point to remember is that 16-bit quantities are transferred between the CPU and memory one 8-bit byte at a time.
Other 65816-specific exceptions:
LLX > Neil Parker > Apple II > 6502 Instruction Set
Original: July 27, 2004
Modified: May 3, 2005
Modified: May 29, 2016--Added a new note about 65C02 "undocumented" opcodes
Modified: February 24, 2019--Added information about instruction timing
Modified: April 4, 2019--Added missing info about 65618 REP and SEP timing