8086 Opcode Map

This is an HTML-ized version of the opcode map for the 8086 processor. It is based on the opcode map from Appendix A of Volume 2 of the Intel Architecture Software Developer's Manual. A plain-text version - easily parsable by software - is also available.

This map was constructed by taking a map for a more recent x86 processor and removing information irrelevant to the (much earlier) 8086 processor. I wanted as simple a map as possible, and, to that end, this map contains some lacunae:

* Opcodes D8 through DF - the co-processor escape opcodes - are here treated as undefined opcodes. I wanted to focus on integer opcodes in this map, as floating-point would be exceedingly rare in production 8086 code.
* Opcode 0F - "POP CS" on the 8086, and the first byte in multi-byte opcodes on later processors - is also treated as an undefined opcode. I wouldn't expect to see this in 8086 code (as the "POP CS" instruction is particularly useless) and wanted to treat its appearance as an error condition.
* Opcodes C6, C7, and 8F require that the reg bits of the ModR/M byte which follows them be equal to zero. Other values are illegal. This restriction is not shared with other opcodes with "E"-addressed arguments, and not reflected in the map.
* Arguments are not given for a large number of opcodes (e.g. A4 through A7, 9C, 9D) which correspond to instructions which take no arguments when written as assembly code (e.g. MOVSB, MOVSW, CMPSB, CMPSW, PUSHF, POPF). I want to use this map to build a disassembler, not a simulated processor, and the extra arguments would only be burdensome.

In addition to the information that was removed, this map contains two known errors. These were added intentionally so that the results of disassembly based upon this map would mirror the output of DOS DEBUG:

* Opcode 84 should have arguments of Eb, Gb - not Gb, Eb. This distinction only affects (dis)assembly, since the order of operands is irrelevant to TEST's function. Nevertheless, DOS DEBUG has it backwards, and I duplicate the error here.
* Opcode 85 should have arguments of Ev, Gv - not Gv, Ev. All the preceeding remarks about opcode 84 apply equally here.

To use the map, find the cell in the row labelled with the opcode's most significant 4 bits, and the column labelled with the opcode's least significant 4 bits. (The map is split in half; columns 0-7 appear in the first part, while columns 8-F appear in the second.) For instance, opcode 23 appears in the 3rd row, 4th column of the first part of the map (AND Gv, Ev). Opcode 4E appears in the 5th row, 7th column of the second part of the map (DEC SI).

Arguments are either a pair of letters - the first in upper case, the second in lower case - or a special symbol. An upper case/lower case pair can be interpreted by looking up the upper case letter in the "Argument Addressing Codes" table, and the lower case letter in the "Argument Operand Codes" table. Other special symbols can be looked up in the "Special Argument Codes" table.

Continuing the earlier example, opcode 23 resolves to an AND Gv, Ev instruction. Both arguments are upper case/lower case pairs. G represents a general-purpose register selected by the reg bits of a ModR/M byte following the opcode byte. E represents a memory location or general-purpose register selected by the mod and r/m bits of a ModR/M byte following the opcode byte. Both operands are of type "v", so both are WORDs. Opcode 23, therefore, takes the logical AND of a WORD from a 16-bit register or memory location with the WORD from a 16-bit register, and stores the result to the latter register. The particular register(s) and/or memory location involved can be determined by examining the ModR/M byte following the opcode, and consulting page 2-5 of the Instruction Set Reference.

Opcode 4E, on the other hand, resolves to a DEC SI instruction. The SI argument is not an upper case/lower case pair, so we check the special code table. SI turns out to represent (as one might expect) the 16-bit SI register, so opcode 4E simply decrements this register by 1. (Yes, with nearly 30 years hindsight, there probably shouldn't be an entire opcode devoted to this operation.)

The one remaining complexity involves "group" opcodes, such as 80. These opcodes perform different operations depending upon the value of the reg bits in the ModR/M byte following the opcode byte. For example, opcode 80 followed by a ModR/M byte with a reg of 4 is an AND Eb, Ib instruction, while that same opcode followed by a ModR/M byte with a reg of 7 is a CMP Eb, Ib instruction.

To disassemble "group" opcodes, consult the "Opcode Extensions" table for any entry in the opcode map with a mneumonic of the form GRP#. Find the entry in the row labelled with the mneumonic and the column labeled with the value of the reg field of the ModR/M byte following the opcode byte. Note that arguments may be specified in both the opcode map and the opcode extensions table (e.g. for opcode F6, reg 0); if this occurs, the entries in the extensions table take precedence. Normally, however, the arguments from the opcode map are used.

As far as I know, this opcode map is, modulo the lacunae and errata mentioned above, correct. I've used it to implement a full 8086 integer disassembler, the results of which agree with DOS DEBUG. However, if you see something that doesn't look right, please contact me. If you're interested in reading more about the disassembler, the following posts might be worth a look:

Opcode Map (Part 1)
	0	1	2	3	4	5	6	7
0	ADD E b G b	ADD E v G v	ADD G b E b	ADD G v E v	ADD AL I b	ADD AX I v	PUSH ES	POP ES
1	ADC E b G b	ADC E v G v	ADC G b E b	ADC G v E v	ADC AL I b	ADC AX I v	PUSH SS	POP SS
2	AND E b G b	AND E v G v	AND G b E b	AND G v E v	AND AL I b	AND AX I v	ES:	DAA
3	XOR E b G b	XOR E v G v	XOR G b E b	XOR G v E v	XOR AL I b	XOR AX I v	SS:	AAA
4	INC AX	INC CX	INC DX	INC BX	INC SP	INC BP	INC SI	INC DI
5	PUSH AX	PUSH CX	PUSH DX	PUSH BX	PUSH SP	PUSH BP	PUSH SI	PUSH DI
6
7	JO J b	JNO J b	JB J b	JNB J b	JZ J b	JNZ J b	JBE J b	JA J b
8	GRP1 E b I b	GRP1 E v I v	GRP1 E b I b	GRP1 E v I b	TEST G b E b	TEST G v E v	XCHG G b E b	XCHG G v E v
9	NOP	XCHG CX AX	XCHG DX AX	XCHG BX AX	XCHG SP AX	XCHG BP AX	XCHG SI AX	XCHG DI AX
A	MOV AL O b	MOV AX O v	MOV O b AL	MOV O v AX	MOVSB	MOVSW	CMPSB	CMPSW
B	MOV AL I b	MOV CL I b	MOV DL I b	MOV BL I b	MOV AH I b	MOV CH I b	MOV DH I b	MOV BH I b
C			RET I w	RET	LES G v M p	LDS G v M p	MOV E b I b	MOV E v I v
D	GRP2 E b 1	GRP2 E v 1	GRP2 E b CL	GRP2 E v CL	AAM I 0	AAD I 0		XLAT
E	LOOPNZ J b	LOOPZ J b	LOOP J b	JCXZ J b	IN AL I b	IN AX I b	OUT I b AL	OUT I b AX
F	LOCK		REPNZ	REPZ	HLT	CMC	GRP3a E b	GRP3b E v

Opcode Map (Part 1)

ADD

E b G b

ADD

E v G v

ADD

G b E b

ADD

G v E v

ADD

AL I b

ADD

AX I v

PUSH

POP

ADC

E b G b

ADC

E v G v

ADC

G b E b

ADC

G v E v

ADC

AL I b

ADC

AX I v

PUSH

POP

AND

E b G b

AND

E v G v

AND

G b E b

AND

G v E v

AND

AL I b

AND

AX I v

ES:

DAA

XOR

E b G b

XOR

E v G v

XOR

G b E b

XOR

G v E v

XOR

AL I b

XOR

AX I v

SS:

AAA

INC

PUSH

J b

JNO

J b

JNB

J b

JNZ

J b

JBE

J b

E b I b

E v I v

E b I b

E v I b

TEST

G b E b

TEST

G v E v

XCHG

G b E b

XCHG

G v E v

NOP

XCHG

XCHG

XCHG

XCHG

XCHG

XCHG

XCHG

MOV

AL O b

MOV

AX O v

MOV

O b AL

MOV

O v AX

MOVSB

MOVSW

CMPSB

CMPSW

MOV

AL I b

MOV

CL I b

MOV

DL I b

MOV

BL I b

MOV

AH I b

MOV

CH I b

MOV

DH I b

MOV

BH I b

RET

I w

RET

LES

G v M p

LDS

G v M p

MOV

E b I b

MOV

E v I v

E b 1

E v 1

E b CL

E v CL

AAM

I 0

AAD

I 0

XLAT

LOOPNZ

J b

LOOPZ

J b

LOOP

J b

JCXZ

J b

AL I b

AX I b

OUT

I b AL

OUT

I b AX

LOCK

REPNZ

REPZ

HLT

CMC

GRP3a

E b

GRP3b

E v

Opcode Map (Part 2)
	8	9	A	B	C	D	E	F
0	OR E b G b	OR E v G v	OR G b E b	OR G v E v	OR AL I b	OR AX I v	PUSH CS
1	SBB E b G b	SBB E v G v	SBB G b E b	SBB G v E v	SBB AL I b	SBB AX I v	PUSH DS	POP DS
2	SUB E b G b	SUB E v G v	SUB G b E b	SUB G v E v	SUB AL I b	SUB AX I v	CS:	DAS
3	CMP E b G b	CMP E v G v	CMP G b E b	CMP G v E v	CMP AL I b	CMP AX I v	DS:	AAS
4	DEC AX	DEC CX	DEC DX	DEC BX	DEC SP	DEC BP	DEC SI	DEC DI
5	POP AX	POP CX	POP DX	POP BX	POP SP	POP BP	POP SI	POP DI
6
7	JS J b	JNS J b	JPE J b	JPO J b	JL J b	JGE J b	JLE J b	JG J b
8	MOV E b G b	MOV E v G v	MOV G b E b	MOV G v E v	MOV E w S w	LEA G v M	MOV S w E w	POP E v
9	CBW	CWD	CALL A p	WAIT	PUSHF	POPF	SAHF	LAHF
A	TEST AL I b	TEST AX I v	STOSB	STOSW	LODSB	LODSW	SCASB	SCASW
B	MOV AX I v	MOV CX I v	MOV DX I v	MOV BX I v	MOV SP I v	MOV BP I v	MOV SI I v	MOV DI I v
C			RETF I w	RETF	INT 3	INT I b	INTO	IRET
D
E	CALL J v	JMP J v	JMP A p	JMP J b	IN AL DX	IN AX DX	OUT DX AL	OUT DX AX
F	CLC	STC	CLI	STI	CLD	STD	GRP4 E b	GRP5 E v

Opcode Map (Part 2)

E b G b

E v G v

G b E b

G v E v

AL I b

AX I v

PUSH

SBB

E b G b

SBB

E v G v

SBB

G b E b

SBB

G v E v

SBB

AL I b

SBB

AX I v

PUSH

POP

SUB

E b G b

SUB

E v G v

SUB

G b E b

SUB

G v E v

SUB

AL I b

SUB

AX I v

CS:

DAS

CMP

E b G b

CMP

E v G v

CMP

G b E b

CMP

G v E v

CMP

AL I b

CMP

AX I v

DS:

AAS

DEC

POP

J b

JNS

J b

JPE

J b

JPO

J b

JGE

J b

JLE

J b

MOV

E b G b

MOV

E v G v

MOV

G b E b

MOV

G v E v

MOV

E w S w

LEA

G v M

MOV

S w E w

POP

E v

CBW

CWD

CALL

A p

WAIT

PUSHF

POPF

SAHF

LAHF

TEST

AL I b

TEST

AX I v

STOSB

STOSW

LODSB

LODSW

SCASB

SCASW

MOV

AX I v

MOV

CX I v

MOV

DX I v

MOV

BX I v

MOV

SP I v

MOV

BP I v

MOV

SI I v

MOV

DI I v

RETF

I w

RETF

INT

I b

INTO

IRET

CALL

J v

JMP

J v

JMP

A p

JMP

J b

OUT

OUT

CLC

STC

CLI

STI

CLD

STD

GRP4

E b

GRP5

E v

Opcode Map (Opcode Extensions)
	0	1	2	3	4	5	6	7
GRP1	ADD	OR	ADC	SBB	AND	SUB	XOR	CMP
GRP2	ROL	ROR	RCL	RCR	SHL	SHR		SAR
GRP3a	TEST E b I b		NOT	NEG	MUL	IMUL	DIV	IDIV
GRP3b	TEST E v I v		NOT	NEG	MUL	IMUL	DIV	IDIV
GRP4	INC	DEC
GRP5	INC	DEC	CALL	CALL M p	JMP	JMP M p	PUSH

Opcode Map (Opcode Extensions)

GRP1

ADD

ADC

SBB

AND

SUB

XOR

CMP

GRP2

ROL

ROR

RCL

RCR

SHL

SHR

SAR

GRP3a

TEST

E b I b

NOT

NEG

MUL

IMUL

DIV

IDIV

GRP3b

TEST

E v I v

NOT

NEG

MUL

IMUL

DIV

IDIV

GRP4

INC

DEC

GRP5

INC

DEC

CALL

M p

JMP

M p

PUSH

Argument Addressing Codes
A	Direct address. The instruction has no ModR/M byte; the address of the operand is encoded in the instruction. Applicable, e.g., to far JMP (opcode EA).
E	A ModR/M byte follows the opcode and specifies the operand. The operand is either a general-purpose register or a memory address. If it is a memory address, the address is computed from a segment register and any of the following values: a base register, an index register, a displacement.
G	The reg field of the ModR/M byte selects a general register.
I	Immediate data. The operand value is encoded in subsequent bytes of the instruction.
J	The instruction contains a relative offset to be added to the address of the subsequent instruction. Applicable, e.g., to short JMP (opcode EB), or LOOP.
M	The ModR/M byte may refer only to memory. Applicable, e.g., to LES and LDS.
O	The instruction has no ModR/M byte; the offset of the operand is encoded as a WORD in the instruction. Applicable, e.g., to certain MOVs (opcodes A0 through A3).
S	The reg field of the ModR/M byte selects a segment register.

Argument Addressing Codes

Direct address. The instruction has no ModR/M byte; the address of the operand is encoded in the instruction. Applicable, e.g., to far JMP (opcode EA).

A ModR/M byte follows the opcode and specifies the operand. The operand is either a general-purpose register or a memory address. If it is a memory address, the address is computed from a segment register and any of the following values: a base register, an index register, a displacement.

The reg field of the ModR/M byte selects a general register.

Immediate data. The operand value is encoded in subsequent bytes of the instruction.

The instruction contains a relative offset to be added to the address of the subsequent instruction. Applicable, e.g., to short JMP (opcode EB), or LOOP.

The ModR/M byte may refer only to memory. Applicable, e.g., to LES and LDS.

The instruction has no ModR/M byte; the offset of the operand is encoded as a WORD in the instruction. Applicable, e.g., to certain MOVs (opcodes A0 through A3).

The reg field of the ModR/M byte selects a segment register.

Argument Operand Codes
0	Byte argument. Unusual in that arguments of this type are suppressed in ASM output when they have the default value of 10 (0xA). Applicable, e.g., to AAM and AAD.
b	Byte argument.
p	32-bit segment:offset pointer.
w	Word argument.
v	Word argument. (The 'v' code has a more complex meaning in later x86 opcode maps, from which this was derived, but here it's just a synonym for the 'w' code.)

Argument Operand Codes

Byte argument. Unusual in that arguments of this type are suppressed in ASM output when they have the default value of 10 (0xA). Applicable, e.g., to AAM and AAD.

Byte argument.

32-bit segment:offset pointer.

Word argument.

Word argument. (The 'v' code has a more complex meaning in later x86 opcode maps, from which this was derived, but here it's just a synonym for the 'w' code.)

Special Argument Codes
AL	8-bit register: The low byte of AX
CL	8-bit register: The low byte of CX
DL	8-bit register: The low byte of DX
BL	8-bit register: The low byte of BX
AH	8-bit register: The high byte of AX
CH	8-bit register: The high byte of CX
DH	8-bit register: The high byte of DX
BH	8-bit register: The high byte of BX
AX	16-bit register: The 'accumulator' register
CX	16-bit register: The 'counter' register
DX	16-bit register: The 'data' register
BX	16-bit register: The 'base' register
SP	16-bit register: The 'stack pointer' register
BP	16-bit register: The 'base pointer' register
SI	16-bit register: The 'source index' register
DI	16-bit register: The 'destination index' register
ES	16-bit register: The 'extra' segment register
CS	16-bit register: The 'code' segment register
SS	16-bit register: The 'stack' segment register
DS	16-bit register: The 'data' segment register
1	A constant argument of 1, implicit in the opcode, and not represented elsewhere in the instruction. This argument is displayed in assembly code.
3	A constant argument of 3, implicit in the opcode, and not represented elsewhere in the instruction. This argument is displayed in assembly code.
M	The ModR/M byte refers to a memory location, however the contents of that memory location are irrelevant; the address itself is the operand of the instruction. Applicable, e.g., to LEA.

Special Argument Codes

8-bit register: The low byte of AX

8-bit register: The low byte of CX

8-bit register: The low byte of DX

8-bit register: The low byte of BX

8-bit register: The high byte of AX

8-bit register: The high byte of CX

8-bit register: The high byte of DX

8-bit register: The high byte of BX

16-bit register: The 'accumulator' register

16-bit register: The 'counter' register

16-bit register: The 'data' register

16-bit register: The 'base' register

16-bit register: The 'stack pointer' register

16-bit register: The 'base pointer' register

16-bit register: The 'source index' register

16-bit register: The 'destination index' register

16-bit register: The 'extra' segment register

16-bit register: The 'code' segment register

16-bit register: The 'stack' segment register

16-bit register: The 'data' segment register

A constant argument of 1, implicit in the opcode, and not represented elsewhere in the instruction. This argument *is* displayed in assembly code.

A constant argument of 3, implicit in the opcode, and not represented elsewhere in the instruction. This argument *is* displayed in assembly code.

The ModR/M byte refers to a memory location, however the contents of that memory location are irrelevant; the address itself is the operand of the instruction. Applicable, e.g., to LEA.