################################################## 
Tutorial Assembly for nerds using linux - part 1 
made by member Olaf of LNG - Linux security group 
email – linuxsecgroup@hotmail.com 
################################################## 

If you want to join LNG mailing list go to 
http://www.egroups.com/group/linuxsecgroup 

############ Introduction ############ 

Assembly nowadays is a hard thing to learn, not because it's difficult but 
because people thinks that there is no reason to learn assembly! That's 
not true... With assembly you can have total power above the computer, and 
know exactly what he's doing. Try to remember that while trying to learn! 
You may wondering: Why to read this tut? Good point... There are thousands 
of papers for programming assembly in x86. That's true... But did they 
teach you how to make apps for linux? Did they talked about linux 
interrupts? I don't think so... There are many tutorials about programming 
assembly for x86 in dos and windows, but very few on linux. I want to 
change that. So this is the first issue of a collection of papers about 
that. If you find that this paper has any error, please contact me and let 
me know. 

##### Index ##### 

1. Numbering systems 
1.1. Decimal system 
1.2. Binary system 
1.2.1. Converting a binary number to a decimal number 
1.2.2. Converting a decimal number to a binary number 
1.3. Hexadecimal system 
1.4. Conventions 
2. Binaries in computers 
2.1. Bit 
2.2. Nibble 
2.3. Byte 
2.4. Word 
2.5. Double word 

###################### 1. Numbering systems ###################### 

1.1. Decimal system 

Nowadays we use the decimal numbering system in almost everything that is 
related to numbers. We use it so often and in a natural way that we forget 
it's meaning. What is decimal system? 

. Every decimal number, has only digits between zero and nine, 
making a total of 10 digits 
Note: how many fingers do you have? ...10. In fact the decimal 
system is bound to human anatomy. 
. Ok, and what is the meaning of each digit? Consider the 
following numbers: 234 and 234,43 
We do some transformations 
-> 234 i.e. 200 + 30 + 4 i.e. 2 * 10^2 + 3 * 10^1 + 4 * 10^0 
-> 234,43 = 2 * 10^2 + 3 * 10^1 + 4 * 10^0 + 0,43 = 2 * 10^2 + 3 * 
10^1 + 4 * 10^0 + 4 * 10^-1 + 3 * 10^-2 
Do you see the relation? Each digit appearing to the left of the 
decimal point represents a value between zero and nine times an increasing 
power of ten. Digits appearing to the right of the decimal point represent 
a value between zero and nine times a decreasing power of ten. 

1.2. Binary system 

Binary system uses only two digits, by convention the digits are 0 and 1. 
This system is so widely used in computers... By coincidence or not this 
system adjusts perfectly to computers... Computers operate using binary 
logic. The computer represents values using two different voltage levels, 
in this way we can represent 0 and 1. Like I said before the same applies 
to binary system, it is well adjust to computer anatomy! 

1.2.1. Converting a binary number to a decimal number 
Apply the same rule we saw in 1.1, but with powers of two. 
Example: 1010 -> 1 * 2^3 + 0 * 2^2 + 1 * 2^1 + 0 * 2^0 = 10 
1.2.2. Converting a decimal number to a binary number 

We have two ways to do it: 

1.2.2.1 We consecutively divide the decimal value by a power 
two(keeping the remainder), while the result of the division is different 
than zero. The binary representation is obtained by the sequence of 
remainders in the inverse order of the divisions. 

Consider the number 10(in decimal): 

10 / 2 
0 5 / 2 
1 2 / 2 
0 1 / 2 
1 0 

So in binary we write 1010 

1.2.2.2 You can try to find out the number by adding powers of two, 
that added will produce the decimal result. 

Consider for example number 123... hmmm it's a number not less than 2^0 
and not greater then 2^7. Cool… 

2^7 2^6 2^5 2^4 2^3 2^2 2^1 2^0 
0 1 1 1 1 0 1 1 because 1 * 2^6 + 1 * 2^5 + 1 * 
2^4 + 1 * 2^3 + 0 * 2^2 + 1*2^1 + 1 * 2^0 
= 123 
Our result is 1111011. 

1.3. Hexadecimal system 

You saw how many digits took to represent the number 123 in binary. 7 
digits! Imagine 1200, 10000,... it hurts. So programmers had to choose 
another numbering system, just to "talk" to the machine... and no... it's 
not the decimal system!! You saw the trouble we had to convert one simple 
number like 10 between decimal and binary... I think you don't want to 
spend half of your life doing that. Engineers thought on that and they 
elected the hexadecimal system... Hexadecimals is the "english" for 
computers. They have two special features: 
- They're very compact 
- it's simple to convert them to binary and vice-versa. A 
hexadecimal number has digits with a value between 0 and 15 times a 
certain power of sixteen. Because we only know digits between 0-9 we have 
to use six more digits! We can use the 6 first letters of the alphabet. 
Let's see a example: FF = 15 * 16^1 + 15 * 16^0 = 255 (16) (10) 

Converting between binary and hexadecimal is very easy! To convert binary 
to hexadecimal remember that every four digits correspond to a single 
hexadecimal digit... to convert back to binary just apply the inverse 
rule! Let's take a look at the next example: 

110 1011 = 0110 1011 =6B 
(2) (16) 

It's very easy!! To make things easier, take a look at the following 
table: 

################ 
# D # H # B # 
################ 
# 0 # 0 # 0000 # 
# 1 # 1 # 0001 # 
# 2 # 2 # 0010 # 
# 3 # 3 # 0011 # 
# 4 # 4 # 0100 # 
# 5 # 5 # 0101 # 
# 6 # 6 # 0110 # 
# 7 # 7 # 0111 # 
# 8 # 8 # 1000 # 
# 9 # 9 # 1001 # 
#10 # A # 1010 # 
#11 # B # 1011 # 
#12 # C # 1100 # 
#13 # D # 1101 # 
#14 # E # 1110 # 
#15 # F # 1111 # 
################ 

1.4. Conventions 

Programming in assembly, requires you to obey some rules when using 
numbers, because you can use three different numbering systems. 

When writing a number: 

- all numbers have to start with a decimal digit 
- all numbers end with a letter, indicating the type of number: 
. for hexadecimals the letter is h 
. binary numbers end with b 
. decimals end with t or d We will use the following notation: 

Xn Xn-1 ... X2 X1 -> Xi represents a bit, and i<-[0,1,...,n] represents 
it's position. 

######################## 2. Binaries in computers ######################## 

2.1. Bit is the abbreviation to binary digit. We'll use this abbreviation 
all the time. As you may have guessed that “bit” is the smallest unit of 
data on a binary computer. 

2.2. Nibble is a set of four bits. For example 1110, 1101, 1111 is a 
nibble... every thing that has 4 bits is a nibble. 
X3 X2 X1 X0 -> X1 X0 are the low order bits in the nibble, while 
X2X3 are the high order bits. 

2.3. Byte -> This is the most used data structure in computers. A byte is 
a collection of eight bits: 
X7 X6 X5 X4 X3 X2 X1 X0 When referring to a byte we say that it's 
low order nibble is X0 X1 X2 X3, while the high order nibble is X4 X5 X6 
X7. Note: bytes are used to represent characters using the ASCII 
character set. 

2.4. A word consists of 2 bytes, i.e. a group of 16 bits. In a byte you 
have a low order byte and high order byte. 

2.5. A double word consists of 2 words, i.e. 4 bytes i.e. 32 bits. In a 
double word you have a L.O. word and a H.O. word. 

In the next tutorial I will talk about: 

- Arithmetic operations - Signed and unsigned numbers - System 
Organization - Flags and registers - Logical operations on bits - Some 
instructions like: mov, jmp, call, and...