Crash course to Amiga assembly programming - Reaktor

Updated May 10, 2021: Hey there friend! This ever-green blog is gold – just like reaktorians. If you’re reading this, <a href="https://www.reaktor.com/careers/">you might be one of us</a>.

The&nbsp;30th anniversary of Amiga inspired me to dig into Amiga programming. Back in Amiga’s golden era (late 80’s and early 90’s) I never had the chance to try this out since despite my relentless whining my parents wouldn’t get me one. Luckily later when I was studying at the uni, I managed to bargain one fine Amiga 500 specimen from the flea market at an affordable price of 20 euros.

Although Amiga as such is not that useful a platform to know these days, learning how to write programs for it can be very educational. Amiga as an environment is much simpler than (for instance) modern PCs. This makes learning low-level programming on it faster than on more complex environments. Although the hardware architecture is quite simple, it has some computer system design features that are still in use in modern environments as well such as <a href="https://en.wikipedia.org/wiki/Direct_memory_access">DMA</a> and <a href="https://en.wikipedia.org/wiki/Interrupt">interrupts</a>. On top of being plain fun, writing assembly on Amiga teaches programming concepts that are usually hidden by higher-level languages and modern operating systems.

I’ve written this blog post together with Harri Salokorpi. We’ll walk you through an example that creates graphics on the display with a simple animation. We both hope this blog post provides a quick start to those who want to try out programming on this legendary device. However, we’re mostly going to use an emulator as a development environment, so the real device is not mandatory.

As said, we are going to use the&nbsp;<a href="http://fs-uae.net/">FS-UAE Amiga Emulator</a> to run our compiled Amiga software. On top of installing the emulator, you’re also going to need at least one Kickstart ROM image and a Workbench floppy disk image. FS-UAE can emulate multiple Amiga models, but for this blog post we are using an Amiga 500 with a Kickstart 1.3 ROM image and Workbench 1.3. You can get plenty of ROM images and Workbench disk images from the <a href="http://www.amigaforever.com/">Amiga Forever</a> bundle. For instance, the Forever Plus Edition provides the required ROM images, Workbench disk images and some additional goodies like games and demos.

In order to get the emulator to boot up so that we can effectively run and debug our software, we need to configure it properly. FS-UAE takes a configuration file (an ini-file) as a command line parameter like this: <code>fs-uae configuration.ini</code>. Here is an example configuration file:

<code>floppy_drive_0</code> will instruct FS-UAE to use the specified ADF-file (<a href="https://en.wikipedia.org/wiki/Amiga_Disk_File">Amiga Disk File image</a>) as the disk inserted into the first floppy disk drive. <code>hard_drive_0</code> enables you to map any directory on the host system as a hard drive in the booted up Amiga system. This is useful during development since it allows us to edit and compile on the host device and access the compiled binary directly from the emulator. I have made symbolic links in <code>$HOME/amigahd</code> to all directories in my host environment where I have Amiga binaries, so that I can easily access them from the emulated system. <code>kickstart_file</code> tells which ROM image file to use when booting up the Amiga emulator.

FS-UAE comes with a handy debugger that you can use on your host system while your software is running in the emulator. To enable the debugger you need to specify <code>console_debugger = 1</code> in the configuration file.
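To make the options above concrete, here is a small Python sketch that assembles a configuration file from them. It is not part of the example project. The option names <code>floppy_drive_0</code>, <code>hard_drive_0</code>, <code>kickstart_file</code> and <code>console_debugger</code> are the ones discussed above; the <code>[fs-uae]</code> section header, the <code>amiga_model</code> option and all file names are our assumptions, so adjust them to your own setup and FS-UAE version.

```python
import configparser
import io


def build_fs_uae_config(kickstart, workbench_adf, shared_dir, debugger=True):
    """Return FS-UAE configuration text built from the options discussed above.

    The section name "fs-uae" and the "amiga_model" option are assumptions;
    the paths passed in are placeholders chosen by the caller.
    """
    config = configparser.ConfigParser()
    config["fs-uae"] = {
        "amiga_model": "A500",
        "kickstart_file": kickstart,
        "floppy_drive_0": workbench_adf,
        "hard_drive_0": shared_dir,
        "console_debugger": "1" if debugger else "0",
    }
    buffer = io.StringIO()
    config.write(buffer)
    return buffer.getvalue()


if __name__ == "__main__":
    # Hypothetical file names, only for illustration.
    print(build_fs_uae_config("kick13.rom", "workbench13.adf", "amigahd"))
```

Saving the printed text as <code>configuration.ini</code> gives a file that can be passed to <code>fs-uae</code> as described above.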

Now if you run <code>fs-uae configuration.ini</code> with appropriate ROM image files and Workbench disk images, the emulator should boot using the specified ROM image and start loading Workbench from the floppy disk image (with the familiar floppy disk drive sounds and everything). Eventually you should be greeted with a UI that looks something like this:

Next, we are going to set up our host-side cross-assembler which will eat our assembly code and spit out Amiga <a href="https://en.wikipedia.org/wiki/Motorola_68000">m68k</a> binaries. <a href="http://sun.hasenbraten.de/vasm/">Vasm</a> runs on a number of platforms and is capable of producing not only m68k binaries but object files for multiple other platforms as well. You can compile vasm to target the m68k architecture and mot syntax by running <code>make CPU=m68k SYNTAX=mot</code>. The resulting binary <code>vasmm68k_mot</code> can now be used to compile assembly code that can be run directly on an Amiga.

Let’s introduce some source code. Our example can be cloned from <a href="https://github.com/uhef/amiga-assembly-crashcourse">GitHub</a>. Run <code>vasmm68k_mot -kick1hunks -Fhunkexe -o example -nosym source.asm</code> in the root of the cloned project to compile the example. This will produce <code>example</code> binary in the same folder. Running <code>file example</code> will give you:

myname@mycomputer ~/w/amiga-quickstart&gt; file example

example: AmigaOS loadseg()ble executable/binary

We have our first Amiga executable! A few words on the arguments given to vasm: <code>-kick1hunks</code> will produce a binary that is compatible with Kickstart 1.x systems. <code>-Fhunkexe</code> selects the AmigaOS executable (hunk) output format. <code>-nosym</code> strips local symbols from the binary. More information about vasm and its features can be found in the <a href="http://sun.hasenbraten.de/vasm/release/vasm.html">documentation</a>.

Note: The <code>-kick1hunks</code> option is only needed for Kickstart 1.3 equipped Amigas. This means most of the Amiga 500, Amiga 1000 and Amiga 2000 computers, unless they have had their kickstart chips upgraded. Amiga 1200, Amiga 600, Amiga 500+ and Amiga 3000 have more modern Kickstart 2.0 (or 3.0/3.1) built in.

We are ready to run the software! Let’s do just that. However, let’s first introduce a handy tool, the FS-UAE debugger. It can be run in the console of the host system while the code is running in the emulator. This can be enabled by the <code>console_debugger = 1</code> declaration in the FS-UAE configuration file. In order for this to work the emulator needs to be run from console.

Once you have the emulator up and running, double click on the Workbench icon on the desktop. A new window opens containing an icon for the shell. Double click that to start the shell. You are now in <a href="https://en.wikipedia.org/wiki/AmigaDOS">AmigaDOS</a>. Activate the FS-UAE console debugger by pressing <code>F12-d</code>. This will freeze the emulator and open the debugger in the console in which you started FS-UAE. Once started, the debugger will show useful information such as the current contents of the CPU registers and the next program counter. Something like this:

You can type <code>?&lt;Enter&gt;</code> to list debugger commands. Run <code>fp "example"&lt;Enter&gt;</code> to tell the debugger to break execution when process called “example” is being run. This command will let the emulator run again, so you can now navigate with the AmigaDOS to the location where compiled example is stored and run it. In my case, where the directory called <code>amiga-quickstart</code> contains the amiga binary and there’s a symbolic link to the directory in a directory called <code>amigahd</code> which is shared with the emulator, the process is like this:

Once you execute the binary, the breakpoint will trigger and control will be returned to the debugger, where you should be greeted with something like this:

Row above <code>Next PC:</code> shows the instruction that would be executed next. If you compare that instruction with the first code instruction in the example source code <code>source.asm</code> which says <code>move.w DMACONR,d0</code> you should notice a remarkable similarity. <code>DMACONR</code> is just alias for address <code>$dff002</code> which is defined on the first line of the source code file: <code>DMACONR EQU $dff002</code>. We are now executing the first line of our application in a debugger. If you type <code>d&lt;Enter&gt;</code> in the debugger you will notice even more lines that look the same in debugger output and in <code>source.asm</code>. d stands for disassembly, and without parameters it will disassemble 10 consecutive lines starting from the next program counter address.

Type <code>g&lt;Enter&gt;</code> for “go” so that emulator will kick back alive and you can witness the marvellous animated graphics running in the emulator!

Let’s explore the source code. First block of <code>source.asm</code> looks like this:

This block of code is proudly copied from the <a href="http://vikke.net/index.php?id=copperbars-1">copperbars example</a> of AmigaVikke. It stores the values of certain hardware registers to memory so that they can be restored to the same hardware registers once execution of the program completes. As you probably could guess, the&nbsp;<code>move</code> instruction moves data between memory and CPU registers. The <code>or</code> instruction unsurprisingly does a bitwise or on the register value. It’s important to notice that the <code>.w</code> suffix on the instruction denotes the size of the data the operation works on. There are three variants: <code>.b</code> for byte-sized operations (8 bits), <code>.w</code> for word-sized operations (16 bits) and <code>.l</code> for long-word operations (32 bits). Good coverage of m68k instructions and their operands can be found in the <a href="http://www.freescale.com/files/archives/doc/ref_manual/M68000PRM.pdf">Programmer’s Reference Manual</a>.

As mentioned before,&nbsp;<code>DMACONR</code>, <code>INTENAR</code> and others are just aliased memory addresses of certain hardware registers. These registers are memory-mapped: reading from and writing to their addresses lets the CPU control Amiga’s custom chipset. (They are distinct from chip mem, the RAM that the custom chips themselves can access.) These addresses all start with <code>$dff</code> and use the remaining 12 bits to denote the actual register. You can see the addresses the aliases point to at the beginning of the source file. For instance <code>DMACONR</code> aliases address <code>$dff002</code>.
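The address arithmetic is easy to model on the host side. The following Python sketch is our own illustration, not part of the example project. The <code>DMACONR</code> address comes from the source file; the other offsets are taken from the hardware manual, and the helper names are hypothetical.

```python
CUSTOM_BASE = 0xDFF000  # every custom chip register address starts with $dff

# Register offsets from the base address.
REGISTER_OFFSETS = {
    "DMACONR": 0x002,
    "INTENAR": 0x01C,
    "BPLCON0": 0x100,
    "BPLCON1": 0x102,
}


def register_address(name):
    """Return the full address that a register alias stands for."""
    return CUSTOM_BASE + REGISTER_OFFSETS[name]


def register_offset(address):
    """Return the low 12 bits of an address, which select the register."""
    return address & 0xFFF


if __name__ == "__main__":
    print(hex(register_address("DMACONR")))
```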

You can find documentation of all Amiga hardware registers in <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0060.html">Amiga hardware guide</a>.

<code>olddmareq</code> and the other memory locations where the data from hardware registers is stored are defined at the end of the source code file in data block definitions. For instance <code>olddmareq: dc.w 0</code> defines (dc for “define constant”) a word-sized block of memory that is initialized to zero and assigns the label <code>olddmareq</code> to it.

Amiga has support for dynamically loadable libraries. Every Amiga system also comes with a set of predefined utility libraries. Libraries are opened using the <a href="http://amigadev.elowar.com/read/ADCD_2.1/Includes_and_Autodocs_2._guide/node0367.html">OpenLibrary</a> function which itself is stored in a library called <code>exec.library</code>. Calling <code>OpenLibrary</code> provides the caller with the base address of the opened library. All functions within that library can be accessed by jumping to addresses relative to the base address of the library. As said, libraries are opened using <code>exec.library</code>, which is a library itself. The jump table of every other library can be loaded to any memory address (that’s why they need to be accessed using <code>OpenLibrary</code>). The <code>exec.library</code> base address is always stored in memory address <code>$4</code>.

In the code block above we call <code>OpenLibrary</code> of <code>exec.library</code> by loading the library base address from memory address <code>$4</code> to register <code>a6</code> and using the <code>jsr</code> instruction to jump to the function, which resides at offset <code>-552</code> from the base address. We provide the name of the library in register <code>a1</code> (<code>#gfxname</code> points to a memory address that contains the string <code>graphics.library</code>) and the version in register <code>d0</code>. Zero means that we are fine with whatever version of the library the system can provide. <code>OpenLibrary</code> returns the base address of <code>graphics.library</code> in register <code>d0</code>. Effectively, we have now opened the graphics library and we can use the functions it provides until we close the library.

Note: The minimum library version passed to <code>OpenLibrary</code> is mostly relevant with Kickstart 2.0, 2.1, 3.0 and 3.1 based systems and with AGA-chipset Amigas.

Last two instructions above copy old system view and copper list pointers to a memory location. The offsets used refer to internal structure of the Amiga graphics library – see <code>struct View</code> <a href="http://amigadev.elowar.com/read/ADCD_2.1/Includes_and_Autodocs_3._guide/node05ED.html">here</a>. This is not a very future-proof way to do this, but Amiga has almost always allowed direct access to internal system structures (due to missing MMU and memory protection), and sometimes there’s no system friendly way to read such data.

You can find information about libraries and the functions they provide in <a href="http://amigadev.elowar.com/read/ADCD_2.1/Libraries_Manual_guide/node0290.html">Libraries Manual Guide</a>. Library function base address offsets are included in Amiga Developer Docs’ <a href="http://amigadev.elowar.com/read/ADCD_2.1/Includes_and_Autodocs_2._guide/node0550.html">Includes and Autodocs</a>.

In this code block several library calls are made as outlined in the discussion of the previous block. First, the current view is removed by calling <a href="http://amigadev.elowar.com/read/ADCD_2.1/Includes_and_Autodocs_2._guide/node0459.html">LoadView</a> with parameter 0 (the value of register <code>a1</code>). Next, we wait for vertical blank (for the next video frame to commence). Finally, we call <a href="http://amigadev.elowar.com/read/ADCD_2.1/Includes_and_Autodocs_2._guide/node0353.html">Forbid()</a> in <code>exec.library</code>, which is a rather powerful call that disables multitasking by preventing other tasks from being scheduled.

<code>WaitTOF</code> is called twice in case the system display uses interlaced mode. In <a href="http://amigadev.elowar.com/read/ADCD_2.1/Libraries_Manual_guide/node0316.html">interlaced mode</a> the system has two copper lists (odd and even frame) so it might take up to 2 screen draws until our copper list is properly loaded.

This block should seem familiar after reading about the first setup block. In the first setup block we read data from hardware registers (the ones starting <code>$dff</code>) and stored the values to memory. Here we write values to hardware registers in order to setup the system the way we want.

This block sets Amiga up to show 320 x 200 graphics with 8 colors.

Amiga uses <a href="https://en.wikipedia.org/wiki/Bit_plane">bitplanes</a> to draw graphics. A bitplane contains a single bit per pixel on the screen. By using three bitplanes we can display 8 different colors simultaneously, since three bits per pixel give 2^3 = 8 combinations. The original chipset supports up to six bitplanes in low-resolution mode. The first instruction in the block writes <code>$3200</code> to register <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0022.html"> BPLCON0</a> to set up the system with three bitplanes.
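To see where <code>$3200</code> comes from, here is a small Python sketch of how the value can be composed. It is our own illustration and assumes the bit layout from the hardware manual: the bitplane count sits in bits 12 to 14 and bit 9 enables color output. The helper names are hypothetical.

```python
COLOR_ENABLE = 1 << 9  # bit 9 of BPLCON0: color output enable
BPU_SHIFT = 12         # bits 12-14 of BPLCON0: number of bitplanes in use


def bplcon0_value(bitplanes):
    """Compose a BPLCON0 value for the given number of bitplanes."""
    return ((bitplanes & 0x7) << BPU_SHIFT) | COLOR_ENABLE


def color_count(bitplanes):
    """Each bitplane adds one bit per pixel, doubling the color count."""
    return 2 ** bitplanes


if __name__ == "__main__":
    print(hex(bplcon0_value(3)), color_count(3))
```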

Amiga also allows usage of two simultaneous playfields. Each playfield can leverage its own bitplanes (up to three bitplanes each). These playfields are displayed on top of each other and can be moved independently of each other. On the second line we set horizontal scroll of both playfields to zero. Later in the software we shall create the animation by modifying this horizontal scroll value. There’s more about playfields <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0078.html">here</a>.

On the third and fourth line we specify bitplane modulos for both odd and even bitplanes using hardware registers <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0021.html"> <code>BPL1MOD</code> and <code>BPL2MOD</code></a>. The modulo tells Amiga how many bytes to skip in memory after it has read a line of bitplane data, before it reads the next line of the same bitplane. In our case the bitplane data is tightly packed, so that for each raster line the lines of all three bitplanes are stored one after another. This means that we need to skip 80 bytes (<code>$0050</code> in hexadecimal) when we reach the end of a line in order to reach the next line of the same bitplane: each 320 pixel line of one bitplane takes 320/8 = 40 bytes, and since there are three bitplanes in total, there are always two bitplanes’ worth of data (2 × 40 = 80 bytes) after a raster line of a given bitplane.
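The modulo arithmetic can be checked with a few lines of Python. This is just a sketch of the calculation described above, with hypothetical helper names.

```python
def bytes_per_line(width_pixels):
    """One bitplane stores one bit per pixel, so 8 pixels fit in a byte."""
    return width_pixels // 8


def interleaved_modulo(width_pixels, bitplanes):
    """Bytes to skip after one line to reach the next line of the same bitplane.

    With line-interleaved data, the lines of all the other bitplanes sit
    between two consecutive lines of a given bitplane.
    """
    return (bitplanes - 1) * bytes_per_line(width_pixels)


if __name__ == "__main__":
    print(hex(interleaved_modulo(320, 3)))
```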

<a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node002E.html"> DIWSTRT and DIWSTOP</a> registers control the size of the display window. <code>DIWSTRT</code> controls the top-left corner of the window and <code>DIWSTOP</code> controls the bottom-right corner of the window.

<a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node002C.html"> DDFSTRT and DDFSTOP</a> control the horizontal timing when bitplane data fetch is started and stopped. As you can see from the documentation we use the “normal” setting for both. These timings are wait times (in cycles) from the start of the row, and from the end of the row.

These register values define the size and position of the display onscreen. This is related to <a href="http://amigadev.elowar.com/read/ADCD_2.1/Libraries_Manual_guide/node0314.html"> overscan</a> which is relevant with old analog display technologies where the display area and resolution are not clearly defined. Modifying these values allows us to stretch the viewport to cover all the screen area. With Amiga hardware, this also increases the number of pixels displayed. The maximum overscan for OCS chipset is in lores-mode <a href="http://ada.untergrund.net/?p=boardthread&amp;id=923">roughly 352×286</a>, and it may require <a href="http://ada.untergrund.net/?p=boardthread&amp;id=890">some trickery</a> to achieve.

The lines commented <code>DMA set ON</code> and <code>DMA set OFF</code> use <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node002F.html"> DMACON</a> to set up direct memory access for the software. We only need copper and bitplane DMA, so everything else is explicitly disabled.

The last two lines use <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0036.html"> INTENA</a> to set up interrupts.

The Amiga chipset consists of three chips: Agnus, Denise and Paula. Agnus contains a general purpose coprocessor (or “Copper” for short) which handles a big part of the graphics on an Amiga system. The copper has its own instruction set which can be used to write programs that run on it. The copper instruction set contains only three different instructions: <code>WAIT</code>, <code>MOVE</code> and <code>SKIP</code>. The main loop of the example generates a program for the copper (called a copper list) on each frame and assigns it to execution.

This first block increases frame counter and fetches pointer into which we will write the copper list. Frame counter is needed in order for us to create an animation. The ongoing frame is stored in register <code>d1</code> during the mainloop. <code>#copper</code> points to memory block that we have reserved in chip mem for storing the copper list in. All data accessed by copper needs to be allocated in memory segment that is available to the custom chip set. If you take a look at the end of code listing you will see that <code>copper</code> data block is declared in a segment that is marked <code>ChipRAM</code>. Address pointer to copper list is stored in register <code>a6</code> which is incremented when we build up the copper list by adding instructions to it.

Copper <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node004A.html">move instruction</a> moves data to hardware registers. The instruction is encoded so that first word of the instruction denotes the hardware register address to which data is moved and the second word of the instruction contains the data to be moved. Since all hardware registers have the same 12 highest bits (<code>dff</code>) the first word of the instruction takes only last 12 bits of the destination register as a parameter. So for instance when we add move instruction <code>$00e2</code> to copper list we want copper to move the data to register <code>dff0e2</code>.
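As an illustration of this encoding, the following Python sketch builds the two words of a copper move instruction. It is our own model of the description above, not code from the example project, and <code>copper_move</code> is a hypothetical helper name.

```python
def copper_move(register_address, value):
    """Return the two 16-bit words of a copper MOVE instruction.

    The first word carries only the low 12 bits of the register address,
    the second word carries the data to be written.
    """
    offset = register_address & 0xFFF
    if offset & 1:
        # Hardware registers are word-aligned, so the offset must be even.
        raise ValueError("register address must be even")
    return (offset, value & 0xFFFF)


if __name__ == "__main__":
    first, second = copper_move(0xDFF0E2, 0x1234)
    print(hex(first), hex(second))
```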

This block (and the two blocks following it) assign our bitplane data to bitplane hardware registers so that the hardware will draw the graphics from bitplanes contained in memory. The address of the bitplane data is communicated to the hardware through two registers: <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0024.html">high and low pointer</a>. The low pointer holds address bits 15 to 1 (bit 0 is ignored, so the data must start at an even address) and the high pointer holds the highest 3 bits (bits 18 to 16). The <code>swap</code> instruction swaps the 16-bit halves of the 32-bit register so that the low bits become high bits and vice versa. This allows us to store the low bits to the low pointer register and the high bits to the high pointer register. Notice that we are always moving just 16 bits of data to the copper list (by using the <code>.w</code> version of the <code>move</code> instruction). The <code>move.w</code> instruction with the <code>(a6)+</code> addressing mode moves data to the address pointed to by <code>a6</code> and then increments the address in <code>a6</code> by the size of a word (2 bytes).
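The effect of <code>swap</code> and the split into two 16-bit words can be modelled in Python as follows. This is a sketch under our own assumptions: the register offsets <code>$0e0</code> and <code>$0e2</code> for the first bitplane’s high and low pointer come from the hardware manual, the helper names are hypothetical, and the order in which the example writes the two halves may differ.

```python
BPL1PTH = 0x0E0  # bitplane 1 pointer, high word
BPL1PTL = 0x0E2  # bitplane 1 pointer, low word


def swap(value):
    """Model the m68k swap instruction: exchange the 16-bit halves."""
    value &= 0xFFFFFFFF
    return ((value << 16) | (value >> 16)) & 0xFFFFFFFF


def bitplane_pointer_moves(address, pth=BPL1PTH, ptl=BPL1PTL):
    """Return copper MOVE word pairs that load a bitplane pointer."""
    low = address & 0xFFFF         # what move.w sees before the swap
    high = swap(address) & 0xFFFF  # what move.w sees after the swap
    return [(pth, high), (ptl, low)]


if __name__ == "__main__":
    for register, word in bitplane_pointer_moves(0x00023456):
        print(hex(register), hex(word))
```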

This block adds a bunch of move instructions into the copper list. These move instructions define the color palette used while drawing the graphics. Since we are using three bitplanes we have in total 3 bits to control used color which adds up to 8 distinct colors. Colors are defined in hardware registers <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0027.html"><code>COLOR00</code> through <code>COLOR31</code></a>. Each color is defined using 12 bits (4 blue, 4 green and 4 red).
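To illustrate the 12-bit color format, here is a Python sketch that packs red, green and blue components and pairs them with color registers. It assumes the layout documented in the hardware manual (red in the top four bits, blue in the bottom four) and that <code>COLOR00</code> lives at offset <code>$180</code> with one word per color. The palette in the usage example is made up.

```python
COLOR00 = 0x180  # offset of the first color register


def rgb12(red, green, blue):
    """Pack three 4-bit components (0-15) into a 12-bit color value."""
    for component in (red, green, blue):
        if not 0 <= component <= 15:
            raise ValueError("each component must be in the range 0-15")
    return (red << 8) | (green << 4) | blue


def palette_moves(colors):
    """Return copper MOVE word pairs that set COLOR00, COLOR01, and so on."""
    return [(COLOR00 + 2 * index, rgb12(*color))
            for index, color in enumerate(colors)]


if __name__ == "__main__":
    # A made-up two-color palette: black and white.
    print(palette_moves([(0, 0, 0), (15, 15, 15)]))
```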

This block animates the graphics on the screen. We introduce different horizontal displacement per scan line and animate the displacement value by frame counter in order to create an effect where the graphics continuously flutter.

We achieve this effect by inserting values into <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0022.html"> <code>BPLCON1</code></a> register. Copper is instructed to perform the horizontal displacement assignment using move instruction with calculated displacement stored in <code>d5</code> – register. This is done by these two rows above:

During each frame the graphics area is divided into 32 bands. Each of these bands has a different horizontal displacement. We instruct the copper to wait until the scan line of the next band is reached, at which time we assign a new horizontal displacement value to the <code>BPLCON1</code> register. The copper <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node004B.html"> <code>WAIT</code></a> instruction is formed using the value in the <code>d1</code> register, where the lower 8 bits are always <code>$07</code> (this denotes that the horizontal beam position should be at the beginning of the scan line) and the higher 8 bits are the beam vertical position (i.e. the scan line). The wait instruction is inserted into the copper list by these two instructions:

Once displacement of a given band is assigned, we wait until the next band. The vertical beam position of the next band is calculated by adding 5 to the vertical position of the previous band: <code>160 / 32 = 5</code> where 160 is the height of the graphics and 32 the number of bands. Thus the next wait position is calculated by this instruction:
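The band positions and the wait words can be sketched in Python like this. The first scan line used in the usage example is a made-up value, and the second word <code>$fffe</code> (compare all beam position bits) is our assumption of a typical mask rather than something taken from the example source.

```python
def band_scan_lines(first_line, height=160, bands=32):
    """Return the scan line at which each band starts."""
    step = height // bands  # 160 / 32 = 5 lines per band
    return [first_line + step * band for band in range(bands)]


def copper_wait(scan_line):
    """Return the two 16-bit words of a copper WAIT for the start of a line.

    High byte of the first word: vertical beam position.
    Low byte of the first word: always $07, the start of the scan line.
    """
    first = ((scan_line & 0xFF) << 8) | 0x07
    second = 0xFFFE  # assumption: compare every position bit
    return (first, second)


if __name__ == "__main__":
    # Hypothetical first line of the graphics area.
    for line in band_scan_lines(44)[:3]:
        print(line, [hex(word) for word in copper_wait(line)])
```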

<code>sin32_15</code> contains a pre-calculated table of sine values (32 samples with values from 0 to 15). Horizontal displacement value is calculated from these sine values. We fetch a sine value for each band by taking modulo 32 of the frame counter and using the resulted value as an index to the sine table:

Amiga supports hardware horizontal scroll on 4-bit resolution (thus the 0 to 15 sine value). To correctly scroll the graphics, we need to use the same displacement value on both playfields one and two. This is achieved by having the same displacement value on bits 1-4 and 5-8:
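The table lookup and the duplicated nibbles can be modelled in Python as below. This is a sketch with several assumptions of our own: the table is generated here rather than copied from <code>sin32_15</code>, so its exact values may differ from the example’s, and offsetting the index by the band number is one plausible way to give each band a different value.

```python
import math


def make_sine_table(samples=32, max_value=15):
    """Generate one sine period scaled to the range 0..max_value."""
    return [round((math.sin(2 * math.pi * i / samples) + 1) / 2 * max_value)
            for i in range(samples)]


def band_displacement(table, frame, band):
    """Pick a table entry; the modulo keeps the index inside the table."""
    return table[(frame + band) % len(table)]


def bplcon1_value(displacement):
    """Put the same 4-bit scroll value into both playfield fields."""
    nibble = displacement & 0x0F
    return (nibble << 4) | nibble


if __name__ == "__main__":
    table = make_sine_table()
    print([hex(bplcon1_value(band_displacement(table, 0, band)))
           for band in range(4)])
```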

The copper list for the frame is now complete. To conclude the copper list, we need to mark the end of the list. This move instruction is interpreted as the end of the built copper list:

The Amiga system contains two complex interface adapters (CIA). These adapters are used to communicate with the outside world using input/output devices such as a joystick. Data from those devices can be read from a set of <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node017E.html"> chip registers</a>. The <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node012E.html"> <code>CIAAPRA</code></a> register can be used to monitor whether a joystick button is pressed. On every iteration of the main loop, we check if a joystick (or mouse) button is pressed and, if so, exit the program. The following code checks both bit 6 and bit 7 of the <code>CIAAPRA</code> register and branches to exit if either one signals a pressed button. Bit 6 reflects the button of the joystick (or mouse) in port 1 and bit 7 the button of the joystick (or mouse) in port 2. Note that these bits are active low: a bit reads as 0 while the corresponding button is pressed.
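The button test boils down to looking at two bits. Here is a Python model of it, assuming the active-low convention documented in the hardware manual (a bit reads as 0 while the button is held down). The function name is hypothetical and the register value is passed in as a plain number.

```python
CIAAPRA = 0xBFE001    # address of the register, shown for reference only
FIRE_PORT_1 = 1 << 6  # bit 6: button of the device in port 1
FIRE_PORT_2 = 1 << 7  # bit 7: button of the device in port 2


def any_button_pressed(ciaapra_value):
    """Return True if either fire button bit signals a press.

    The bits are active low, so a cleared bit means "pressed".
    """
    both = FIRE_PORT_1 | FIRE_PORT_2
    return (ciaapra_value & both) != both


if __name__ == "__main__":
    print(any_button_pressed(0xFF), any_button_pressed(0xBF))
```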

We need to wait for vertical blanking period before we can assign our copper list to execution. The vertical blanking time starts at scan line 300. To detect whether vertical blanking is on, we peek values in <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node018A.html"> <code>VPOSR</code></a> – register. We are interested only in vertical position so we mask everything else out (using mask <code>#$1ff00</code> which leaves only the 9 bits we care for).

After this we check if scan line 300 is reached:

We could have bit shifted the value down in register <code>d0</code> and then compared it against <code>#300</code>. Instead we use an assembly constant trick where we shift the <code>#300</code> with the <code>&lt;&lt;8</code> operator. What this says is “give me #300 bit shifted left 8 bits”. The assembler evaluates this at build time, which saves us an instruction but might look a bit confusing at first.
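Both ways of doing the comparison can be written out in Python to show that they are equivalent. This is a sketch of the arithmetic only: the sample register values are made up, and whether the example tests for equality or uses another comparison is an assumption on our part.

```python
VPOS_MASK = 0x1FF00  # keeps the 9 vertical position bits of the long word


def beam_line(vposr_long):
    """Shift the masked value down to get the scan line number."""
    return (vposr_long & VPOS_MASK) >> 8


def reached_line(vposr_long, line=300):
    """Compare against the line number shifted left instead (300 << 8)."""
    return (vposr_long & VPOS_MASK) == (line << 8)


if __name__ == "__main__":
    sample = 0x00012C55  # made-up register contents: line 300, some column
    print(beam_line(sample), reached_line(sample), hex(300 << 8))
```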

Note: Amiga has also support for vertical blank interrupts. However, interrupts are a bit trickier to program, so to keep it simple, we stick with the busy loop implementation.

Once vertical blank period is reached we assign our copper list into execution by assigning the address of the copper list to registers <a href="http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0028.html"> <code>COP1LCH</code> and <code>COP1LCL</code></a>. Notice that by writing long word to location <code>COP1LCH</code> the lower word will leak into <code>COP1LCL</code> and thus set both high and low bits of the copper list. Once that is done we execute the next iteration of the mainloop:

On application exit, we must restore some registers back to defaults (interrupt control, most importantly), load back the old view and copperlist and enable multitasking again.