]> git.proxmox.com Git - mirror_qemu.git/blame - qemu-doc.texi
increased physical RAM limit to 2047 MB in soft MMU mode
[mirror_qemu.git] / qemu-doc.texi
CommitLineData
386405f7
FB
1\input texinfo @c -*- texinfo -*-
2
0806e3f6 3@iftex
322d0c66 4@settitle QEMU CPU Emulator Reference Documentation
386405f7
FB
5@titlepage
6@sp 7
322d0c66 7@center @titlefont{QEMU CPU Emulator Reference Documentation}
386405f7
FB
8@sp 3
9@end titlepage
0806e3f6 10@end iftex
386405f7
FB
11
12@chapter Introduction
13
322d0c66 14@section Features
386405f7 15
1eb20527
FB
16QEMU is a FAST! processor emulator. By using dynamic translation it
17achieves a reasonnable speed while being easy to port on new host
18CPUs.
19
20QEMU has two operating modes:
0806e3f6
FB
21
22@itemize @minus
23
24@item
25User mode emulation. In this mode, QEMU can launch Linux processes
1eb20527
FB
26compiled for one CPU on another CPU. Linux system calls are converted
27because of endianness and 32/64 bit mismatches. The Wine Windows API
28emulator (@url{http://www.winehq.org}) and the DOSEMU DOS emulator
2be3bc02 29(@url{http://www.dosemu.org}) are the main targets for QEMU.
1eb20527 30
0806e3f6
FB
31@item
32Full system emulation. In this mode, QEMU emulates a full
1eb20527
FB
33system, including a processor and various peripherials. Currently, it
34is only used to launch an x86 Linux kernel on an x86 Linux system. It
35enables easier testing and debugging of system code. It can also be
36used to provide virtual hosting of several virtual PCs on a single
37server.
38
39@end itemize
40
41As QEMU requires no host kernel patches to run, it is very safe and
42easy to use.
322d0c66
FB
43
44QEMU generic features:
386405f7
FB
45
46@itemize
47
1eb20527
FB
48@item User space only or full system emulation.
49
50@item Using dynamic translation to native code for reasonnable speed.
386405f7 51
322d0c66 52@item Working on x86 and PowerPC hosts. Being tested on ARM, Sparc32, Alpha and S390.
386405f7 53
1eb20527
FB
54@item Self-modifying code support.
55
d5a0b50c 56@item Precise exceptions support.
386405f7 57
1eb20527
FB
58@item The virtual CPU is a library (@code{libqemu}) which can be used
59in other projects.
60
61@end itemize
62
63QEMU user mode emulation features:
64@itemize
386405f7
FB
65@item Generic Linux system call converter, including most ioctls.
66
67@item clone() emulation using native CPU clone() to use Linux scheduler for threads.
68
322d0c66 69@item Accurate signal handling by remapping host signals to target signals.
1eb20527
FB
70@end itemize
71@end itemize
df0f11a0 72
1eb20527
FB
73QEMU full system emulation features:
74@itemize
285dc330 75@item QEMU can either use a full software MMU for maximum portability or use the host system call mmap() to simulate the target MMU.
322d0c66
FB
76@end itemize
77
78@section x86 emulation
79
80QEMU x86 target features:
81
82@itemize
83
84@item The virtual x86 CPU supports 16 bit and 32 bit addressing with segmentation.
1eb20527 85LDT/GDT and IDT are emulated. VM86 mode is also supported to run DOSEMU.
322d0c66 86
1eb20527 87@item Support of host page sizes bigger than 4KB in user mode emulation.
df0f11a0
FB
88
89@item QEMU can emulate itself on x86.
1eb87257 90
386405f7
FB
91@item An extensive Linux x86 CPU test program is included @file{tests/test-i386}.
92It can be used to test other x86 virtual CPUs.
93
94@end itemize
95
df0f11a0 96Current QEMU limitations:
386405f7
FB
97
98@itemize
99
386405f7
FB
100@item No SSE/MMX support (yet).
101
102@item No x86-64 support.
103
df0f11a0 104@item IPC syscalls are missing.
386405f7
FB
105
106@item The x86 segment limits and access rights are not tested at every
1eb20527 107memory access.
386405f7
FB
108
109@item On non x86 host CPUs, @code{double}s are used instead of the non standard
11010 byte @code{long double}s of x86 for floating point emulation to get
111maximum performances.
112
285dc330 113@item Some priviledged instructions or behaviors are missing, especially for segment protection testing (yet).
1eb20527 114
386405f7
FB
115@end itemize
116
322d0c66
FB
117@section ARM emulation
118
119@itemize
120
121@item ARM emulation can currently launch small programs while using the
122generic dynamic code generation architecture of QEMU.
123
124@item No FPU support (yet).
125
126@item No automatic regression testing (yet).
127
128@end itemize
129
0806e3f6
FB
130@section SPARC emulation
131
132The SPARC emulation is currently in development.
133
5b9f457a
FB
134@chapter Installation
135
136If you want to compile QEMU, please read the @file{README} which gives
137the related information. Otherwise just download the binary
138distribution (@file{qemu-XXX-i386.tar.gz}) and untar it as root in
139@file{/}:
140
141@example
142su
143cd /
144tar zxvf /tmp/qemu-XXX-i386.tar.gz
145@end example
146
d5a0b50c 147@chapter QEMU User space emulator invocation
386405f7 148
d691f669
FB
149@section Quick Start
150
386405f7 151In order to launch a Linux process, QEMU needs the process executable
d691f669
FB
152itself and all the target (x86) dynamic libraries used by it.
153
154@itemize
386405f7 155
d691f669
FB
156@item On x86, you can just try to launch any process by using the native
157libraries:
386405f7
FB
158
159@example
0806e3f6 160qemu-i386 -L / /bin/ls
386405f7
FB
161@end example
162
d691f669
FB
163@code{-L /} tells that the x86 dynamic linker must be searched with a
164@file{/} prefix.
386405f7 165
5b9f457a 166@item Since QEMU is also a linux process, you can launch qemu with qemu (NOTE: you can only do that if you compiled QEMU from the sources):
1eb87257
FB
167
168@example
0806e3f6 169qemu-i386 -L / qemu-i386 -L / /bin/ls
1eb87257 170@end example
386405f7 171
d691f669 172@item On non x86 CPUs, you need first to download at least an x86 glibc
5b9f457a 173(@file{qemu-runtime-i386-XXX-.tar.gz} on the QEMU web page). Ensure that
644c433c
FB
174@code{LD_LIBRARY_PATH} is not set:
175
176@example
177unset LD_LIBRARY_PATH
178@end example
179
180Then you can launch the precompiled @file{ls} x86 executable:
181
d691f669 182@example
285dc330 183qemu-i386 tests/i386/ls
168485b7 184@end example
285dc330 185You can look at @file{qemu-binfmt-conf.sh} so that
168485b7
FB
186QEMU is automatically launched by the Linux kernel when you try to
187launch x86 executables. It requires the @code{binfmt_misc} module in the
188Linux kernel.
189
1eb87257
FB
190@item The x86 version of QEMU is also included. You can try weird things such as:
191@example
0806e3f6 192qemu-i386 /usr/local/qemu-i386/bin/qemu-i386 /usr/local/qemu-i386/bin/ls-i386
1eb87257
FB
193@end example
194
168485b7
FB
195@end itemize
196
df0f11a0 197@section Wine launch
168485b7
FB
198
199@itemize
200
201@item Ensure that you have a working QEMU with the x86 glibc
202distribution (see previous section). In order to verify it, you must be
203able to do:
204
205@example
0806e3f6 206qemu-i386 /usr/local/qemu-i386/bin/ls-i386
168485b7
FB
207@end example
208
fd429f2f 209@item Download the binary x86 Wine install
1eb87257 210(@file{qemu-XXX-i386-wine.tar.gz} on the QEMU web page).
168485b7 211
fd429f2f 212@item Configure Wine on your account. Look at the provided script
168485b7
FB
213@file{/usr/local/qemu-i386/bin/wine-conf.sh}. Your previous
214@code{$@{HOME@}/.wine} directory is saved to @code{$@{HOME@}/.wine.org}.
215
216@item Then you can try the example @file{putty.exe}:
217
218@example
0806e3f6 219qemu-i386 /usr/local/qemu-i386/wine/bin/wine /usr/local/qemu-i386/wine/c/Program\ Files/putty.exe
386405f7 220@end example
d691f669
FB
221
222@end itemize
223
224@section Command line options
225
226@example
0806e3f6 227usage: qemu-i386 [-h] [-d] [-L path] [-s size] program [arguments...]
d691f669
FB
228@end example
229
df0f11a0 230@table @option
d691f669
FB
231@item -h
232Print the help
d691f669
FB
233@item -L path
234Set the x86 elf interpreter prefix (default=/usr/local/qemu-i386)
235@item -s size
236Set the x86 stack size in bytes (default=524288)
237@end table
386405f7 238
df0f11a0
FB
239Debug options:
240
241@table @option
242@item -d
243Activate log (logfile=/tmp/qemu.log)
244@item -p pagesize
245Act as if the host page size was 'pagesize' bytes
246@end table
247
1eb20527
FB
248@chapter QEMU System emulator invocation
249
0806e3f6
FB
250@section Introduction
251
252@c man begin DESCRIPTION
253
254The QEMU System emulator simulates a complete PC. It can either boot
255directly a Linux kernel (without any BIOS or boot loader) or boot like a
256real PC with the included BIOS.
257
258In order to meet specific user needs, two versions of QEMU are
259available:
260
261@enumerate
262
263@item
285dc330 264@code{qemu-fast} uses the host Memory Management Unit (MMU) to simulate
0806e3f6
FB
265the x86 MMU. It is @emph{fast} but has limitations because the whole 4 GB
266address space cannot be used and some memory mapped peripherials
267cannot be emulated accurately yet. Therefore, a specific Linux kernel
268must be used (@xref{linux_compile}).
269
270@item
285dc330
FB
271@code{qemu} uses a software MMU. It is about @emph{two times
272slower} but gives a more accurate emulation.
0806e3f6
FB
273
274@end enumerate
275
276QEMU emulates the following PC peripherials:
277
278@itemize @minus
279@item
280VGA (hardware level, including all non standard modes)
281@item
282PS/2 mouse and keyboard
283@item
181f1558 2842 IDE interfaces with hard disk and CD-ROM support
0806e3f6
FB
285@item
286NE2000 network adapter (port=0x300, irq=9)
287@item
181f1558
FB
288Serial port
289@item
290Soundblaster 16 card
0806e3f6
FB
291@item
292PIC (interrupt controler)
293@item
294PIT (timers)
295@item
296CMOS memory
297@end itemize
298
299@c man end
300
1eb20527
FB
301@section Quick Start
302
285dc330 303Download and uncompress the linux image (@file{linux.img}) and type:
0806e3f6
FB
304
305@example
285dc330 306qemu linux.img
0806e3f6
FB
307@end example
308
309Linux should boot and give you a prompt.
310
311@section Direct Linux Boot and Network emulation
312
313This section explains how to launch a Linux kernel inside QEMU without
314having to make a full bootable image. It is very useful for fast Linux
315kernel testing. The QEMU network configuration is also explained.
1eb20527
FB
316
317@enumerate
318@item
0806e3f6
FB
319Download the archive @file{linux-test-xxx.tar.gz} containing a Linux
320kernel and a disk image.
1eb20527
FB
321
322@item Optional: If you want network support (for example to launch X11 examples), you
0806e3f6 323must copy the script @file{qemu-ifup} in @file{/etc} and configure
1eb20527 324properly @code{sudo} so that the command @code{ifconfig} contained in
0806e3f6 325@file{qemu-ifup} can be executed as root. You must verify that your host
1eb20527
FB
326kernel supports the TUN/TAP network interfaces: the device
327@file{/dev/net/tun} must be present.
328
329When network is enabled, there is a virtual network connection between
330the host kernel and the emulated kernel. The emulated kernel is seen
331from the host kernel at IP address 172.20.0.2 and the host kernel is
332seen from the emulated kernel at IP address 172.20.0.1.
333
0806e3f6 334@item Launch @code{qemu.sh}. You should have the following output:
1eb20527
FB
335
336@example
0806e3f6 337> ./qemu.sh
181f1558
FB
338Connected to host network interface: tun0
339Linux version 2.4.21 (bellard@voyager.localdomain) (gcc version 3.2.2 20030222 (Red Hat Linux 3.2.2-5)) #5 Tue Nov 11 18:18:53 CET 2003
1eb20527 340BIOS-provided physical RAM map:
4690764b
FB
341 BIOS-e801: 0000000000000000 - 000000000009f000 (usable)
342 BIOS-e801: 0000000000100000 - 0000000002000000 (usable)
1eb20527
FB
34332MB LOWMEM available.
344On node 0 totalpages: 8192
345zone(0): 4096 pages.
346zone(1): 4096 pages.
347zone(2): 0 pages.
181f1558 348Kernel command line: root=/dev/hda sb=0x220,5,1,5 ide2=noprobe ide3=noprobe ide4=noprobe ide5=noprobe console=ttyS0
4690764b
FB
349ide_setup: ide2=noprobe
350ide_setup: ide3=noprobe
351ide_setup: ide4=noprobe
352ide_setup: ide5=noprobe
1eb20527 353Initializing CPU#0
181f1558
FB
354Detected 2399.621 MHz processor.
355Console: colour EGA 80x25
356Calibrating delay loop... 4744.80 BogoMIPS
357Memory: 28872k/32768k available (1210k kernel code, 3508k reserved, 266k data, 64k init, 0k highmem)
1eb20527
FB
358Dentry cache hash table entries: 4096 (order: 3, 32768 bytes)
359Inode cache hash table entries: 2048 (order: 2, 16384 bytes)
181f1558 360Mount cache hash table entries: 512 (order: 0, 4096 bytes)
1eb20527
FB
361Buffer-cache hash table entries: 1024 (order: 0, 4096 bytes)
362Page-cache hash table entries: 8192 (order: 3, 32768 bytes)
363CPU: Intel Pentium Pro stepping 03
364Checking 'hlt' instruction... OK.
365POSIX conformance testing by UNIFIX
366Linux NET4.0 for Linux 2.4
367Based upon Swansea University Computer Society NET3.039
368Initializing RT netlink socket
369apm: BIOS not found.
370Starting kswapd
4690764b 371Journalled Block Device driver loaded
181f1558 372Detected PS/2 Mouse Port.
1eb20527
FB
373pty: 256 Unix98 ptys configured
374Serial driver version 5.05c (2001-07-08) with no serial options enabled
375ttyS00 at 0x03f8 (irq = 4) is a 16450
376ne.c:v1.10 9/23/94 Donald Becker (becker@scyld.com)
377Last modified Nov 1, 2000 by Paul Gortmaker
378NE*000 ethercard probe at 0x300: 52 54 00 12 34 56
379eth0: NE2000 found at 0x300, using IRQ 9.
4690764b 380RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
181f1558
FB
381Uniform Multi-Platform E-IDE driver Revision: 7.00beta4-2.4
382ide: Assuming 50MHz system bus speed for PIO modes; override with idebus=xx
383hda: QEMU HARDDISK, ATA DISK drive
384ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
385hda: attached ide-disk driver.
386hda: 20480 sectors (10 MB) w/256KiB Cache, CHS=20/16/63
387Partition check:
388 hda:
389Soundblaster audio driver Copyright (C) by Hannu Savolainen 1993-1996
1eb20527
FB
390NET4: Linux TCP/IP 1.0 for NET4.0
391IP Protocols: ICMP, UDP, TCP, IGMP
392IP: routing cache hash table of 512 buckets, 4Kbytes
4690764b 393TCP: Hash tables configured (established 2048 bind 4096)
1eb20527 394NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
4690764b 395EXT2-fs warning: mounting unchecked fs, running e2fsck is recommended
1eb20527 396VFS: Mounted root (ext2 filesystem).
181f1558
FB
397Freeing unused kernel memory: 64k freed
398
399Linux version 2.4.21 (bellard@voyager.localdomain) (gcc version 3.2.2 20030222 (Red Hat Linux 3.2.2-5)) #5 Tue Nov 11 18:18:53 CET 2003
400
401QEMU Linux test distribution (based on Redhat 9)
402
403Type 'exit' to halt the system
404
405sh-2.05b#
1eb20527
FB
406@end example
407
408@item
409Then you can play with the kernel inside the virtual serial console. You
410can launch @code{ls} for example. Type @key{Ctrl-a h} to have an help
411about the keys you can type inside the virtual serial console. In
d5a0b50c
FB
412particular, use @key{Ctrl-a x} to exit QEMU and use @key{Ctrl-a b} as
413the Magic SysRq key.
1eb20527
FB
414
415@item
416If the network is enabled, launch the script @file{/etc/linuxrc} in the
417emulator (don't forget the leading dot):
418@example
419. /etc/linuxrc
420@end example
421
422Then enable X11 connections on your PC from the emulated Linux:
423@example
424xhost +172.20.0.2
425@end example
426
427You can now launch @file{xterm} or @file{xlogo} and verify that you have
428a real Virtual Linux system !
429
430@end enumerate
431
d5a0b50c
FB
432NOTES:
433@enumerate
434@item
0806e3f6
FB
435A 2.5.74 kernel is also included in the archive. Just
436replace the bzImage in qemu.sh to try it.
d5a0b50c
FB
437
438@item
9d4520d0 439qemu creates a temporary file in @var{$QEMU_TMPDIR} (@file{/tmp} is the
d5a0b50c
FB
440default) containing all the simulated PC memory. If possible, try to use
441a temporary directory using the tmpfs filesystem to avoid too many
442unnecessary disk accesses.
443
444@item
9d4520d0
FB
445In order to exit cleanly from qemu, you can do a @emph{shutdown} inside
446qemu. qemu will automatically exit when the Linux shutdown is done.
4690764b
FB
447
448@item
449You can boot slightly faster by disabling the probe of non present IDE
450interfaces. To do so, add the following options on the kernel command
451line:
452@example
453ide1=noprobe ide2=noprobe ide3=noprobe ide4=noprobe ide5=noprobe
454@end example
455
456@item
457The example disk image is a modified version of the one made by Kevin
1eb20527
FB
458Lawton for the plex86 Project (@url{www.plex86.org}).
459
d5a0b50c
FB
460@end enumerate
461
ec410fc9
FB
462@section Invocation
463
464@example
0806e3f6
FB
465@c man begin SYNOPSIS
466usage: qemu [options] [disk_image]
467@c man end
ec410fc9
FB
468@end example
469
0806e3f6 470@c man begin OPTIONS
9d4520d0 471@var{disk_image} is a raw hard disk image for IDE hard disk 0.
ec410fc9
FB
472
473General options:
474@table @option
2be3bc02
FB
475@item -fda file
476@item -fdb file
477Use @var{file} as floppy disk 0/1 image (@xref{disk_images}).
478
ec410fc9
FB
479@item -hda file
480@item -hdb file
181f1558
FB
481@item -hdc file
482@item -hdd file
2be3bc02 483Use @var{file} as hard disk 0, 1, 2 or 3 image (@xref{disk_images}).
1f47a922 484
181f1558
FB
485@item -cdrom file
486Use @var{file} as CD-ROM image (you cannot use @option{-hdc} and and
487@option{-cdrom} at the same time).
488
2be3bc02
FB
489@item -boot [a|b|c|d]
490Boot on floppy (a, b), hard disk (c) or CD-ROM (d). Hard disk boot is
491the default.
1f47a922 492
181f1558 493@item -snapshot
1f47a922
FB
494Write to temporary files instead of disk image files. In this case,
495the raw disk image you use is not written back. You can however force
496the write back by pressing @key{C-a s} (@xref{disk_images}).
ec410fc9
FB
497
498@item -m megs
499Set virtual RAM size to @var{megs} megabytes.
500
501@item -n script
9d4520d0 502Set network init script [default=/etc/qemu-ifup]. This script is
ec410fc9
FB
503launched to configure the host network interface (usually tun0)
504corresponding to the virtual NE2000 card.
4690764b
FB
505
506@item -initrd file
0806e3f6
FB
507Use @var{file} as initial ram disk.
508
509@item -tun-fd fd
510Assumes @var{fd} talks to tap/tun and use it. Read
511@url{http://bellard.org/qemu/tetrinet.html} to have an example of its
512use.
513
514@item -nographic
515
516Normally, QEMU uses SDL to display the VGA output. With this option,
517you can totally disable graphical output so that QEMU is a simple
518command line application. The emulated serial port is redirected on
519the console. Therefore, you can still use QEMU to debug a Linux kernel
520with a serial console.
521
522@end table
523
524Linux boot specific (does not require a full PC boot with a BIOS):
525@table @option
526
527@item -kernel bzImage
528Use @var{bzImage} as kernel image.
529
530@item -append cmdline
531Use @var{cmdline} as kernel command line
532
533@item -initrd file
534Use @var{file} as initial ram disk.
535
ec410fc9
FB
536@end table
537
538Debug options:
539@table @option
540@item -s
0806e3f6 541Wait gdb connection to port 1234 (@xref{gdb_usage}).
ec410fc9
FB
542@item -p port
543Change gdb connection port.
544@item -d
9d4520d0 545Output log in /tmp/qemu.log
ec410fc9
FB
546@end table
547
548During emulation, use @key{C-a h} to get terminal commands:
549
550@table @key
551@item C-a h
552Print this help
553@item C-a x
554Exit emulatior
1f47a922
FB
555@item C-a s
556Save disk data back to file (if -snapshot)
557@item C-a b
ec410fc9 558Send break (magic sysrq)
1f47a922 559@item C-a C-a
ec410fc9
FB
560Send C-a
561@end table
0806e3f6
FB
562@c man end
563
564@ignore
565
566@setfilename qemu
567@settitle QEMU System Emulator
568
569@c man begin SEEALSO
570The HTML documentation of QEMU for more precise information and Linux
571user mode emulator invocation.
572@c man end
573
574@c man begin AUTHOR
575Fabrice Bellard
576@c man end
577
578@end ignore
ec410fc9 579
0806e3f6 580@end ignore
1f47a922
FB
581@node disk_images
582@section Disk Images
583
584@subsection Raw disk images
585
586The disk images can simply be raw images of the hard disk. You can
587create them with the command:
588@example
589dd if=/dev/zero of=myimage bs=1024 count=mysize
590@end example
591where @var{myimage} is the image filename and @var{mysize} is its size
592in kilobytes.
593
594@subsection Snapshot mode
595
596If you use the option @option{-snapshot}, all disk images are
597considered as read only. When sectors in written, they are written in
598a temporary file created in @file{/tmp}. You can however force the
599write back to the raw disk images by pressing @key{C-a s}.
600
601NOTE: The snapshot mode only works with raw disk images.
602
603@subsection Copy On Write disk images
604
605QEMU also supports user mode Linux
606(@url{http://user-mode-linux.sourceforge.net/}) Copy On Write (COW)
607disk images. The COW disk images are much smaller than normal images
608as they store only modified sectors. They also permit the use of the
609same disk image template for many users.
610
611To create a COW disk images, use the command:
612
613@example
0806e3f6 614qemu-mkcow -f myrawimage.bin mycowimage.cow
1f47a922
FB
615@end example
616
617@file{myrawimage.bin} is a raw image you want to use as original disk
618image. It will never be written to.
619
620@file{mycowimage.cow} is the COW disk image which is created by
0806e3f6 621@code{qemu-mkcow}. You can use it directly with the @option{-hdx}
1f47a922
FB
622options. You must not modify the original raw disk image if you use
623COW images, as COW images only store the modified sectors from the raw
624disk image. QEMU stores the original raw disk image name and its
625modified time in the COW disk image so that chances of mistakes are
626reduced.
627
9d0fe224
FB
628If the raw disk image is not read-only, by pressing @key{C-a s} you
629can flush the COW disk image back into the raw disk image, as in
630snapshot mode.
1f47a922
FB
631
632COW disk images can also be created without a corresponding raw disk
633image. It is useful to have a big initial virtual disk image without
634using much disk space. Use:
635
636@example
0806e3f6 637qemu-mkcow mycowimage.cow 1024
1f47a922
FB
638@end example
639
640to create a 1 gigabyte empty COW disk image.
641
642NOTES:
643@enumerate
644@item
645COW disk images must be created on file systems supporting
646@emph{holes} such as ext2 or ext3.
647@item
648Since holes are used, the displayed size of the COW disk image is not
649the real one. To know it, use the @code{ls -ls} command.
650@end enumerate
651
0806e3f6 652@node linux_compile
4690764b
FB
653@section Linux Kernel Compilation
654
285dc330
FB
655You can use any linux kernel with QEMU. However, if you want to use
656@code{qemu-fast} to get maximum performances, you should make the
657following changes to the Linux kernel (only 2.4.x and 2.5.x were
658tested):
1eb20527 659
4690764b
FB
660@enumerate
661@item
662The kernel must be mapped at 0x90000000 (the default is
6630xc0000000). You must modify only two lines in the kernel source:
1eb20527 664
4690764b 665In @file{include/asm/page.h}, replace
1eb20527
FB
666@example
667#define __PAGE_OFFSET (0xc0000000)
668@end example
669by
670@example
671#define __PAGE_OFFSET (0x90000000)
672@end example
673
4690764b 674And in @file{arch/i386/vmlinux.lds}, replace
1eb20527
FB
675@example
676 . = 0xc0000000 + 0x100000;
677@end example
678by
679@example
680 . = 0x90000000 + 0x100000;
681@end example
682
4690764b
FB
683@item
684If you want to enable SMP (Symmetric Multi-Processing) support, you
685must make the following change in @file{include/asm/fixmap.h}. Replace
1eb20527 686@example
4690764b 687#define FIXADDR_TOP (0xffffX000UL)
1eb20527 688@end example
4690764b
FB
689by
690@example
691#define FIXADDR_TOP (0xa7ffX000UL)
692@end example
693(X is 'e' or 'f' depending on the kernel version). Although you can
694use an SMP kernel with QEMU, it only supports one CPU.
1eb20527 695
4690764b 696@item
d5a0b50c
FB
697If you are not using a 2.5 kernel as host kernel but if you use a target
6982.5 kernel, you must also ensure that the 'HZ' define is set to 100
699(1000 is the default) as QEMU cannot currently emulate timers at
700frequencies greater than 100 Hz on host Linux systems < 2.5. In
4690764b 701@file{include/asm/param.h}, replace:
d5a0b50c
FB
702
703@example
704# define HZ 1000 /* Internal kernel timer frequency */
705@end example
706by
707@example
708# define HZ 100 /* Internal kernel timer frequency */
709@end example
710
4690764b
FB
711@end enumerate
712
713The file config-2.x.x gives the configuration of the example kernels.
714
715Just type
716@example
717make bzImage
718@end example
719
720As you would do to make a real kernel. Then you can use with QEMU
721exactly the same kernel as you would boot on your PC (in
722@file{arch/i386/boot/bzImage}).
da415d54 723
0806e3f6 724@node gdb_usage
da415d54
FB
725@section GDB usage
726
727QEMU has a primitive support to work with gdb, so that you can do
0806e3f6 728'Ctrl-C' while the virtual machine is running and inspect its state.
da415d54 729
9d4520d0 730In order to use gdb, launch qemu with the '-s' option. It will wait for a
da415d54
FB
731gdb connection:
732@example
9d4520d0 733> qemu -s arch/i386/boot/bzImage -hda root-2.4.20.img root=/dev/hda
da415d54
FB
734Connected to host network interface: tun0
735Waiting gdb connection on port 1234
736@end example
737
738Then launch gdb on the 'vmlinux' executable:
739@example
740> gdb vmlinux
741@end example
742
743In gdb, connect to QEMU:
744@example
745(gdb) target remote locahost:1234
746@end example
747
748Then you can use gdb normally. For example, type 'c' to launch the kernel:
749@example
750(gdb) c
751@end example
752
0806e3f6
FB
753Here are some useful tips in order to use gdb on system code:
754
755@enumerate
756@item
757Use @code{info reg} to display all the CPU registers.
758@item
759Use @code{x/10i $eip} to display the code at the PC position.
760@item
761Use @code{set architecture i8086} to dump 16 bit code. Then use
762@code{x/10i $cs*16+*eip} to dump the code at the PC position.
763@end enumerate
764
386405f7
FB
765@chapter QEMU Internals
766
767@section QEMU compared to other emulators
768
1eb20527
FB
769Like bochs [3], QEMU emulates an x86 CPU. But QEMU is much faster than
770bochs as it uses dynamic compilation and because it uses the host MMU to
771simulate the x86 MMU. The downside is that currently the emulation is
772not as accurate as bochs (for example, you cannot currently run Windows
773inside QEMU).
386405f7
FB
774
775Like Valgrind [2], QEMU does user space emulation and dynamic
776translation. Valgrind is mainly a memory debugger while QEMU has no
1eb20527
FB
777support for it (QEMU could be used to detect out of bound memory
778accesses as Valgrind, but it has no support to track uninitialised data
d5a0b50c 779as Valgrind does). The Valgrind dynamic translator generates better code
1eb20527 780than QEMU (in particular it does register allocation) but it is closely
d5a0b50c 781tied to an x86 host and target and has no support for precise exceptions
1eb20527
FB
782and system emulation.
783
784EM86 [4] is the closest project to user space QEMU (and QEMU still uses
785some of its code, in particular the ELF file loader). EM86 was limited
786to an alpha host and used a proprietary and slow interpreter (the
787interpreter part of the FX!32 Digital Win32 code translator [5]).
386405f7 788
fd429f2f
FB
789TWIN [6] is a Windows API emulator like Wine. It is less accurate than
790Wine but includes a protected mode x86 interpreter to launch x86 Windows
791executables. Such an approach as greater potential because most of the
792Windows API is executed natively but it is far more difficult to develop
793because all the data structures and function parameters exchanged
794between the API and the x86 code must be converted.
795
1eb20527
FB
796User mode Linux [7] was the only solution before QEMU to launch a Linux
797kernel as a process while not needing any host kernel patches. However,
798user mode Linux requires heavy kernel patches while QEMU accepts
799unpatched Linux kernels. It would be interesting to compare the
800performance of the two approaches.
801
802The new Plex86 [8] PC virtualizer is done in the same spirit as the QEMU
803system emulator. It requires a patched Linux kernel to work (you cannot
804launch the same kernel on your PC), but the patches are really small. As
805it is a PC virtualizer (no emulation is done except for some priveledged
806instructions), it has the potential of being faster than QEMU. The
d5a0b50c
FB
807downside is that a complicated (and potentially unsafe) host kernel
808patch is needed.
1eb20527 809
386405f7
FB
810@section Portable dynamic translation
811
812QEMU is a dynamic translator. When it first encounters a piece of code,
813it converts it to the host instruction set. Usually dynamic translators
322d0c66 814are very complicated and highly CPU dependent. QEMU uses some tricks
386405f7
FB
815which make it relatively easily portable and simple while achieving good
816performances.
817
818The basic idea is to split every x86 instruction into fewer simpler
819instructions. Each simple instruction is implemented by a piece of C
820code (see @file{op-i386.c}). Then a compile time tool (@file{dyngen})
821takes the corresponding object file (@file{op-i386.o}) to generate a
822dynamic code generator which concatenates the simple instructions to
823build a function (see @file{op-i386.h:dyngen_code()}).
824
825In essence, the process is similar to [1], but more work is done at
826compile time.
827
828A key idea to get optimal performances is that constant parameters can
829be passed to the simple operations. For that purpose, dummy ELF
830relocations are generated with gcc for each constant parameter. Then,
831the tool (@file{dyngen}) can locate the relocations and generate the
832appriopriate C code to resolve them when building the dynamic code.
833
834That way, QEMU is no more difficult to port than a dynamic linker.
835
836To go even faster, GCC static register variables are used to keep the
837state of the virtual CPU.
838
839@section Register allocation
840
841Since QEMU uses fixed simple instructions, no efficient register
842allocation can be done. However, because RISC CPUs have a lot of
843register, most of the virtual CPU state can be put in registers without
844doing complicated register allocation.
845
846@section Condition code optimisations
847
848Good CPU condition codes emulation (@code{EFLAGS} register on x86) is a
849critical point to get good performances. QEMU uses lazy condition code
850evaluation: instead of computing the condition codes after each x86
fd429f2f 851instruction, it just stores one operand (called @code{CC_SRC}), the
386405f7
FB
852result (called @code{CC_DST}) and the type of operation (called
853@code{CC_OP}).
854
855@code{CC_OP} is almost never explicitely set in the generated code
856because it is known at translation time.
857
858In order to increase performances, a backward pass is performed on the
859generated simple instructions (see
860@code{translate-i386.c:optimize_flags()}). When it can be proved that
861the condition codes are not needed by the next instructions, no
862condition codes are computed at all.
863
fd429f2f 864@section CPU state optimisations
386405f7
FB
865
866The x86 CPU has many internal states which change the way it evaluates
867instructions. In order to achieve a good speed, the translation phase
868considers that some state information of the virtual x86 CPU cannot
869change in it. For example, if the SS, DS and ES segments have a zero
870base, then the translator does not even generate an addition for the
871segment base.
872
873[The FPU stack pointer register is not handled that way yet].
874
875@section Translation cache
876
877A 2MByte cache holds the most recently used translations. For
878simplicity, it is completely flushed when it is full. A translation unit
879contains just a single basic block (a block of x86 instructions
880terminated by a jump or by a virtual CPU state change which the
881translator cannot deduce statically).
882
df0f11a0
FB
883@section Direct block chaining
884
885After each translated basic block is executed, QEMU uses the simulated
886Program Counter (PC) and other cpu state informations (such as the CS
887segment base value) to find the next basic block.
888
889In order to accelerate the most common cases where the new simulated PC
890is known, QEMU can patch a basic block so that it jumps directly to the
891next one.
892
893The most portable code uses an indirect jump. An indirect jump makes it
894easier to make the jump target modification atomic. On some
895architectures (such as PowerPC), the @code{JUMP} opcode is directly
896patched so that the block chaining has no overhead.
897
898@section Self-modifying code and translated code invalidation
899
900Self-modifying code is a special challenge in x86 emulation because no
901instruction cache invalidation is signaled by the application when code
902is modified.
903
904When translated code is generated for a basic block, the corresponding
905host page is write protected if it is not already read-only (with the
906system call @code{mprotect()}). Then, if a write access is done to the
907page, Linux raises a SEGV signal. QEMU then invalidates all the
908translated code in the page and enables write accesses to the page.
909
910Correct translated code invalidation is done efficiently by maintaining
911a linked list of every translated block contained in a given page. Other
912linked lists are also maintained to undo direct block chaining.
913
4690764b 914Although the overhead of doing @code{mprotect()} calls is important,
df0f11a0
FB
915most MSDOS programs can be emulated at reasonnable speed with QEMU and
916DOSEMU.
917
918Note that QEMU also invalidates pages of translated code when it detects
919that memory mappings are modified with @code{mmap()} or @code{munmap()}.
386405f7
FB
920
921@section Exception support
922
923longjmp() is used when an exception such as division by zero is
df0f11a0 924encountered.
386405f7 925
df0f11a0
FB
926The host SIGSEGV and SIGBUS signal handlers are used to get invalid
927memory accesses. The exact CPU state can be retrieved because all the
928x86 registers are stored in fixed host registers. The simulated program
929counter is found by retranslating the corresponding basic block and by
930looking where the host program counter was at the exception point.
931
932The virtual CPU cannot retrieve the exact @code{EFLAGS} register because
933in some cases it is not computed because of condition code
934optimisations. It is not a big concern because the emulated code can
935still be restarted in any cases.
386405f7
FB
936
937@section Linux system call translation
938
939QEMU includes a generic system call translator for Linux. It means that
940the parameters of the system calls can be converted to fix the
941endianness and 32/64 bit issues. The IOCTLs are converted with a generic
942type description system (see @file{ioctls.h} and @file{thunk.c}).
943
df0f11a0
FB
944QEMU supports host CPUs which have pages bigger than 4KB. It records all
945the mappings the process does and try to emulated the @code{mmap()}
946system calls in cases where the host @code{mmap()} call would fail
947because of bad page alignment.
948
386405f7
FB
949@section Linux signals
950
951Normal and real-time signals are queued along with their information
952(@code{siginfo_t}) as it is done in the Linux kernel. Then an interrupt
953request is done to the virtual CPU. When it is interrupted, one queued
954signal is handled by generating a stack frame in the virtual CPU as the
955Linux kernel does. The @code{sigreturn()} system call is emulated to return
956from the virtual signal handler.
957
958Some signals (such as SIGALRM) directly come from the host. Other
959signals are synthetized from the virtual CPU exceptions such as SIGFPE
960when a division by zero is done (see @code{main.c:cpu_loop()}).
961
962The blocked signal mask is still handled by the host Linux kernel so
963that most signal system calls can be redirected directly to the host
964Linux kernel. Only the @code{sigaction()} and @code{sigreturn()} system
965calls need to be fully emulated (see @file{signal.c}).
966
967@section clone() system call and threads
968
969The Linux clone() system call is usually used to create a thread. QEMU
970uses the host clone() system call so that real host threads are created
971for each emulated thread. One virtual CPU instance is created for each
972thread.
973
974The virtual x86 CPU atomic operations are emulated with a global lock so
975that their semantic is preserved.
976
df0f11a0
FB
977Note that currently there are still some locking issues in QEMU. In
978particular, the translated cache flush is not protected yet against
979reentrancy.
980
1eb87257
FB
981@section Self-virtualization
982
4690764b 983QEMU was conceived so that ultimately it can emulate itself. Although
1eb87257
FB
984it is not very useful, it is an important test to show the power of the
985emulator.
986
987Achieving self-virtualization is not easy because there may be address
6cd9f35b
FB
988space conflicts. QEMU solves this problem by being an executable ELF
989shared object as the ld-linux.so ELF interpreter. That way, it can be
990relocated at load time.
1eb87257 991
1eb20527
FB
992@section MMU emulation
993
994For system emulation, QEMU uses the mmap() system call to emulate the
995target CPU MMU. It works as long the emulated OS does not use an area
996reserved by the host OS (such as the area above 0xc0000000 on x86
997Linux).
998
999It is planned to add a slower but more precise MMU emulation
1000with a software MMU.
1001
386405f7
FB
1002@section Bibliography
1003
1004@table @asis
1005
1006@item [1]
1007@url{http://citeseer.nj.nec.com/piumarta98optimizing.html}, Optimizing
1008direct threaded code by selective inlining (1998) by Ian Piumarta, Fabio
1009Riccardi.
1010
1011@item [2]
1012@url{http://developer.kde.org/~sewardj/}, Valgrind, an open-source
1013memory debugger for x86-GNU/Linux, by Julian Seward.
1014
1015@item [3]
1016@url{http://bochs.sourceforge.net/}, the Bochs IA-32 Emulator Project,
1017by Kevin Lawton et al.
1018
1019@item [4]
1020@url{http://www.cs.rose-hulman.edu/~donaldlf/em86/index.html}, the EM86
1021x86 emulator on Alpha-Linux.
1022
1023@item [5]
1024@url{http://www.usenix.org/publications/library/proceedings/usenix-nt97/full_papers/chernoff/chernoff.pdf},
1025DIGITAL FX!32: Running 32-Bit x86 Applications on Alpha NT, by Anton
1026Chernoff and Ray Hookway.
1027
fd429f2f
FB
1028@item [6]
1029@url{http://www.willows.com/}, Windows API library emulation from
1030Willows Software.
1031
1eb20527
FB
1032@item [7]
1033@url{http://user-mode-linux.sourceforge.net/},
1034The User-mode Linux Kernel.
1035
1036@item [8]
1037@url{http://www.plex86.org/},
1038The new Plex86 project.
1039
386405f7
FB
1040@end table
1041
1042@chapter Regression Tests
1043
322d0c66 1044In the directory @file{tests/}, various interesting testing programs
386405f7
FB
1045are available. There are used for regression testing.
1046
386405f7
FB
1047@section @file{test-i386}
1048
1049This program executes most of the 16 bit and 32 bit x86 instructions and
1050generates a text output. It can be compared with the output obtained with
1051a real CPU or another emulator. The target @code{make test} runs this
1052program and a @code{diff} on the generated output.
1053
1054The Linux system call @code{modify_ldt()} is used to create x86 selectors
1055to test some 16 bit addressing and 32 bit with segmentation cases.
1056
df0f11a0 1057The Linux system call @code{vm86()} is used to test vm86 emulation.
386405f7 1058
df0f11a0
FB
1059Various exceptions are raised to test most of the x86 user space
1060exception reporting.
386405f7 1061
285dc330
FB
1062@section @file{linux-test}
1063
1064This program tests various Linux system calls. It is used to verify
1065that the system call parameters are correctly converted between target
1066and host CPUs.
1067
1068@section @file{hello-i386}
1069
1070Very simple statically linked x86 program, just to test QEMU during a
1071port to a new host CPU.
1072
1073@section @file{hello-arm}
1074
1075Very simple statically linked ARM program, just to test QEMU during a
1076port to a new host CPU.
1077
386405f7
FB
1078@section @file{sha1}
1079
1080It is a simple benchmark. Care must be taken to interpret the results
1081because it mostly tests the ability of the virtual CPU to optimize the
1082@code{rol} x86 instruction and the condition code computations.
1083