Changes between Version 3 and Version 4 of PXEInstall


Ignore:
Timestamp:
02/23/12 13:06:04 (13 years ago)
Author:
jdreed
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • PXEInstall

    v3 v4  
    22 
    33This page covers PXE installation.  The actual [wiki:Installation Debathena Installer] is covered in a [wiki:Installation separate page]. 
     4 
     5== Infrastructure == 
     6 
     7 * The PXE server 
     8 * demeter.mit.edu (aka athena10.mit.edu): 18.9.60.73. '''This IP is hardcoded in the following places''': 
     9  * the preseed files 
     10  * the stage1 installer (for doing low-rent DNS queries) 
     11  * the actual installer for each suite (for fetching install-debathena.sh) 
     12  * TODO: probably other places, but in theory nothing outside of `trunk/athena/debathena/scripts/installer/pxe` 
    413 
    514== The PXE Server == 
     
    1221Notes: 
    1322 * We have to specify a hostname.  It will get overwritten by the DHCP config, but if DHCP barfs (which it will) or if it gets given an address with no PTR record (which it will), it needs a value there or the Debian installer fails miserably. 
     23 * This should continue to be i386 for the foreseeable future. 
     24 * The `current` directory in the url is a symlink, which is pointed at `stage1`.  You can repoint the symlink for testing, since we can't get a separate "Testing" entry on the PXE server. 
    1425 
     26== Preseeding and include_command == 
     27 
     28The preseed file generally sets a single `include_command` which runs as early in the install as possible.  The preseed command wget's the "debathena-loader.sh" file and runs it.  debathena-loader's job is to wget the installer tarball, extract it, and run the "installer.sh" file in there.  Note that in include_command, the shell is not connected to a terminal, which is why we explicitly chvt and run the script with I/O redirection.  The contents of debathena-loader.sh could all go in the single line in the initial preseed file, but it'll be ugly and we're more likely to screw up.  Also, we have no idea what the max size of an include_command is. 
     29 
     30== Stage 1 == 
     31 
     32The purpose of the Stage 1 installer is to interact with the user -- the second stage should be unattended.  At this time, we do things like ask the user what suite and metapackage to install, talk about partitioning, get network configuration, etc.    
     33 
     34First, we set some initial values:  mirrors.mit.edu as the mirror, the newest cluster release as the distro, auto partitioning, and arch of i386, unless we find the `lm` flag in /proc/cpuinfo, in which case we use amd64. 
     35 
     36The user gets a menu, and they can choose between debathena-cluster, "normal Debathena" (i.e. not -cluster), or plain vanilla Ubuntu.  (The latter is provided for convenience only).  Also, they can get plain old /bin/sh and BusyBox (and maybe anna or maybe not).  They can also select a different mirror, use the beta installer, change the distro/version/suite, change the architecture, and toggle between manual and automatic partitioning for the cluster install. 
     37 
     38Once the user has decided what they want, we move on to network configuration.  If the user has a "real" DHCP address (i.e. not 18.2 or 10.whatever), they can elect to use that and not configure network, or they can move on to static IP configuration.  (People selecting -cluster are forced to use a static IP).  Network configuration: 
     39 
     40 # Check if we can get to our low-rent DNS resolver.  If so, we can accept a hostname instead of an IP. 
     41 # Prompt the user for an IP (or hostname if DNS is available). 
     42 # If we get a hostname, look up the A record.  If there isn't one, demand an IP. 
     43 # If we get an IP, look up the PTR record, and set the target system's hostname to the value.  Otherwise, set it to `install-target-host`. 
     44 # Given the IP, use ~~magic~~ the `netparams` script and `masks` file to get the netmask, broadcast, gateway, and CIDR notation.  If we're not on cluster, give the user a chance to edit them by hand. 
     45 # Kill `dhclient` if it's running, and flush eth0.  Set the IP address, route, and other info.  Set the MIT nameservers and clobber any existing values in resolv.conf 
     46 
     47Not that we're all set, we retrieve the kexec binary, and the arch-specific net-install initrd and kernel from debathena.mit.edu/net-install.  And we set up the kernel args to pass: 
     48 * networking information (`netcfg/*`) 
     49 * keyboard layout, otherwise the user will get prompted in stage 2 (`keyboard-configuration/layoutcode` for Natty and later, `console-setup/layoutcode` for earlier releases) 
     50 * Any hardware-specific args, such as `reboot=pci` 
     51 * locale (currently forced to en_US) 
     52 * `url=http://18.9.60.73/installer/$distro/debathena.preseed` 
     53 * The user's selections for installation type, mirror, beta vs production, and partitioning type (auto vs manual).  These are prefixed with (`da/*`). 
     54  
     55We then sanity-check things (we're going to exceed 512 bytes eventually, I know it) and then kexec out of there. 
     56 
     57If we're still around after the kexec, horrible things happened, and we tell the user this. 
     58 
     59Notes: 
     60 * The initrd doesn't have `date(1)` (yes, really), so we use netcat to talk to kerberos.mit.edu (with a hardcoded IP) on the daytime port for datestamps.  This is a ~~awesome~~ terrible idea. 
     61 * A lot of assumptions are made about the way MIT's network is laid out.  If you manage to get to Stage 1 through a method other than PXE on a wired interface, terrible things will happen.  
     62 * You should be able to do useful things running the stage1 installer with `--test`. 
     63 * If your preferred network device is something other than eth0, you lose. 
     64 * If you want to install to something other than the first IDE/SATA disk, you lose. 
     65 * The stage1 installer.sh script must never exit (it kexec's out, or loops forever with a shell).  If it does exit, the install will fall through to whatever the PXE server initially gave us, which is undoubtedly wrong.