Using gacutil.exe and Reflection to generate the Exceptions Cheat Sheet

The most recent addition to my Cheat Sheets features details of all the Exceptions that can be found in the most commonly used assemblies within ASP.NET development. Compiling this information was an interesting challenge. I could have simply copy-pasted from MSDN, but that would have been extremely tedious. Instead, I ended up with a blend of Linq to XML, Reflection, a dash of Regex and the Global Assembly Cache Tool - gacutil.exe. Here's the full story.

Assemblies in the .NET Framework are discrete deployments of compiled code. Although Assemblies can contain more than one module, each of the .NET Framework assemblies contain just one. Among other things, each assembly contains Types. For the purposes of this exercise, I am only interested in types which are of type Exception. This is the base class from which all other Exceptions derive.

Locally, .NET Assemblies are stored in the Global Assembly Cache (GAC). However, I am not interested in obscure exception classes in assemblies that Web Forms developers won't be using, such as AudioException (found in System.Speech.dll). So I need a way to list all assemblies in the GAC and then select just those I am interested in. Second, I need to identify all objects of type Exception in them, and third, I need to generate some documentation that describes exactly what those exceptions mean, or what causes them for the cheat sheet to have any real value.

Gacutil.exe is a command line tool that comes as part of the .NET Framework SDK. One of the things this tool can do is to list all assemblies registered in the GAC. It will actually list them using their fully qualified name, such as System.Data, Version=2.0.0.0, Culture=neutral, PublicKeyToken=b77a5c561934e08. This is very useful, as it is this that will be needed later for when Reflection is used on the assembly.

The following code generates a List of AssemblyName objects. The Gacutil tool is located and fired up using a Process instance.  The Argument switch (a letter "l") tells the tool to simply list items in the GAC. The list is streamed as output which is then read line by line. If a line meets the criteria (it has a version, is mscorlib or begins with System) it is added as an Assembly obect to the collection. The reason for the filtering criteria is that my GAC is full to the brim with all sort of assemblies belonging to control sets I have installed among other things. I don't want a never-ending list.


static List<AssemblyName> AssemblyList()
{
  var assemblyList = new List<AssemblyName>();
  const string utilPath = @"C:\Program Files\Microsoft SDKs\Windows\v6.0A\bin\gacutil.exe";
  using (var process = new Process
                      {
                        StartInfo =
                          {
                            FileName = utilPath,
                            RedirectStandardOutput = true,
                            UseShellExecute = false,
                            CreateNoWindow = true,
                            Arguments = "/l"
                          }
                      })
  {
    process.Start();
    using (var stream = process.StandardOutput)
    {
      while (!stream.EndOfStream)
      {
        var output = stream.ReadLine();
        if (output.Length > 0)
        {
          AssemblyName an;
          try
          {
            an = new AssemblyName(output);
            if (an.Version != null && (an.Name.StartsWith("mscorlib") 
                                   || an.Name.StartsWith("System")))
            {
              assemblyList.Add(an);
            }
          }
          catch (Exception ex)
          {
            an = null;
          }
        }
      }
    }
  }
  return assemblyList;
}

In this particular case I simply bound an instance of AssemblyList to a CheckBoxList control in a web form:


Assemblies.DataSource = AssemblyList();
Assemblies.DataBind();

From here, I can select the assemblies I want to interrogate. When the page is posted back, I can iterate the CheckBoxList items to see which ones were selected, and add their value to a List<string> (which I call assembyList). This collection is then passed into another method:


List<ExceptionInfo> ListExceptions(IEnumerable<string> assemblyList)
{
  var exInfo = new List<ExceptionInfo>();
  var assemblies = assemblyList.Select(assemblyName => Assembly.Load(assemblyName));
  foreach (var assembly in assemblies)
  {
    var version = assembly.GetName().Version;
    var module = assembly.GetModules().FirstOrDefault();
    XDocument doc = GetDocForModule(module, version);
    var types = module.GetTypes().Where(t => t.IsSubclassOf(typeof(Exception))).ToList();
    foreach (var t in types)
    {
      var data = new ExceptionInfo
                   {
                     AssemblyName = module.Name.Replace(".dll", ""),
                     ExceptionName = t.Name,
                     ExceptionDescription = GetExceptionDescription(t, doc)
                   };
      exInfo.Add(data);
    }
  }
  return exInfo.OrderBy(e => e.AssemblyName).ThenBy(e => e.ExceptionName).ToList();
}

 

This method returns a List<ExceptionInfo>, which is a collection of a simple custom object:


class ExceptionInfo
{
  public string AssemblyName { get; set; }
  public string ExceptionName { get; set; }
  public string ExceptionDescription { get; set; }
}

Each assembly is loaded in turn, and its module identified. The GetDocForModule() method attempts to locate and load an XML document:


static XDocument GetDocForModule(Module module, Version version)
{
  var file = String.Empty;
  if (version.Major == 2)
  {

    file = string.Format(@"C:\WINDOWS\Microsoft.NET\Framework\v2.0.50727\en\{0}",
                              module.Name.Replace("dll", "xml"));
  }
  if (version.Major == 3 && version.Minor == 0)
  {
    file = string.Format(@"C:\Program Files\Reference Assemblies\Microsoft\Framework\v3.0\en\{0}",
                              module.Name.Replace("dll", "xml"));
  }
  if (version.Major == 3 && version.Minor == 5)
  {
    file = string.Format(@"C:\Program Files\Reference Assemblies\Microsoft\Framework\v3.5\{0}",
                              module.Name.Replace("dll", "xml"));
  }
  if (file != String.Empty)
  {
    using (var reader = new XmlTextReader(file))
    {
      XDocument doc;
      try
      {
        doc = XDocument.Load(reader);
      }
      catch (IOException ex)
      {
        return null;
      }
      return doc;
    }
  }
  return null;
}

Most Base Class Library assemblies in the GAC have a matching XML document which contains the xml comments from the source code stripped out and separated. This includes the <summary> information which is used to describe the class, method, event, type etc:

  

<member name="T:System.Data.Linq.ChangeConflictException">
  <summary>
      Thrown when an update fails because database values have been updated since the client last read them.
  </summary>
</member>

Locating these xml documents in the first place was a pain. The actual location depends on the .NET Framework version that the dll belongs to, or was last updated by. They are splattered all over the place. The default locations for versions 2.0, 3.0 and 3.5 can be seen in the file paths in the code above, but if you have MVC installed, or the Chart controls etc separately (pre-4.0) and you want to examine them, you need to located the directory in which the installers have placed the dlls. The xml files should also be there. The assembly version has been passed into the method so that the correct location can be referenced based on this.

Going back to the ListExceptions() method, you can see that each module is examined via Reflection for its types, and if those types are of type Exception, an ExceptionInfo object is created and added to the collection. The AssemblyName and ExceptionName are readily retrieved, but another method - GetExceptionDescription() is responsible for providing a value for the ExceptionDescription property:


static string GetExceptionDescription(Type t, XDocument doc)
{
  if (doc != null)
  {
    var result =
      doc.Root.Elements("members").Elements("member").Where(
        member => member.Attribute("name").Value == "T:" + t.FullName);
    if (result.Count() > 0)
    {
      //Need to use CreateReader to get all of the XElement content 
      //otherwise it ignores child elements and attributes
      var xmlreader = result.Elements("summary").First().CreateReader();
      xmlreader.MoveToContent();
      var summary = xmlreader.ReadInnerXml();
      const string pattern = "<see cref=\"[T|P|M|E|N]:";
      summary = Regex.Replace(summary.Trim(), pattern, "");
      return summary.Replace("\" />", "").Replace("<see cref=\"Overload:", "");
    }
  }
  return null;
}

The XDocument is loaded into memory once per module. As the Exceptions are iterated in the ListExceptions() method, corresponding <member> nodes are located within the XML and passed to the variable result. This is an IEnumerable<XElement>. Since there should only be one node that matches, it is passed to an XMLReader so that its entire contents are captured using the ReadInnerXml() method. Failure to do this will mean that <see cref="xxx> attributes are ignored. We want them as part of the string, because they invariable point to other types, methods, members etc as part of the summary. The actual markup is removed through a Regex.Replace and String.Replace combination to leave the bare type, method etc. The final collection of ExceptionInfo objects was iterated to a StringBuilder object in my case, to create the html table that displays the results.

When I started out with the simple idea of compiling a cheat sheet of Exceptions, I had no idea that it would lead me to this. Along the way, I discovered Gacutil.exe for the first time, delved a bit into the strucrture of .NET Framework assemblies, reflected on Reflection, understood a bit more about how VS and Intellisense work, stretched Lambdas a little more than I usually do, and played a bit more with Linq To XML. You might think that the end result is a little pointless, but hopefully you will have found the journey there as interesting as I did.