<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:pingback="http://madskills.com/public/xml/rss/module/pingback/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>invalid operation - metadata</title>
    <link>http://blog.invalidoperation.com/</link>
    <description>how do i wrote codes</description>
    <language>en-us</language>
    <copyright>ray</copyright>
    <lastBuildDate>Sun, 12 Apr 2009 03:40:25 GMT</lastBuildDate>
    <generator>newtelligence dasBlog 2.2.8279.16125</generator>
    <managingEditor>ray.oneill@gmail.com</managingEditor>
    <webMaster>ray.oneill@gmail.com</webMaster>
    <item>
      <trackback:ping>http://blog.invalidoperation.com/Trackback.aspx?guid=0c698fee-cd98-428d-81d5-1b35ea521552</trackback:ping>
      <pingback:server>http://blog.invalidoperation.com/pingback.aspx</pingback:server>
      <pingback:target>http://blog.invalidoperation.com/PermaLink,guid,0c698fee-cd98-428d-81d5-1b35ea521552.aspx</pingback:target>
      <dc:creator>Ray</dc:creator>
      <wfw:comment>http://blog.invalidoperation.com/CommentView,guid,0c698fee-cd98-428d-81d5-1b35ea521552.aspx</wfw:comment>
      <wfw:commentRss>http://blog.invalidoperation.com/SyndicationService.asmx/GetEntryCommentsRss?guid=0c698fee-cd98-428d-81d5-1b35ea521552</wfw:commentRss>
      <body xmlns="http://www.w3.org/1999/xhtml">
        <p>
          <a href="http://blog.invalidoperation.com/content/binary/WindowsLiveWriter/FunwithLinqtoSchema_1488A/schemaservice_2.jpg">
            <img title="schemaservice" style="border-top-width: 0px; display: inline; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="497" alt="schemaservice" src="http://blog.invalidoperation.com/content/binary/WindowsLiveWriter/FunwithLinqtoSchema_1488A/schemaservice_thumb.jpg" width="663" border="0" />
          </a>
        </p>
        <p>
What, you mean you <em>don’t</em> expose your database metadata via an Ado.Net data
service?
</p>
        <img width="0" height="0" src="http://blog.invalidoperation.com/aggbug.ashx?id=0c698fee-cd98-428d-81d5-1b35ea521552" />
      </body>
      <title>Fun with Linq to Schema</title>
      <guid isPermaLink="false">http://blog.invalidoperation.com/PermaLink,guid,0c698fee-cd98-428d-81d5-1b35ea521552.aspx</guid>
      <link>http://blog.invalidoperation.com/2009/04/12/FunWithLinqToSchema.aspx</link>
      <pubDate>Sun, 12 Apr 2009 03:40:25 GMT</pubDate>
      <description>&lt;p&gt;
&lt;a href="http://blog.invalidoperation.com/content/binary/WindowsLiveWriter/FunwithLinqtoSchema_1488A/schemaservice_2.jpg"&gt;&lt;img title="schemaservice" style="border-top-width: 0px; display: inline; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="497" alt="schemaservice" src="http://blog.invalidoperation.com/content/binary/WindowsLiveWriter/FunwithLinqtoSchema_1488A/schemaservice_thumb.jpg" width="663" border="0" /&gt;&lt;/a&gt; 
&lt;/p&gt;
&lt;p&gt;
What, you mean you &lt;em&gt;don’t&lt;/em&gt; expose your database metadata via an Ado.Net data
service?
&lt;/p&gt;
&lt;img width="0" height="0" src="http://blog.invalidoperation.com/aggbug.ashx?id=0c698fee-cd98-428d-81d5-1b35ea521552" /&gt;</description>
      <comments>http://blog.invalidoperation.com/CommentView,guid,0c698fee-cd98-428d-81d5-1b35ea521552.aspx</comments>
      <category>meta</category>
      <category>metadata</category>
    </item>
    <item>
      <trackback:ping>http://blog.invalidoperation.com/Trackback.aspx?guid=e438521a-dadc-4722-b54c-6756f3d3db43</trackback:ping>
      <pingback:server>http://blog.invalidoperation.com/pingback.aspx</pingback:server>
      <pingback:target>http://blog.invalidoperation.com/PermaLink,guid,e438521a-dadc-4722-b54c-6756f3d3db43.aspx</pingback:target>
      <dc:creator>Ray</dc:creator>
      <wfw:comment>http://blog.invalidoperation.com/CommentView,guid,e438521a-dadc-4722-b54c-6756f3d3db43.aspx</wfw:comment>
      <wfw:commentRss>http://blog.invalidoperation.com/SyndicationService.asmx/GetEntryCommentsRss?guid=e438521a-dadc-4722-b54c-6756f3d3db43</wfw:commentRss>
      <slash:comments>1</slash:comments>
      <body xmlns="http://www.w3.org/1999/xhtml">
        <p>
 
</p>
        <p>
          <a href="http://blog.invalidoperation.com/2009/02/16/DatabaseMetadataInNetPart1TheProblem.aspx">Getting
at your database metadata is a pain in the ass</a>. In my casual development of NDeavor,
I’m also writing some providers for various database platforms, currently focused
around MySql because it has good .NET support and it’s not Sql Server, which I actually
work with every day (in the professional sense).  It became clear pretty quickly
that without a fully-featured API like SMO that using the functionality in .GetSchema()
off of an ADO.NET-compliant vendor’s connection class is the best bet all around.
</p>
        <p>
Being that I want a solution I can use for all the providers I end up doing for NDeavor,
and be as lightweight as possible, <a href="http://www.hanselman.com/blog/T4TextTemplateTransformationToolkitCodeGenerationBestKeptVisualStudioSecret.aspx">T4
templates</a> was the best choice. Oh, and I want it LINQ-enabled as well. It will
basically do the following: 
<br />
- Call .GetSchema() on a DbConnection instance to get available metadata (as well
as specific collections you tell it to). 
<br />
- Ouput a C# POCO class with properties mapping to each column of the metadata’s DataTable
that GetSchema() returns 
<br />
- Output a DataContext-like class allowing for LINQ goodness.
</p>
        <p>
You might think:
</p>
        <blockquote>
          <p>
            <em>“Really? You’re getting the schema of a DataTable returned from </em>
            <em>DbConnection.GetSchema()
so it can used to create the schema for a C# class which will be used to query the
schema of a relational database?”</em>
          </p>
        </blockquote>
        <p>
and you’d be right. So?
</p>
        <p>
The idea of DBConnection.GetSchema(), in Microsoft terms, was to allow querying of
vendor-specific metadata about the underlying data source, so as a lowest-common-denominator
it returns instances of the ubiquitous DataTable and DataSet. Passing around loosely
typed blobs of data won’t cut it, so we’ll be mapping these to strongly types classes,
similar to that of an ORM framework. That sounds tedious and like a job for some lightweight
code generation, in this case it will be some T4 template magic in Visual Studio 2008.
These classes will just be thin wrappers over DataRows, they won’t have the associations/relationship
semantics you’re used to seeing with Linq to Sql or Entity Framework, they’re just
glue to support building up a larger more api-friendly object model. Any of the joins
you do from the resulting object collections will be in-memory, there’s nothing dynamically
querying the underlying schema. It’s all smoke and mirrors.
</p>
        <h4>5 Steps to LINQ-ifying to your Db Schema
</h4>
        <p>
          <a href="/downloads/LinqToSchema_04112009.zip">Download LinqToSchema</a>
        </p>
        <ol>
          <li>
Add LinqToSchema.tt to your Visual Studio 2008 project, and remove the ‘Custom Tool’
setting – this is not the template itself, just for inclusion inside the ‘host’ template. 
</li>
          <li>
Create a new T4 template by adding a text file to your project and renaming the extension
to .tt 
</li>
          <li>
Write the template to look similar to this: <pre class="c#" name="code">&lt;#@ template language="C#v3.5" hostspecific="True" debug="true" #&gt;
&lt;#@ output extension="cs" #&gt;
&lt;#@ include file="LinqToSchema.tt" #&gt;
&lt;# 
    LinqToSchemaSettings settings = new LinqToSchemaSettings
    {
        ConnectionStringName = "Chinook_MySql",    // name of connection string in local App.Config/Web.Config - must be in same dir as this file
        TargetNamespace = "Chinook.Schema.MySql",    // what namespace to put the generated code in
        ClassNamePrefix = "MySql",                        // what prefix, if any, to put on the generated class names
        ClassVisibility = Visibility.Public                // visibility of the generated classes (Public or Internal)
    };
   
    LinqToSchemaClassesGenerator generator = new LinqToSchemaClassesGenerator(settings);
    generator.Run();
#&gt;</pre></li>
          <li>
Save your just-created template for it to be processed by the T4 engine. 
</li>
          <li>
Compile/write other code that uses the generated classes. Such as: 
<br /><pre class="c#" name="code">using (MySqlSchemaContext ctx = new MySqlSchemaContext(@"server=servername;user id=userid;persist security info=True;database=chinook"))
{
    var query = from tbl in ctx.Tables
                join col in ctx.Columns on tbl.TableName equals col.TableName
                where tbl.TableName.ToLower() == "track"
                select new
                {
                    tbl.TableCatalog,
                    tbl.TableName,
                    col.ColumnName,
                    col.DataType,
                    col.IsNullable
                };
    query.ToList().ForEach(p =&gt; Console.WriteLine(p));
}</pre></li>
        </ol>
        <p>
There are some more options such as explicitly setting the connection string, which
are explained in the readme. And if it wasn’t already clear, because this is using
DbConnection.GetSchema(), it’s usable on any database platform that returns meaningful
metadata using from their platform-specific DbConnection implementation.
</p>
        <img width="0" height="0" src="http://blog.invalidoperation.com/aggbug.ashx?id=e438521a-dadc-4722-b54c-6756f3d3db43" />
      </body>
      <title>Database Metadata in .Net Part 2 : LINQ To Schema</title>
      <guid isPermaLink="false">http://blog.invalidoperation.com/PermaLink,guid,e438521a-dadc-4722-b54c-6756f3d3db43.aspx</guid>
      <link>http://blog.invalidoperation.com/2009/04/12/DatabaseMetadataInNetPart2LINQToSchema.aspx</link>
      <pubDate>Sun, 12 Apr 2009 01:55:36 GMT</pubDate>
      <description>&lt;p&gt;
&amp;#160;
&lt;/p&gt;
&lt;p&gt;
&lt;a href="http://blog.invalidoperation.com/2009/02/16/DatabaseMetadataInNetPart1TheProblem.aspx"&gt;Getting
at your database metadata is a pain in the ass&lt;/a&gt;. In my casual development of NDeavor,
I’m also writing some providers for various database platforms, currently focused
around MySql because it has good .NET support and it’s not Sql Server, which I actually
work with every day (in the professional sense).&amp;#160; It became clear pretty quickly
that without a fully-featured API like SMO that using the functionality in .GetSchema()
off of an ADO.NET-compliant vendor’s connection class is the best bet all around.
&lt;/p&gt;
&lt;p&gt;
Being that I want a solution I can use for all the providers I end up doing for NDeavor,
and be as lightweight as possible, &lt;a href="http://www.hanselman.com/blog/T4TextTemplateTransformationToolkitCodeGenerationBestKeptVisualStudioSecret.aspx"&gt;T4
templates&lt;/a&gt; was the best choice. Oh, and I want it LINQ-enabled as well. It will
basically do the following: 
&lt;br /&gt;
- Call .GetSchema() on a DbConnection instance to get available metadata (as well
as specific collections you tell it to). 
&lt;br /&gt;
- Ouput a C# POCO class with properties mapping to each column of the metadata’s DataTable
that GetSchema() returns 
&lt;br /&gt;
- Output a DataContext-like class allowing for LINQ goodness.
&lt;/p&gt;
&lt;p&gt;
You might think:
&lt;/p&gt;
&lt;blockquote&gt; 
&lt;p&gt;
&lt;em&gt;“Really? You’re getting the schema of a DataTable returned from &lt;/em&gt;&lt;em&gt;DbConnection.GetSchema()
so it can used to create the schema for a C# class which will be used to query the
schema of a relational database?”&lt;/em&gt;
&lt;/p&gt;
&lt;/blockquote&gt; 
&lt;p&gt;
and you’d be right. So?
&lt;/p&gt;
&lt;p&gt;
The idea of DBConnection.GetSchema(), in Microsoft terms, was to allow querying of
vendor-specific metadata about the underlying data source, so as a lowest-common-denominator
it returns instances of the ubiquitous DataTable and DataSet. Passing around loosely
typed blobs of data won’t cut it, so we’ll be mapping these to strongly types classes,
similar to that of an ORM framework. That sounds tedious and like a job for some lightweight
code generation, in this case it will be some T4 template magic in Visual Studio 2008.
These classes will just be thin wrappers over DataRows, they won’t have the associations/relationship
semantics you’re used to seeing with Linq to Sql or Entity Framework, they’re just
glue to support building up a larger more api-friendly object model. Any of the joins
you do from the resulting object collections will be in-memory, there’s nothing dynamically
querying the underlying schema. It’s all smoke and mirrors.
&lt;/p&gt;
&lt;h4&gt;5 Steps to LINQ-ifying to your Db Schema
&lt;/h4&gt;
&lt;p&gt;
&lt;a href="/downloads/LinqToSchema_04112009.zip"&gt;Download LinqToSchema&lt;/a&gt;
&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
Add LinqToSchema.tt to your Visual Studio 2008 project, and remove the ‘Custom Tool’
setting – this is not the template itself, just for inclusion inside the ‘host’ template. 
&lt;/li&gt;
&lt;li&gt;
Create a new T4 template by adding a text file to your project and renaming the extension
to .tt 
&lt;/li&gt;
&lt;li&gt;
Write the template to look similar to this: &lt;pre class="c#" name="code"&gt;&amp;lt;#@ template language=&amp;quot;C#v3.5&amp;quot; hostspecific=&amp;quot;True&amp;quot; debug=&amp;quot;true&amp;quot; #&amp;gt;
&amp;lt;#@ output extension=&amp;quot;cs&amp;quot; #&amp;gt;
&amp;lt;#@ include file=&amp;quot;LinqToSchema.tt&amp;quot; #&amp;gt;
&amp;lt;# 
    LinqToSchemaSettings settings = new LinqToSchemaSettings
    {
        ConnectionStringName = &amp;quot;Chinook_MySql&amp;quot;,    // name of connection string in local App.Config/Web.Config - must be in same dir as this file
        TargetNamespace = &amp;quot;Chinook.Schema.MySql&amp;quot;,    // what namespace to put the generated code in
        ClassNamePrefix = &amp;quot;MySql&amp;quot;,                        // what prefix, if any, to put on the generated class names
        ClassVisibility = Visibility.Public                // visibility of the generated classes (Public or Internal)
    };
   
    LinqToSchemaClassesGenerator generator = new LinqToSchemaClassesGenerator(settings);
    generator.Run();
#&amp;gt;&lt;/pre&gt;
&lt;/li&gt;
&lt;li&gt;
Save your just-created template for it to be processed by the T4 engine. 
&lt;/li&gt;
&lt;li&gt;
Compile/write other code that uses the generated classes. Such as: 
&lt;br /&gt;
&lt;pre class="c#" name="code"&gt;using (MySqlSchemaContext ctx = new MySqlSchemaContext(@&amp;quot;server=servername;user id=userid;persist security info=True;database=chinook&amp;quot;))
{
    var query = from tbl in ctx.Tables
                join col in ctx.Columns on tbl.TableName equals col.TableName
                where tbl.TableName.ToLower() == &amp;quot;track&amp;quot;
                select new
                {
                    tbl.TableCatalog,
                    tbl.TableName,
                    col.ColumnName,
                    col.DataType,
                    col.IsNullable
                };
    query.ToList().ForEach(p =&amp;gt; Console.WriteLine(p));
}&lt;/pre&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;
There are some more options such as explicitly setting the connection string, which
are explained in the readme. And if it wasn’t already clear, because this is using
DbConnection.GetSchema(), it’s usable on any database platform that returns meaningful
metadata using from their platform-specific DbConnection implementation.
&lt;/p&gt;
&lt;img width="0" height="0" src="http://blog.invalidoperation.com/aggbug.ashx?id=e438521a-dadc-4722-b54c-6756f3d3db43" /&gt;</description>
      <comments>http://blog.invalidoperation.com/CommentView,guid,e438521a-dadc-4722-b54c-6756f3d3db43.aspx</comments>
      <category>database</category>
      <category>Linq To Schema</category>
      <category>meta</category>
      <category>metadata</category>
      <category>NDeavor</category>
    </item>
    <item>
      <trackback:ping>http://blog.invalidoperation.com/Trackback.aspx?guid=5b20c588-314d-4ce2-b9f1-ed07b4fb7793</trackback:ping>
      <pingback:server>http://blog.invalidoperation.com/pingback.aspx</pingback:server>
      <pingback:target>http://blog.invalidoperation.com/PermaLink,guid,5b20c588-314d-4ce2-b9f1-ed07b4fb7793.aspx</pingback:target>
      <dc:creator>Ray</dc:creator>
      <wfw:comment>http://blog.invalidoperation.com/CommentView,guid,5b20c588-314d-4ce2-b9f1-ed07b4fb7793.aspx</wfw:comment>
      <wfw:commentRss>http://blog.invalidoperation.com/SyndicationService.asmx/GetEntryCommentsRss?guid=5b20c588-314d-4ce2-b9f1-ed07b4fb7793</wfw:commentRss>
      <slash:comments>1</slash:comments>
      <body xmlns="http://www.w3.org/1999/xhtml">
        <p>
For most ORM or data layer utilities there is some component of code generation involved
and somewhere along the line retrieving some metadata about the underlying structure
of a database is usually involved. Seeing as it relates to <a href="http://blog.invalidoperation.com/2008/12/11/CurrentProjectNDeavor.aspx">NDeavor</a>,
I figured I’d write about the ins and outs of getting metadata when running close
to the database. Basically, there are a few basic ways to access database specific
metadata directly through .Net.
</p>
        <h5>Vendor-specific API.
</h5>
        <p>
I only know of one comprehensive API for this and it’s <a href="http://www.microsoft.com/downloads/details.aspx?familyid=d09c1d60-a13c-4479-9b91-9e8b9d835cdc&amp;displaylang=en">Sql
Management Objects (SMO)</a> which is Sql-Server only. A tremendously useful API for
any Microsoft-dominated shops, though. Enumerating through basic structures like tables
and columns is incredibly easy, if a little slow. Getting the table name, column name,
and data type name of every column in every table in the <a href="http://www.codeplex.com/ChinookDatabase">chinook</a> database
might look like this:
</p>
        <pre class="c#" name="code">public void PrintChinookTableColumns()
{
    ServerConnection sc = new ServerConnection(@".\SQLEXPRESS");
    sc.Connect();
    Server server = new Server(sc);
    Database chinookDb = server.Databases["chinook"];
    server.SetDefaultInitFields(typeof(Table), "IsSystemObject");
    foreach (Table table in chinookDb.Tables)
    {
        if (table.IsSystemObject)
            continue;

        string tableName = table.Name;

        foreach (Column col in table.Columns)
        {
            string columnName = col.Name;
            string dataTypeName = col.DataType.Name;

            Console.WriteLine("{0} {1} {2}", tableName, columnName, dataTypeName);
        }
    }
    sc.Disconnect();
}</pre>
        <p>
 
</p>
        <h5>Querying vendor-specific system tables. 
</h5>
        <p>
          <strong>
          </strong>Vendor lock-in at it’s finest, and typically unsupported at that.
Usefulness is dependent on what level of information is available and the amount of
time you’re willing to spend figuring out what the various columns and values mean.
Getting table, column, and data type names in a specific database in Sql Server Express
2005 might look like this:
</p>
        <pre class="sql" name="code">use chinook
go

SELECT  T.name as TableName, 
        C.name as ColumnName,
        ST.name as DataType
FROM 
    sys.tables T 
    INNER JOIN sys.columns C on T.object_id = C.object_id
    INNER JOIN sys.systypes ST on C.system_type_id = ST.type</pre>
        <h5> 
</h5>
        <h5>Querying <em>standard</em> metadata tables.
</h5>
        <p>
          <strong>
          </strong>Emphasis on ‘standard’, because there really is none. There is a
spec as part of the SQL standards that came along sometime around SQL92/SQL99 (not
sure) which dictates the INFORMATION_SCHEMA tables which store very basic metadata
about the tables, views, columns, procedures, functions, parameters, constraints,
etc, of a specific relational database. Support for these is relative to how much
the vendor cares about doing so. Sql Server and MySql both have pretty strong support
for these. But even then, the spec is very loose and while simple queries to say,
get all table names and column names, might be portable, more complex ones likely
won’t be. There’s other discrepancies as well, for instance the verbiage around ‘schema’
and ‘catalog’. In MySql terms <em>schema</em> is analogous to a Sql Server <em>database</em>,
and this translates down into their implementation of these metadata tables. Back
to the example of tables, columns, and data types in the chinook database:
</p>
        <pre class="sql" name="code">Sql Server: 
use chinook 
go 

SELECT 
    T.TABLE_NAME as TableName,    
    C.COLUMN_NAME as ColumnName, 
    C.DATA_TYPE as DataType 
FROM 
    INFORMATION_SCHEMA.TABLES T 
    INNER JOIN INFORMATION_SCHEMA.COLUMNS C ON 
        T.TABLE_NAME = C.TABLE_NAME AND 
        T.TABLE_SCHEMA = C.TABLE_SCHEMA AND 
        T.TABLE_CATALOG = C.TABLE_CATALOG

 

MySql: 
SELECT 
    T.TABLE_NAME as TableName, 
    C.COLUMN_NAME as ColumnName, 
    C.DATA_TYPE as DataType 
FROM 
    INFORMATION_SCHEMA.TABLES T 
    INNER JOIN INFORMATION_SCHEMA.COLUMNS C ON 
        T.TABLE_NAME = C.TABLE_NAME AND 
        T.TABLE_SCHEMA = C.TABLE_SCHEMA 
WHERE T.TABLE_SCHEMA = 'chinook';</pre>
        <h5> 
</h5>
        <h5>Using the schema-related functionality in Ado.Net 2.0. 
</h5>
        <p>
Introduced in .Net 2.0, the DbConnection base class has a method with a few overloads
called GetSchema(). This is supposed to allow implementers of Ado.Net providers a
way to provide metadata about the underlying connected data store. Since it’s not
strongly typed, and consists of returning DataTables, it’s entirely up to the implementer
as far as what information is provided. Typically calling the parameter-less version
of GetSchema() would return a DataTable that describes what metadata is available,
indicated by a string column called something like <em>MetaDataCollection</em> in
which you can pass to subsequent calls to GetSchema(string) to get that specific chunk
of metadata. There’s a decent write up of what’s available in the native Ado.Net providers
located <a href="http://msdn.microsoft.com/en-us/library/kcax58fh.aspx">here</a>.
Here’s an extension method to get all the metadata into one dataset for further manipulation,
which should work for the built-in Ado.Net providers, as well as MySql.Data.MysqlClient
using the latest <a href="http://dev.mysql.com/downloads/connector/net/5.2.html">MySql
Net Connector</a>:
</p>
        <pre class="c#" name="code">using System.Collections.Generic;
using System.Data;
using System.Data.Common;
using System.Linq;

namespace System.Data
{
    public static class DbConnectionExtensions
    {
        public static DataSet GetSchemaSet(this DbConnection connection)
        {
            DataSet result = new DataSet();

            DataTable collections = connection.GetSchema();

            List&lt;string&gt; availableCollectionNames = (from row in collections.AsEnumerable()
                                                     select row.Field&lt;string&gt;("CollectionName"))
                                                     .Distinct()
                                                     .ToList();

            foreach (string collectionName in availableCollectionNames)
            {
                try
                {
                    DataTable schemaTable = connection.GetSchema(collectionName);
                    result.Tables.Add(schemaTable);
                }
                catch { }
                // this is bad form and shouldn't be necessary but SqlClient seems to try to give Sql2008 metadata for 
                // Sql2005 connections which results in unneeded exceptions
            }

            return result;
        }
    }
}</pre>
        <p>
The other mechanism for getting metadata is the GetSchemaTable() method on the System.Data.IDataReader
interface, which provides some limited information on the metadata for the columns
that exist in a particular instance of IDataReader, typically from a call to DbCommand.ExecuteReader().
Again, GetSchemaTable(), as you might have guessed, returns a DataTable, this time
with structural information only relating to the columns of the IDataReader itself.
This, combined with DbDataReader.GetName(int) and DbDataReader.GetDataTypeName(int)
can give some pretty useful information for table and column structure returned from
a select statement. That being said, unless you’re reverse-engineering result sets
from stored procedures, you’ll likely find the DbConnection.GetSchema() functionality
much more effective.
</p>
        <p>
If you’re using the System.Data.OleDb provider, there’s the GetOleDbSchemaTable method
on the OleDbConnection class which provides its own metadata about the underlying
database. It works a little differently and takes a System.Guid value which refers
to the type of metadata being requested, all of which are static fields in the System.Data.OleDb.OleDbSchemaGuid
class.
</p>
        <pre class="c#" name="code">using System.Collections.Generic;
using System.Data.OleDb;
using System.Linq;
using System.Reflection;

namespace System.Data
{
    public static class OleDbConnectionExtensions
    {
        public static DataSet GetOleDbSchemaSet(this OleDbConnection connection)
        {
            DataSet result = new DataSet();
            List&lt;FieldInfo&gt; guidMembers = typeof(OleDbSchemaGuid).GetFields(BindingFlags.Static | BindingFlags.Public).ToList();

            foreach (FieldInfo field in guidMembers)
            {
                if (field.FieldType == typeof(Guid))
                {
                    Guid val = (Guid)field.GetValue(null);

                    try
                    {
                        DataTable schemaTable = connection.GetOleDbSchemaTable(val, new object[] { });
                        result.Tables.Add(schemaTable);
                    }
                    catch { } // unfortunately not all schema guids supported by all oledb connections necessarily
                }
            }

            return result;
        }
    }
}</pre>
        <h5>Oddities and Errata
</h5>
        <p>
Under .NET 3.5 (maybe SP1 only), if you execute .GetSchema() on a System.Data.SqlClient.SqlConnection
connected to a Sql Server 2005, you’ll notice there's a row indicating that the metadatacollection
called <em>StructuredTypeMembers</em> is available. However, this is Sql Server 2008-specific
and will throw an exception if you call .GetSchema(“StructuredTypeMembers”) on that
same Sql 2005 connection. Why Microsoft let it even be exposed during calls to .GetSchema()
on non-2008 connections, who knows. They already have a precedent for exposing Sql
2008-specific metadata depending on 2005/2008 server version as indicated <a href="http://msdn.microsoft.com/en-us/library/ms254969.aspx">here</a> so
it seems pretty stupid on their part. That’s the reason for the empty catch statement
in the code block above.
</p>
        <p>
Using <a href="http://dev.mysql.com/downloads/connector/net/5.2.html">MySql Connector
Net 5.2.5</a>, the schema collection of <em>Foreign Key Columns</em> is available
but not indicated by the resulting DataTable returned by MySqlConnection.GetSchema().
I notified <a href="http://www.reggieburnett.com/">Reggie Burnett</a>, who maintains
the Net Connector, and he says it’s been fixed, so I’d expect to see that in a future
release.
</p>
        <p>
Getting the structure of result sets from stored procedures or functions is even more
difficult than any of the basic table/column structure stuff I’ve mentioned. Using
Sql Server, you can use the <em>SET FMTONLY ON </em>statement before the stored procedure
call, and <em>SET FMTONLY OFF </em>afterwards, in order to get an empty result set
from which you can derive schema without executing against live data. However, to
do that in code, you’d have to derive their parameters using some other means like
GetSchema() and build a statement block with the FMTONLY lines surrounding the procedure
call. Oh, and none of this works if the stored procedure uses temporary tables to
return that result set. And again, it’s Sql Server specific. For MySql, the only way
I’ve figured out is to:
</p>
        <ol>
          <li>
Derive the parameter information using MySqlConnection.GetSchema(“Procedure Parameters”) 
</li>
          <li>
Construct a MySqlCommand with command text equal to the procedure name and CommandType
set to CommandType.StoredProcedure 
</li>
          <li>
Add phony parameters using the previously-derived information and set their values
to DBNull.Value. 
</li>
          <li>
Execute the command via ExecuteReader() 
</li>
          <li>
Reverse engineer the schema using GetName(int), GetDataTypeName(int) on the MySqlDataReader,
in tandem with GetSchemaTable(). Theoretically, if the information in the schema table
indicates the column came from a physical table, one could go to the source and look
at the values of MySqlConnection.GetSchema(“Columns”) for the source column definitions. 
</li>
        </ol>
        <p>
That’s likely how I’ll be doing it in the NDeavor providers where stored procedures
are supported like Oracle and Postgres. It’s less than ideal, but if you’re still
following this, you’d realize that’s my point! 
</p>
        <h5>Adding It Up
</h5>
        <p>
There’s a pattern to getting at your metadata and that pattern is it’s a pain in the
ass. I’m hoping NDeavor will provide an easy means of getting database schema out
of various platforms for the purposes of artifact-generation. But even so, I’m not
exactly planning on writing each NDeavor provider as its own one-off implementation
because there’s such inconsistency between metadata across platforms. I have a way
forward for a more lower-level means of getting at this stuff that solves a lot of
the cross-cutting discrepancies, and that will be in part 2.
</p>
        <img width="0" height="0" src="http://blog.invalidoperation.com/aggbug.ashx?id=5b20c588-314d-4ce2-b9f1-ed07b4fb7793" />
      </body>
      <title>Database Metadata in .Net Part 1 : The Problem</title>
      <guid isPermaLink="false">http://blog.invalidoperation.com/PermaLink,guid,5b20c588-314d-4ce2-b9f1-ed07b4fb7793.aspx</guid>
      <link>http://blog.invalidoperation.com/2009/02/16/DatabaseMetadataInNetPart1TheProblem.aspx</link>
      <pubDate>Mon, 16 Feb 2009 02:16:00 GMT</pubDate>
      <description>&lt;p&gt;
For most ORM or data layer utilities there is some component of code generation involved
and somewhere along the line retrieving some metadata about the underlying structure
of a database is usually involved. Seeing as it relates to &lt;a href="http://blog.invalidoperation.com/2008/12/11/CurrentProjectNDeavor.aspx"&gt;NDeavor&lt;/a&gt;,
I figured I’d write about the ins and outs of getting metadata when running close
to the database. Basically, there are a few basic ways to access database specific
metadata directly through .Net.
&lt;/p&gt;
&lt;h5&gt;Vendor-specific API.
&lt;/h5&gt;
&lt;p&gt;
I only know of one comprehensive API for this and it’s &lt;a href="http://www.microsoft.com/downloads/details.aspx?familyid=d09c1d60-a13c-4479-9b91-9e8b9d835cdc&amp;amp;displaylang=en"&gt;Sql
Management Objects (SMO)&lt;/a&gt; which is Sql-Server only. A tremendously useful API for
any Microsoft-dominated shops, though. Enumerating through basic structures like tables
and columns is incredibly easy, if a little slow. Getting the table name, column name,
and data type name of every column in every table in the &lt;a href="http://www.codeplex.com/ChinookDatabase"&gt;chinook&lt;/a&gt; database
might look like this:
&lt;/p&gt;
&lt;pre class="c#" name="code"&gt;public void PrintChinookTableColumns()
{
    ServerConnection sc = new ServerConnection(@&amp;quot;.\SQLEXPRESS&amp;quot;);
    sc.Connect();
    Server server = new Server(sc);
    Database chinookDb = server.Databases[&amp;quot;chinook&amp;quot;];
    server.SetDefaultInitFields(typeof(Table), &amp;quot;IsSystemObject&amp;quot;);
    foreach (Table table in chinookDb.Tables)
    {
        if (table.IsSystemObject)
            continue;

        string tableName = table.Name;

        foreach (Column col in table.Columns)
        {
            string columnName = col.Name;
            string dataTypeName = col.DataType.Name;

            Console.WriteLine(&amp;quot;{0} {1} {2}&amp;quot;, tableName, columnName, dataTypeName);
        }
    }
    sc.Disconnect();
}&lt;/pre&gt;
&lt;p&gt;
&amp;#160;
&lt;/p&gt;
&lt;h5&gt;Querying vendor-specific system tables. 
&lt;/h5&gt;
&lt;p&gt;
&lt;strong&gt;&lt;/strong&gt;Vendor lock-in at it’s finest, and typically unsupported at that.
Usefulness is dependent on what level of information is available and the amount of
time you’re willing to spend figuring out what the various columns and values mean.
Getting table, column, and data type names in a specific database in Sql Server Express
2005 might look like this:
&lt;/p&gt;
&lt;pre class="sql" name="code"&gt;use chinook
go

SELECT  T.name as TableName, 
        C.name as ColumnName,
        ST.name as DataType
FROM 
    sys.tables T 
    INNER JOIN sys.columns C on T.object_id = C.object_id
    INNER JOIN sys.systypes ST on C.system_type_id = ST.type&lt;/pre&gt;
&lt;h5&gt;&amp;#160;
&lt;/h5&gt;
&lt;h5&gt;Querying &lt;em&gt;standard&lt;/em&gt; metadata tables.
&lt;/h5&gt;
&lt;p&gt;
&lt;strong&gt;&lt;/strong&gt;Emphasis on ‘standard’, because there really is none. There is a
spec as part of the SQL standards that came along sometime around SQL92/SQL99 (not
sure) which dictates the INFORMATION_SCHEMA tables which store very basic metadata
about the tables, views, columns, procedures, functions, parameters, constraints,
etc, of a specific relational database. Support for these is relative to how much
the vendor cares about doing so. Sql Server and MySql both have pretty strong support
for these. But even then, the spec is very loose and while simple queries to say,
get all table names and column names, might be portable, more complex ones likely
won’t be. There’s other discrepancies as well, for instance the verbiage around ‘schema’
and ‘catalog’. In MySql terms &lt;em&gt;schema&lt;/em&gt; is analogous to a Sql Server &lt;em&gt;database&lt;/em&gt;,
and this translates down into their implementation of these metadata tables. Back
to the example of tables, columns, and data types in the chinook database:
&lt;/p&gt;
&lt;pre class="sql" name="code"&gt;Sql Server: 
use chinook 
go 

SELECT 
    T.TABLE_NAME as TableName,    
    C.COLUMN_NAME as ColumnName, 
    C.DATA_TYPE as DataType 
FROM 
    INFORMATION_SCHEMA.TABLES T 
    INNER JOIN INFORMATION_SCHEMA.COLUMNS C ON 
        T.TABLE_NAME = C.TABLE_NAME AND 
        T.TABLE_SCHEMA = C.TABLE_SCHEMA AND 
        T.TABLE_CATALOG = C.TABLE_CATALOG

 

MySql: 
SELECT 
    T.TABLE_NAME as TableName, 
    C.COLUMN_NAME as ColumnName, 
    C.DATA_TYPE as DataType 
FROM 
    INFORMATION_SCHEMA.TABLES T 
    INNER JOIN INFORMATION_SCHEMA.COLUMNS C ON 
        T.TABLE_NAME = C.TABLE_NAME AND 
        T.TABLE_SCHEMA = C.TABLE_SCHEMA 
WHERE T.TABLE_SCHEMA = 'chinook';&lt;/pre&gt;
&lt;h5&gt;&amp;#160;
&lt;/h5&gt;
&lt;h5&gt;Using the schema-related functionality in Ado.Net 2.0. 
&lt;/h5&gt;
&lt;p&gt;
Introduced in .Net 2.0, the DbConnection base class has a method with a few overloads
called GetSchema(). This is supposed to allow implementers of Ado.Net providers a
way to provide metadata about the underlying connected data store. Since it’s not
strongly typed, and consists of returning DataTables, it’s entirely up to the implementer
as far as what information is provided. Typically calling the parameter-less version
of GetSchema() would return a DataTable that describes what metadata is available,
indicated by a string column called something like &lt;em&gt;MetaDataCollection&lt;/em&gt; in
which you can pass to subsequent calls to GetSchema(string) to get that specific chunk
of metadata. There’s a decent write up of what’s available in the native Ado.Net providers
located &lt;a href="http://msdn.microsoft.com/en-us/library/kcax58fh.aspx"&gt;here&lt;/a&gt;.
Here’s an extension method to get all the metadata into one dataset for further manipulation,
which should work for the built-in Ado.Net providers, as well as MySql.Data.MysqlClient
using the latest &lt;a href="http://dev.mysql.com/downloads/connector/net/5.2.html"&gt;MySql
Net Connector&lt;/a&gt;:
&lt;/p&gt;
&lt;pre class="c#" name="code"&gt;using System.Collections.Generic;
using System.Data;
using System.Data.Common;
using System.Linq;

namespace System.Data
{
    public static class DbConnectionExtensions
    {
        public static DataSet GetSchemaSet(this DbConnection connection)
        {
            DataSet result = new DataSet();

            DataTable collections = connection.GetSchema();

            List&amp;lt;string&amp;gt; availableCollectionNames = (from row in collections.AsEnumerable()
                                                     select row.Field&amp;lt;string&amp;gt;(&amp;quot;CollectionName&amp;quot;))
                                                     .Distinct()
                                                     .ToList();

            foreach (string collectionName in availableCollectionNames)
            {
                try
                {
                    DataTable schemaTable = connection.GetSchema(collectionName);
                    result.Tables.Add(schemaTable);
                }
                catch { }
                // this is bad form and shouldn't be necessary but SqlClient seems to try to give Sql2008 metadata for 
                // Sql2005 connections which results in unneeded exceptions
            }

            return result;
        }
    }
}&lt;/pre&gt;
&lt;p&gt;
The other mechanism for getting metadata is the GetSchemaTable() method on the System.Data.IDataReader
interface, which provides some limited information on the metadata for the columns
that exist in a particular instance of IDataReader, typically from a call to DbCommand.ExecuteReader().
Again, GetSchemaTable(), as you might have guessed, returns a DataTable, this time
with structural information only relating to the columns of the IDataReader itself.
This, combined with DbDataReader.GetName(int) and DbDataReader.GetDataTypeName(int)
can give some pretty useful information for table and column structure returned from
a select statement. That being said, unless you’re reverse-engineering result sets
from stored procedures, you’ll likely find the DbConnection.GetSchema() functionality
much more effective.
&lt;/p&gt;
&lt;p&gt;
If you’re using the System.Data.OleDb provider, there’s the GetOleDbSchemaTable method
on the OleDbConnection class which provides its own metadata about the underlying
database. It works a little differently and takes a System.Guid value which refers
to the type of metadata being requested, all of which are static fields in the System.Data.OleDb.OleDbSchemaGuid
class.
&lt;/p&gt;
&lt;pre class="c#" name="code"&gt;using System.Collections.Generic;
using System.Data.OleDb;
using System.Linq;
using System.Reflection;

namespace System.Data
{
    public static class OleDbConnectionExtensions
    {
        public static DataSet GetOleDbSchemaSet(this OleDbConnection connection)
        {
            DataSet result = new DataSet();
            List&amp;lt;FieldInfo&amp;gt; guidMembers = typeof(OleDbSchemaGuid).GetFields(BindingFlags.Static | BindingFlags.Public).ToList();

            foreach (FieldInfo field in guidMembers)
            {
                if (field.FieldType == typeof(Guid))
                {
                    Guid val = (Guid)field.GetValue(null);

                    try
                    {
                        DataTable schemaTable = connection.GetOleDbSchemaTable(val, new object[] { });
                        result.Tables.Add(schemaTable);
                    }
                    catch { } // unfortunately not all schema guids supported by all oledb connections necessarily
                }
            }

            return result;
        }
    }
}&lt;/pre&gt;
&lt;h5&gt;Oddities and Errata
&lt;/h5&gt;
&lt;p&gt;
Under .NET 3.5 (maybe SP1 only), if you execute .GetSchema() on a System.Data.SqlClient.SqlConnection
connected to a Sql Server 2005, you’ll notice there's a row indicating that the metadatacollection
called &lt;em&gt;StructuredTypeMembers&lt;/em&gt; is available. However, this is Sql Server 2008-specific
and will throw an exception if you call .GetSchema(“StructuredTypeMembers”) on that
same Sql 2005 connection. Why Microsoft let it even be exposed during calls to .GetSchema()
on non-2008 connections, who knows. They already have a precedent for exposing Sql
2008-specific metadata depending on 2005/2008 server version as indicated &lt;a href="http://msdn.microsoft.com/en-us/library/ms254969.aspx"&gt;here&lt;/a&gt; so
it seems pretty stupid on their part. That’s the reason for the empty catch statement
in the code block above.
&lt;/p&gt;
&lt;p&gt;
Using &lt;a href="http://dev.mysql.com/downloads/connector/net/5.2.html"&gt;MySql Connector
Net 5.2.5&lt;/a&gt;, the schema collection of &lt;em&gt;Foreign Key Columns&lt;/em&gt; is available
but not indicated by the resulting DataTable returned by MySqlConnection.GetSchema().
I notified &lt;a href="http://www.reggieburnett.com/"&gt;Reggie Burnett&lt;/a&gt;, who maintains
the Net Connector, and he says it’s been fixed, so I’d expect to see that in a future
release.
&lt;/p&gt;
&lt;p&gt;
Getting the structure of result sets from stored procedures or functions is even more
difficult than any of the basic table/column structure stuff I’ve mentioned. Using
Sql Server, you can use the &lt;em&gt;SET FMTONLY ON &lt;/em&gt;statement before the stored procedure
call, and &lt;em&gt;SET FMTONLY OFF &lt;/em&gt;afterwards, in order to get an empty result set
from which you can derive schema without executing against live data. However, to
do that in code, you’d have to derive their parameters using some other means like
GetSchema() and build a statement block with the FMTONLY lines surrounding the procedure
call. Oh, and none of this works if the stored procedure uses temporary tables to
return that result set. And again, it’s Sql Server specific. For MySql, the only way
I’ve figured out is to:
&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
Derive the parameter information using MySqlConnection.GetSchema(“Procedure Parameters”) 
&lt;/li&gt;
&lt;li&gt;
Construct a MySqlCommand with command text equal to the procedure name and CommandType
set to CommandType.StoredProcedure 
&lt;/li&gt;
&lt;li&gt;
Add phony parameters using the previously-derived information and set their values
to DBNull.Value. 
&lt;/li&gt;
&lt;li&gt;
Execute the command via ExecuteReader() 
&lt;/li&gt;
&lt;li&gt;
Reverse engineer the schema using GetName(int), GetDataTypeName(int) on the MySqlDataReader,
in tandem with GetSchemaTable(). Theoretically, if the information in the schema table
indicates the column came from a physical table, one could go to the source and look
at the values of MySqlConnection.GetSchema(“Columns”) for the source column definitions. 
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;
That’s likely how I’ll be doing it in the NDeavor providers where stored procedures
are supported like Oracle and Postgres. It’s less than ideal, but if you’re still
following this, you’d realize that’s my point! 
&lt;/p&gt;
&lt;h5&gt;Adding It Up
&lt;/h5&gt;
&lt;p&gt;
There’s a pattern to getting at your metadata and that pattern is it’s a pain in the
ass. I’m hoping NDeavor will provide an easy means of getting database schema out
of various platforms for the purposes of artifact-generation. But even so, I’m not
exactly planning on writing each NDeavor provider as its own one-off implementation
because there’s such inconsistency between metadata across platforms. I have a way
forward for a more lower-level means of getting at this stuff that solves a lot of
the cross-cutting discrepancies, and that will be in part 2.
&lt;/p&gt;
&lt;img width="0" height="0" src="http://blog.invalidoperation.com/aggbug.ashx?id=5b20c588-314d-4ce2-b9f1-ed07b4fb7793" /&gt;</description>
      <comments>http://blog.invalidoperation.com/CommentView,guid,5b20c588-314d-4ce2-b9f1-ed07b4fb7793.aspx</comments>
      <category>Ado.Net</category>
      <category>database</category>
      <category>metadata</category>
      <category>MySql</category>
      <category>NDeavor</category>
      <category>Sql Server</category>
    </item>
  </channel>
</rss>