Archive for the ‘Cool Technologies’ Category

New on “Database Trends & Applications”

Tuesday, August 16th, 2011

In last month’s column, “2012 Might Really Be the End of the World as We Know It,” I described a number of major developments in the IT industry that are likely to disrupt the life of database professionals everywhere.  I categorize those four disruptors – virtualization, cloud computing, solid state drives (SSD), and advanced multi-core CPUs – into two broad groups.  I’m going to continue an analysis of these disruptive technologies in inverse order.  Today, let’s discuss SSDs.

[READ MORE ON DATABASE TRENDS & APPLICATIONS ]

Microsoft Document Watch for Operational Excellence

Monday, August 8th, 2011

Back when my day-to-day duties included database administration work and enterprise architecture, I became rather obsessed with the idea of operational excellence.  I read everything I could on the topic.  I made a list of favorites, which became somewhat shabby over time, as I dog-eared important pages and scribbled notes in the margins.  (Perhaps that list of favorites might, in and of itself, make a good blog post).  Fast-forward a decade and I’m still mightily interested in operational excellence for IT organizations.  It’s just that so much good material is available for free on the web.

Here’s a run-down of several useful documents and downloads to improve overall operation performance for those of you in a Microsoft-centric IT organization:

Microsoft Operations Framework

Microsoft Operations Framework (MOF) version 4.0 guide is practical guidance for IT organizations. With the release of version 4.0, MOF now reflects a single, comprehensive IT service lifecycle—it helps IT professionals connect service management principles to everyday IT tasks and activities and ensures alignment between IT and the business.

Infrastructure Planning and Design

The Infrastructure Planning and Design (IPD) guides are the next version of Windows Server System Reference Architecture. The guides in this series help clarify and streamline design processes for Microsoft infrastructure technologies, with each guide addressing a unique infrastructure technology or scenario.

Microsoft Baseline Security Analyzer 2.2 (for IT Professionals)

The Microsoft Baseline Security Analyzer provides a streamlined method to identify missing security updates and common security misconfigurations. MBSA 2.2 is a minor upgrade correct minor issues and add optional catalog support.

Security Compliance Manager

The Microsoft Security Compliance Manager provides centralized security baseline management features, a baseline portfolio, customization capabilities, and security baseline export flexibility to accelerate your organization’s ability to efficiently manage the security and compliance process for the most widely used Microsoft technologies.

ETL Demo with Data From Data.Gov

Friday, August 5th, 2011

A little over a month ago, I wrote an article (Is There Such a Thing as Easy ETL) about expressor software and their desktop ETL application, expressor Studio.  I wrote about how it seemed much easier than the native ETL tools in SQL Server when I was reading up on the tool, but that the “proof would be in the pudding” so to speak when I actually tried it out loading some free (and incredibly useful) data from the US federal data clearinghouse, Data.Gov.

If you’d rather not read my entire previous article – quick recap, expressor Studio uses “semantic types” to manage and abstract mappings between sources and targets. In essence, these types are used for describing data in terms that humans can understand—instead of describing data in terms that computers can understand. The idea of semantic abstraction is quite intriguing and it gave me an excuse to use data from data.gov to build a quick demo. You can download the complete data set I used from the following location: International Statistics.  (Note: I have this dream that I’m going to someday download all of this free statistical data sets, build a bunch of amazing and high-value analytics, and make a mint.  If, instead, YOU do all of those things, then please pay to send at least one of my seven kids to college in repayment for the inspiration.  I’m not kidding.  I have SEVEN kids. God help me).

The federal government, to their credit, has made great progress in making data available.  However, there is a big difference between accessing data and understanding data. When I first looked at one of the data files I downloaded, I figured it was going to take me years to decrypt the field names. Luckily, I did notice an Excel file with field names and descriptions. Seriously, there are single letter field names in these files where the field name “G” has a description of “Age group indicator” (Oh Wow).  See the figure below.

It’s stuff like this that reminds me why we have data quality and master data management tools.  Ok, back to expressor Studio. I quickly mapped a couple of files into expressor Studio using their “Read File” operator. It was fairly simple and easy to use. My data included files with country area information, population, and gender information by year. Once I mapped these files I quickly wanted to shed the default cryptic, nay, nonsensical names. I could have just renamed the fields when I initially mapped them into the system but that would mean I would have to manage the names in three separate locations. Bah! It made more sense to create a common semantic type and reuse it across all three files.

There are two flavors of semantic types within expressor Studio to handle your mappings, atomic types or composite types. An atomic type is simply a single field name whereas a composite type is a combination of one more atomic types. Since the data files had many common fields, I decided to create a core set of atomic types that I could then roll up into composite types based on the files I was mapping. This kept the mappings simple and easy to understand and most importantly the whole exercise took about 5 minutes. Once the types were created I simply mapped the cryptic names from the files to the business friendly names in my semantic type.  (I can’t even begin to imagine how long this would’ve taken to build using native tools, but certainly not 5 minutes).

Now I was ready to move my data. I took the data from three files and combined them into one master dataset. From there, my international statistics from Data.Gov were pumped right into my waiting SQL Server database.  Note that I could’ve used Excel or just about any other database as my target instead of SQL Server.

Now, you might be saying to yourself “That looks easy because you read all the help files first.”  Actually, no.  In fact, some of my buddies like to lovingly tell me to “RTFM” from time to time.  It’s not that it offends my masculinity to read a manual.  I just usually like to have a go first and then, if needed, go back to the manual.  In fact, all I really used was this 5-minute demo video that in noticed when I was downloading the tool.

If you’re tackling ETL and you want it fast and easy, then you might want to check out their website, www.expressor-software.com, to learn more about the expressor company and products.

Enjoy!

-Kev

P.S. Follow me on Twitter!

 

 

Blogger, Have You Heard of Microsoft Broadcaster?

Friday, August 5th, 2011

 

Introducing the Microsoft Broadcaster: A self-service technical content repository that provides you with rich content for your blogs/sites.

On behalf of Microsoft, I would like to invite you to dive into Microsoft Broadcaster.  It’s a site that unleashes a torrent of free technical content like Videos, Webcasts, Whitepapers, eLearning, and more, which you can use in your blogs or user group site to drive deeper engagement with your audience and community.

Key Features of Microsoft Broadcaster include:

  • Download and embed content on your sites without sending traffic away from your site with Microsoft’s full blessing.
  • Customize site by product (e.g. Windows, SharePoint, or Office), by keyword, or by form factor (e.g. webcasts, videos, podcasts)
  • Set alerts for content updates.

To take advantage of the offer, visit www.microsoftbroadcaster.com.  For a more detailed information and program overview please visit http://www.microsoftbroadcaster.com/en-us/faq

Now, It’s time for me to get involved and start to participate in the program!

Enjoy!

-Kevin

P.S. Follow me on Twitter!

Quest Named to the SD Times 100 List

Thursday, August 4th, 2011

My employer, Quest Software, was once again named to the SD Times 100 list, this year in the “database and integration” category! The program recognizes the top leaders and innovators, looking back at each company’s achievements from the year before. Out unwavering market leadership was also reinforced in 2010 with IDC ranking Quest as “the top independent database management and development tools vendor serving distributed and other non-mainframe systems”·    Other innovations contributed to our win, such:

  1. Ongoing innovation within the Toad product family, including the introduction of Toad for Cloud Databases and OraOop in support of emerging database technologies
  2. The release of new tools tailored for the application developer, such as our Toad Extensions for Visual Studio and Eclipse
  3. Community initiatives like NoSQLPedia aimed at educating the traditional relational database professional on new database platforms
  4. And the launch of Project Lucy, our free Azure-based diagnostic service for SQL Server professionals

If you want to check out some of the buzz/congratulations taking place on Twitter for Quest and the other organizations that made the list, follow the #sdtimes100 hashtag.

Thanks for your support!  And if you’re not currently a customer of mine, but are curious, drop me a note for a free long-term license.

Enjoy,

-Kev

P.S. Follow me on Twitter!

What I’m Reading, July 22 2011

Thursday, July 21st, 2011

I read too much, and that, my friends, is an entirely separate topic for a blog post. But I thought I’d share with you a little more about what I’m reading because sometimes, if I’m lucky, it might be something you’d enjoy too.

So I’m going to start sharing what I’m reading at least once per week, partly so that I don’t firehose too many reading links directly into your brain (where I to do it say once per month) and partly to solidify in my own mind the information that I’m reviewing. So here are a few good links for the seven days leading up to July 22, 2001:

  • Microsoft and Whitehouse partnership on BigData: BigData isn’t a particularly new concept.  But I was intrigued to learn that the National Science Foundation, Microsoft, and 13 other teams were partnering on developing better BigData analytics for lots of government data from activities such as healthcare, economic development, education, transportation, and the power grid.  Cools stuff!  Plus, Microsoft has developed a new tool called Project Daytona to better harness the power of the cloud, in general, and Windows Azure, specifically.
  • While we’re on the topic of Federal IT in the Cloud be sure to read this linked article from ComputerWorld.  Say what you will about our government, but putting government IT in the cloud and increasing both its transparency and availability will make a huge difference in how the Federal government will be able to service the public.  We’re talking as big a difference as corporations experienced between the “catalog on the web” experience of the 1990′s to the Web2.0 experience of today.
  • If you’re the social media type, give this article a read discussing the Power of Hashtags in Social Media.
  • The Register, of the UK, whose tagline is “Biting the hand that feeds IT” has a great article on a spat over database technologies between the IT sage Michael Stonebreaker and Google.  It’s a great read if for no other reason than to prove that databases are worth fighting over.
  • And if you think Microsoft is still towing the relational database barge without thinking about other technologies, you need to read up on Projects Dryad and Daytona.
  • Finally, I’m still getting lots of questions about when and where to limit SQL Server’s Max Degrees of Parallelism.  Be sure to read Microsoft’s Recommendations and Guidelines for ‘max degree of parallelism’ configuration option here.

And just because so many of us in IT are closet or former musicians, there’s Live Guitar Lessons with Steven Krenz, sponsored by my hometown boyz at Gibson Guitar.

Got a favorite article or tool tip? Let me know!  Enjoy,

-Kev

Follow me on Twitter.

Microsoft Windows Platforms Blog Watch

Wednesday, July 20th, 2011

Remote Desktop Services Component Architecture Poster

Grab your own poster! A visual guide to key Remote Desktop Services technologies in Windows Server 2008R2

 

Virtually Free

Get the latest update rollup package for the Hyper-V role in Windows Server 2008 R2 and be sure to bookmark the Windows Virtualization Team Blog.  Be sure to check out blog World Simplified is a Virtual World. And doncha evah neglect application virtualization, such as the goodness at the App-V Product Team Blog.

 

Let’s Optimize Some Desktops (Assuming You Have Gone Full Cloud Yet)

Check out The Official Microsoft Desktop Optimization Pack (MDOP) Blog where you can get cool tools like the Diagnostics and Recovery Toolset (DART), currently in its v7 beta release.

 

We Can Always Perform a Little Better

Have you checked out the Ask the Performance Team blog yet.  You should!  Then there’s the Microsoft Enterprise Networking Team blog.  And if you’re looking for help with the Windows Server Core, be sure to check out the Ask the Core Team blog.  Excellent stuff!

As the Tool Time columnist at SQL Server Magazine, I’m always on the lookout for great free tools that get on-going support from their creators.  One common experience is finding a nice debugging tool, only to discover that there’s no information on how to interpret the debugger result sets!  (SQLIOSim anyone?)  That’s why I love the Microsoft Advanced Windows Debugging and Troubleshooting blog.  Another must-have on your Favorites list.

 

Enjoy!

-Kev

Follow me on Twitter

 

 

Is There Such a Thing as Easy ETL?

Thursday, June 30th, 2011

E.T.L. That’s Extract – Transform – Load.  That doesn’t sound like a lot of work when all you need to get loaded is a simple Access database or an Excel spreadsheet.  In a situation like that, the process is so simple, all you really need to focus on is the L in ETL.  There’s not a whole lot of E.T. to process, despite how wonderful that movie is. [pun intended]  But as soon as your data loading process involves some difficult or sophisticated cleansing or transformations, it gets really, really hard.

The other cross-thread that had really caught my interest lately is the USA federal governments Open Data Initiative.  I think it’s remarkable that President Obama is the first president to appoint a federal CIO.  (Shouldn’t that have happened in the past?)  In addition, President Obama instructed the entire executive branch to open up their data (where security isn’t at risk) and make it readily available to the public.  And the US government collects mountains of interesting and valuable data for its own uses, but figuring out how or who to share it with was always an afterthought.  While I was a contractor for NASA, for example, I worked on some incredibly interesting projects which yielded amazing and commercially valuable information.  It was all public domain.  But unless you knew it was there, you couldn’t get to it. Making use of all of that data always intrigued me.

Now, with ODI, it’s all being put on the internet at an ever-increasing rate at Data.gov.  However, all of this data, while open and available, is not standardized.  Some data sets might be a CSV file, while others might be something like a spreadsheet.  That means you’ll need to extract, transform, and load that data if you want to synthesize more valuable data sets.

For those reasons, I’ve been researching tools to help make this process easier.  (I also wanted to research SSIS and ETL tools for my Tool Time column in SQL Server Magazine.)  Now, I’ve been following expressor software for quite some time and really like their unique approach.  (I actually ran into the expressor software team at a PASS Summit one or two years ago and asked for a demo of their software.  And I really liked what I saw.)  Rather than the workflow approach used by SSIS, expressor software uses a data mapping approach combined with reusable business rules.  Their mapping approach is fundamentally different from the traditional point-to-point, source-to-target mappings paradigm.  Basically, you can define a semantic type representative of your business data, create a business rule(s) to apply to the data, and then implement a “canonical” mapping which connects data sources and targets to that same semantic type.  And it’s free!

 

Abstraction is Awesome

What’s cool about that?  Don’t forget that “semantic” means “meaning”.  So a semantic type is an abstraction of the meaning of the data.  The net result is that expressor shields your data integration application, with its associated business and transformation rules, from changes that might occur to underlying target or source files with different field names and data type representations have to be processed. 

For example, let’s assume that you need to process invoices from different vendors in slightly different formats.  If you use a traditional ETL tool like SSIS, any changes in the source and/or target formats will require you to modify your data mappings and transformation rules, because the mappings are tied directly to the metadata structure of the invoice file format(s). expressor, on the other hand, lets you define a common “invoice” semantic type, build all your downstream data processing off that type and map one or multiple invoice file schemas to the type.

This approach greatly simplifies the mapping process and provides for more flexible data integration applications that can be more easily adapted to changes in the source and target data sources.

expressor Studio Desktop

 

 

Benefits Abound

Since the semantic types in expressor are captured as reusable artifacts, you can also reuse them again in new data flows within your project(s).  You can even share them across your entire organization.  As I tinkered with the expressor Studio tool, I hit on a few other benefits with this approach:

  • Handles data type conversions automatically without having to write data transformation rules for these conversions
  • Builds new semantic types from existing types and reuses types in existing and new applications
  • Creates multiple, reusable business rules against a single type and applies them repeatedly as needed
  • Easily implements data quality rules and constraints

In an Ideal World…

In an ideal world, I’d figure out some brilliant way to make money from bringing together all kinds of that government data that I used to work with.  Other folks are doing it at the Windows Azure Data Market.  But in the meanwhile, I’m also looking forward to tinkering with this data to build better demos.  Along the way, I’m going to use the expressor Studio desktop ETL tool (Did I mention that it’s free!) as well as tell you about my experiences as I try to build out some Data.gov data sets.

Those of you who know me, know that I look a good discussion and cooperative, constructive team work.  So I encourage your feedback and suggestions, as I work through these data integration challenges and share my experiences.  I’m looking forward to sharing with you my insights on what the expressor data integration software can do with this challenge and what some of its features and capabilities are.  In upcoming releases, I’ll let you know what I find intriguing and worth mentioning.

Check out their website, www.expressor-software.com, to learn more about their company and products.

Enjoy,

-Kev

Follow me on Twitter

New Releases in the Toad Family of Tools

Wednesday, June 8th, 2011

Toad Extension for Visual Studio 1.2 is now live. You can access it HERE.

Key new features in Toad Extension for Visual Studio v1.2:

  • Deployment Scripts: the pre- and post-deployment scripts of TFS are now supported.
  • XML Columns: supports tables with XML data type columns.
  • Nested Tables: supports tables with up to two levels of nested table type columns.
  • Synonyms: synonyms can now be imported into your DB project
  • Source Navigation: You can now use F12 navigation to find the definition of items in the code editor.
  • Stored Procedures: generate Oracle stored procedures that use insert, update, or delete statements.

 

Toad for Data Analysts 3.0 us now in free beta. You can access it HERE or at ToadWorld.com

Discover, query, and analyze data across any data source without being a SQL expert.  Toad for Data Analysts (TDA) is a data discovery tool designed for data management professionals and analysts who need to access a wide variety of traditional, non-traditional, and emerging data sources to visually run queries, federate and analyze data, and create reports to support analytics and business intelligence.  With Toad for Data Analysts 3.0, you can:

  • Easily discover data across disparate data sources including leading RDBMS platforms, all ODBC enabled sources, and Netezza, Teradata, Excel, Hadoop and NoSQL
  • Harness the power of SQL to visually create complex queries, visually analyze data, and create reports for self-service business intelligence and analytics without being a  SQL expert
  • Work offline or on-the-go with Toad’s embedded database where you can integrate and cleanse data right from your desktop
  • Dramatically increase productivity by automating reporting and sharing of information and reports

 

It’s old news, but two other products in the Toad family have seen a recent refresh:

Benchmark Factory for Databases v6.5 You can access it HERE.

The highlight of this release is wizard-driven workload capture and replay for Oracle, enabling users to accurately anticipate and measure the performance impact of database changes such as patches and upgrades, OS migrations and more. Benchmark Factory is a very flexible, simple database scalability testing tool and, if you’re an Oracle user, also less expensive than Oracle Real Application Testing.  If you’re like me and want shortcuts, watch THIS VIDEO to see what the tool can do without the need to download, install, and tinker with it.
Quest Code Tester for Oracle v2.0 available HERE.  I wish we had this for the other database platforms!

The highlight of this release is the new Test Explorer console that replaces the Test Dashboard. The Test Explorer offers a simplified workflow and greater usability in creating and managing the entire code testing process.

NOSQL- A Quick Overview

Thursday, June 2nd, 2011

Several attendees at the SQLRally were asking about NoSQL (“Not Only SQL”) and its benefits. This article gives a quick overview: HERE.

Also, two good resources have come out from my friend Guy Harrison. This piece on Cassandra, one of the most popular NoSQL databases, was published a while back in Database Trends and Applications Magazine. I also really liked these entries from Guy at TechRepublic and TechTarget.  Guy also had the good fortune of having one of his articles published on GigaOm and then picked up in the New York Times Technology section.

The article, titled “Real World NoSQL: HBase at Trend Micro,” is the first in a five-part series Guy wrote, spotlighting NoSQL (non-relational) database deployments at five different companies. The other good learning experience for me was hearing about GigaOm for the first time.  GigaOm, now on my reading list, is considered an influential and prestigious publication in the NoSQL realm.

Let me know what you think.  Enjoy!

-Kev

~~~

Twitter at kekline

More content on my Blog