I received detailed feedback based on my December 6, 2010 presentation Data Mining for SQL Server Professionals. As I posted on this blog, the whole presentation has been archived for video. I intend to continue this type of presentation for 2011 for SQL Server Professional audiences (so feel free to send in what feedback you have).
My goal in the presentation was providing an overview of what is available in this complex service technology (since SQL Server Data Mining is not an application but instead a service). Data mining should result in more questions than answers, but hopefully the questions people have after seeing a presentation would be of higher quality (as measured by what is currently available through past collective industry experience or academic research).
In this blog post, I have taken the recently received feedback (from an experienced presenter and SQL Server leader) and turned the ideas into a Q&A (question and answer) for what a SQL Server professional might want to know or ask. In total, I took the excellent feedback and constructed eight questions and answers, which also incorporate previous blog posts. In this post, the letter Q represents question, and the letter A represents answer.
Q: What is the logical/physical structure behind SQL Server Data Mining (SSDM)? For logical I would want to see the logical relations, relationships, functions, stored procedures and how they related together to facilitate DM. For physical I would want to see the actual tables and columns, views and keywords that give access to the physical objects etc.
A: I will provide some links for full documenation, and the scope of the entire architecture is beyond an hour-long presentation. Some of that structure is not documented or even accessible (meaning readable). Even SQL Server professionals can assume that Microsoft may change or improve internal mechanics among versions. In the past, I have had discussions with Microsoft about this topic, and they officially have two standards: what they might publish in Books Online (the official word) and what they leak through as undocumented features on the forums and perhaps in books (like the one authored by Jamie MacLennan and Bogdan Crivat for the 2008 version). Microsoft’s SSDM architecture supports functionality rather than provides comprehensive processing mechanics (which they may internally improve). Even at this level, I believe that Microsoft has been comparatively transparent relative to competing technologies, likely because the patents surrounding the algorithms provide a barrier, and because Microsoft wants developers to have good .NET access through ADOMD .NET and AMO. This issue is a good question, one which no one has asked at an event.
Link: Planning and Architecture (Analysis Services – Data Mining)
Continue reading “Data Mining Answers for SQL Server Professionals” »