Broadening horizons: from DBA to Data Science #1
As an IT professional we are used to spending a lot of time learning about new technologies or how we can improve our knowledge on the products we work with. For more than 10 years now I have been working with SQL Server and still continue to learn new things about the product daily. However, another technology is slowly gaining a spot in my IT heart, Data Science.
Around a year ago I took some careful steps to get a better understanding of the “BI” side of SQL Server. Keep in mind I am a pure “Engine” guy. This means I am perfectly comfortable with tuning queries or SQL Server instance performance, but terms like “cubes”, “ETL” and “BIML” scare me. Even though I have some basic knowledge of some of the products in the SQL Server stack, like SSIS and SSAS, it isn’t my primary business to deal with those tools, so my knowledge isn’t at the level I would like it to be. So I did what I am used to doing, start digging and try and learn as much about the BI area as possible.
Pretty quickly I ran, head-first, into a concrete wall. The BI area that is related to SQL Server is enormous! There are so many technologies, techniques and areas of expertise that I simply did not know where to start. So at the end of March I decided to post a simple question on Twitter, “where to start when you want to learn more about BI/Analytics”.
The amount of responses was overwhelming to say the least and I got some great directions where to start, but more importantly, I learned that the term “BI” is pretty subjective. For me this meant that I had to find my own definition of BI. I started with some of the tips I received and pretty quickly got in touch with R. From there a learned R is a (small) part of a much bigger picture, “Data Science”.
Road to Data Scientist by Swami Chandrasekaran
I found that many of the technologies and skills used in the Data Science area matched perfectly with my definition of BI, or should I say analytics, and so my journey into Data Science started…
Why switch?
This is a question I get a lot when I tell people I am working towards a better understanding of the Data Science areas. The answer is pretty short: there is no switch. I love being a DBA and the feeling I get when I optimize a query to go from 50 to 2 seconds is one of pure victory. Think of it more as broadening my horizon. I can already think of various scenarios where Data Science techniques can improve my DBA work, for instance, analyzing and visualizing query performance using R. It also works the other way around, having a solid understanding of the relational model and the SQL query language will definitely help in my Data Science study.
So why are you writing all this in a blog post?
When talking to various people about my plans (many also SQL Server DBAs), I noticed many of them are either already working with techniques like Machine Learning or are also planning to learn more about Data Science. Since there is so much information out there about the various parts of Data Science I decided to write down my experiences hoping this might help those that are thinking of following the same road I am taking right now.
It also has another advantage. Writing things down helps immensely to keep the knowledge in your head. This makes it a win-win situation, other people can learn from my experiences while I get an extra knowledge “check” when writing a blog post.
Learning path
I learned that it is important to have a plan when you decide to study an area that you are completely unfamiliar with. A plan should, in my opinion, always start with a goal, which is that place you want to reach when you start learning. In this case my goal is:
Using Data Science techniques to provide value and understanding from data
Since Data Science is a completely new area for me, which consists of many tools and techniques, I made for myself a learning path. For me following a path is very important. I can get carried away pretty quickly by specific subjects and a learning path can keep me on an efficient road to my goal.
For my learning path I use the Road to Data Scientist map by Swami Chandrasekaran which you can see above. It allows you to easily view the different Data Science areas and it suggests a learning order that makes sense to me.
Step 1
Since I prefer to understand the fundamentals before diving into more advanced topics, the first step I am working on is getting my mathematics back on the level I think is required. This means I have to work my way back to linear algebra again. The last math lessons I received are quite a few years ago, this means I have a lot of ground to cover.
The first resource I use to get my math up-to-date is a (dutch) math book that focusses on getting math knowledge up-to-date from a middle school level towards high school / university. There is an English version available of the book called “All you need in maths!”, but I am sure there are many other books that cover the same math areas.
Next to the book, another resource has already proven itself invaluable for learning math, Khan Academy. Khan Academy has an enormous amount of math related videos that are completely free and generally easy to follow.
Many people informed me math is not strictly necessary when you want to work with techniques like Machine Learning. Personally I want to know how things ”work” before I use them, so I decided to start with math first.
What’s next?
The best way to look at this blog post is as an introduction how I go in touch with Data Science and how I plan to learn more about it. In the next article I will go into more detail where I am now in my study, what I learned and where I am experiencing difficulties.
50 comments to “Broadening horizons: from DBA to Data Science #1”
Hey Enrico,
Very interesting post. I’m in a similar situation although we approached it in different ways. I look forward to more posts on this topic!
Clive
Nice to see I’m not the only SQL Server engine ‘person’ who want’s to get familiar with data science.
I’ve started with the big data science course on Coursera (Hopkins university). Now doing the R module. My statistics knowledge is also something that needs to be taken care of
Looking forward to your posts!
Thanks for the post! I also work in the SQL Server stack, and have just started a masters program in statistics for many of the same reasons you indicate. Love the visual you found… had not see that before. What a fun time to be in this space!
I too have just started Machine Learning. The Coursera course by Stanford is excellent and includes refresher modules on Linear Algebra. Andrew Ng breaks it all down into understandable small pieces and then builds on those. We have a study group – which also really helps.
Good to see this article, I’m in the same boat as well. Been a SQL DBA for over 10 yrs.Enrico, it will be interesting to know how your dealing with the road map to data science, please keep us updated.
Message *
I do consider all of the concepts you have presented in your
post. They are really convincing and can certainly work.
Still, the posts are too brief for newbies.
Could you please prolong them a bit from subsequent time?
Thanks for the post.
The finest odds have been on “fantastic,” “incredible,” “amazing” and “tremendous,” even though “not our fault” was near the bottom.
Good information. Lucky me I found your website by accident (stumbleupon).
I’ve saved as a favorite for later!
Definitely imagine that that you said. Your favourite reason seemed
to be on the web the easiest thing to be mindful of.
I say to you, I definitely get irked at the same time as people consider worries that they plainly do not realize about.
You controlled to hit the nail upon the top as neatly as defined out the entire thing without having side-effects , folks
could take a signal. Will likely be back to get more. Thank
you
What’s up, for all time i used to check blog posts here in the early
hours in the morning, because i love to learn more and more.
Unquestionably imagine that that you stated. Your favourite justification seemed to be on the net the simplest factor to
have in mind of. I say to you, I certainly get annoyed at the same time as other people consider issues that they
plainly don’t realize about. You controlled to hit the nail upon the top and outlined out the entire thing without having side-effects , other people
could take a signal. Will likely be back to get more.
Thanks
Thanks for finally writing about > Broadening horizons: from DBA to Data Science #1 – DotNine < Loved it!
May I simply just say what a relief to find someone who truly understands what they’re
discussing on the web. You certainly realize how
to bring an issue to light and make it important. More people must read this and understand this side of your story.
It’s surprising you’re not more popular since you certainly have the
gift.
Resorts Casino Hotelis property to theDraftKings Sportsbook at
Resorts.
Truly when someone doesn’t understand afterward its up to other users that they will help,
so here it occurs.
Betting competitions amongst Broncos fans will turn up the heat
in bettors’ ideal friends’ garages.
Why visitors still make use of to read news papers when in this
technological globe all is accessible on net?
State regulatory authorities have made 33 skins readily available,
linked with the state’s 33 brick-and-mortar casinos.
Inspiring story there. What happened after? Thanks! //bestbetfc.com
It’s not my first time to visit this web site, i am visiting this site dailly and get nice data from here daily.
Really no matter if someone doesn’t be aware of after that its up to other viewers that they will assist,
so here it happens.
Why viewers still use to read news papers when in this technological world everything is
accessible on web?
Very rapidly this site will be famous amid all blog viewers, due to it’s pleasant posts
It is telling that the job search app is cost-free, to encourage
extra use of it.
excellent submit, very informative. I’m wondering why the other experts of this sector
do not understand this. You must proceed your writing.
I am confident, you’ve a great readers’ base already!
Thanks for a marvelous posting! I really enjoyed reading it, you are a
great author. I will ensure that I bookmark your blog
and will eventually come back at some point. I want to encourage you to definitely continue your great
posts, have a nice day!
دمنوش لاغری دکتر بیز و خرید اینترنتی دمنوش لاغری دکتر بیز و اطلاعات دمنوش لاغری دکتر بیز و فواید دمنوش لاغری دکتر بیز
thanks for your site
best web desgin shiraz and digital marketing year 2021
مطلب بسیار مفیدی بود ممنون
با تشکر بسیار مفید فایده بود
فروشگاه اینترنتی ای فردا از شما تشکر میکند
tanx for website amoseshkhan
Hi, its nice paragraph about media print, we all understand media is a impressive source
of data.
you have Good article on this topic. plz read my article too
خرید دمنوش لاغری دکتر بیز
سوالات استخدامی های مهم ایران
//ramzifa.ir/6awc
حتما شنیدهاید که تریدرهای بزرگ اکثرا برای انجام ترید در فارکس، بورس و صرافیهایی مانند بایننس از سرور مجازی فارکس (VPS) استفاده میکنند. شاید با خودتان فکر کرده باشید که آیا واقعا استفاده از سرور مجازی ترید لازم است؟ در این نوشتار سعی میشود که به سوالاتی در این زمینه پاسخ داده شود.
You have a good site.
The article was also nice
You have a very good site. The article was also interesting
Good luck
It was a good article, do not be bored
hi this article very cool!
تور قشم
تور آف خورده قشم
تور قشم از اصفهان آسمان هفتم
تور قشم از تهران
تور قشم لحظه آخری
تور قشم با قطار علی بابا
تور لحظه آخری قشم
تورقشم
خرید اینترنتی تور قشم
تور آف خورده قشم از اصفهان
تور قشم هوایی
تور هوایی قشم از تهران
تور قشم علی بابا از اصفهان
تور آف خورده قشم از مشهد
تور قشم با قطار
تور قشم از اصفهان
تور لحظه آخری قشم از تهران
قیمت تور قشم از تهران
تور قشم علی بابا
قیمت تور قشم
رزرو تور قشم
تور قشم از اصفهان علی بابا
تور لحظه اخری قشم
The article was great, please continue this Morad again tahviehnazari.ir