Seen as the sexiest job of the 21st century, data science has emerged as one of the most in-demand professions of recent years — taking hold with a hype that normally only surrounds celebrities. Companies worldwide put lucrative salaries, prestige and the privilege of wielding influence up for grabs to attract analytical talent. Behind all the hype is a growing importance of digital data that’s currently transforming the way we live and work.
It’s no wonder that more and more enthusiasts want to break into this new field. But before venturing into data science and analytics with one’s eyes closed, aspirants are well advised to inform themselves about available routes first. Interested candidates are encouraged to begin their journey by identifying entry points and requirements, by finding out more about how the various data subfields differ from one another, and how their CV needs refinement prior to submitting job applications.
Pursuing this train of thought further, one question naturally always arises: What exactly distinguishes the titles of data scientist and data analyst? After all, both professions seem to be tasked with extracting business value from data. The logic savvy reader perhaps already knows that sharing similarities doesn’t imply interchangeability. One could complement the other, as both work in unison towards achieving a common goal. Many phenomena out there in the wild share resemblances in some areas and differ starkly in others.
The objective of this article is to find an answer to that question of difference. Not only from the perspective of theory, but also from the lens of Mohamed Warsame, a seasoned professional and Data instructor who’s seen how data teams are working in the real world.
Data analyst: Career, job description, professional skills…
It is important to fully understand what is the role of a Data analyst. So, let’s take a look at a representative Data analyst job description coming from adverts! An entry-level one reads:
The typical Data analyst role is consulting-centric, as can be seen from the Indeed job spec example.
What they are preoccupied with for the most part is:
- Wrangling data from Excel spreadsheets and SQL databases,
- Extracting insightful conclusions via retrospective analyses and A/B tests,
- Generally providing evidence-based business advice.
The last point illustrates why reporting routines with visualisation tools such as Tableau are as pivotal as pivoted tables. Data modelling on the other hand is often limited to basic supervised learning or its stats equivalent: regression analysis.
From experience, I can also say that novice practitioners sometimes forget that the stage of supplying recommendations is invariably the most important one. They can get side-tracked by buzzwords and trendy techniques far removed from the business context. That is why it’s so important that a Data analyst learns how to excavate insights which can be acted upon, presentable in both visually compelling and digestible formats. A Data analyst is a tech savvy investigative reporter who makes insights accessible.
Data scientist: Career, job description, professional skills…
Turning now to a similar example of a typical Data scientist role from Indeed, we’ll explore some of the key differences. The first noticeable one is the length of the ‘must have’ and role responsibility sections. Certainly, much more is demanded of the average Data scientist than it is for a Data analyst, which explains in part why the former command better salaries than the latter. But is there substance behind the hype or is data science merely a modern myth?
To be fair, Data scientists are for that reason expected to be more than analytical wizards (so they need to have more analytical skills!). These professionals are supposed to be builders who employ advanced programming to create pipelines that predict and recommend in production environments with near perfect accuracy thanks to predictive models. Compared with a Data analyst, who is like an investigative reporter, a Data scientist is a lot more product development than consulting oriented. Although it’s also required of a Data scientist to provide data-led commercial advice.
Some say the title was coined to manifest that the role was a confluence of three fields: math and statistical knowledge, computer science and domain expertise. And the following quote is said to best encapsulate that: “A Data scientist is someone who is better at statistics than any software engineer, and better at software engineering than any statistician.”
Data analyst vs Data scientist: what are the main differences?
Data scientist and data volumes
Greater volumes of data mean stakes are higher: and so are expectations, too. For unlike Data analysts, who would on average be given spreadsheets with 500 thousand rows and 50 columns to make sense of on their first day, Data scientists will likely see the keys to terabytes of data with tens of thousands of columns handed over to them on day one. Everyone would then expect them to magically summon the gems of insight and wisdom out of those volumes of data. Left to their own devices, they will be expected to ingest, transform, explore and model enormous volumes of messy and unstructured data. As some witty writers on medium have said: “Data scientist is a title that conjures up almost mystical abilities of a person garnering insights from deep data lakes with ease, someone who has supernatural hands for data like a 21st century Houdini!”
Data scientist and coding knowledge
Data science is a lot more coding intensive. Even though a Data scientist and a Data analyst obtain data with the same and familiar objective in mind, their approaches and tools used differ substantially:
- A Data analyst mainly works with SQL dialects to paste manageable chunks of data into spreadsheets and programming interfaces like R Studio and Jupyter Notebooks
- A Data scientist is expected to be comfortable with working in cloud computing settings (AWS, Databricks, Hadoop, etc.).
There they ingest, process and model volumes of data whose magnitude is often referred to as Big Data. In view of that, it’s easy to see why data science jobs add those ridiculously long lists of tech-stack requirements. New hires in larger organisations inevitably inherit heaps of sometimes undocumented legacy scripts and custom algorithms that they either need to replace or maintain. With that in mind, it’s no wonder that advanced programming skills are a must-have, where it is only good to have in most entry-level data analyst positions.
Data analyst and business stakeholders
Data analysts are a lot more connected to business stakeholders. For as we’ve discovered in our long exposition of the differences, Data analyst jobs are in actual fact less coding intensive, which reveals a rather more subtle point. The careful observer of the tech world would confirm that technical complexity almost always comes with barriers. They create voids between decision making stakeholders and hands-on engineers and scientists. That, in turn, is the space which product managers fill to bridge the gap in communication. And since data science work is commonly surrounded by a fog of mystery, ordinary employees of a firm tend to prefer to reach out to analysts for help.
It’s a phenomenon I’ve frequently noticed in the world of business: a Data scientist tends to be more siloed. A Data analyst, on the other hand, tends to be more involved and engaged with other business units, readily helping with issues such as:
- fixing Excel spreadsheets,
- aiding client pitches with analytical teasers,
- contributing to overall business performance with dashboards.
We can see here that a Data analyst position is more of a consultant who likes to make a difference in the micro context.
Now that we identified the main differences between these two professionals, let’s take a look at their similarities!
Data analyst vs Data Scientist: what are the main similarities?
By affirming the few aspects which they share and outlining where they diverge, we get closer to grasping both roles better. A data science stack-exchange blogger registered as Stephan Kolassa attempted to visually demarcate differences by using a Venn Diagram (Entry 2403).
A plethora of noteworthy points can be inferred from this diagram. Among the rather more obvious that Data scientist and Data analyst roles are closely related, occupying quadrants adjacent to one another. That, in practice, means that they should always be found working in the same business units. Except in the situation where the Data analyst is bound to specific project teams as part of squads in agile frameworks.
Data insight provision for commercial advantage
Now that we settled the respective roles of these professionals, we finally realize that both use data in the service of business goals, and both need expertise in traditional statistics.
High communication skills
There also is this quadrant of communication that a Data scientist and a Data analyst mutually share: conveying useful insights to business leaders by means of data stories (telling a story by interpreting data), or creating intuitive tools which bring about ‘data-driven’ decision making.
That is the main reason why you’ll find visual, as well as verbal communication skills demanded in almost every job advertisement for both jobs.
A growing popularity trend in the public eye
In the following chart, we used a Python script to load Google search terms data from a freely available source.
We can see that the two job roles have yet another commonality. The kindred professions have witnessed a similar popularity trend in recent years, an explosive one for that matter.
By the way, if you’ve ever wondered, Google trends data comes from unbiased samples of individual search engine queries — anonymised, categorised and grouped geographically to measure public interest in particular topics. A great enhancement of this data has been released in 2016, making sentiment across all subject categories available in real time. I happen to have years of experience leveraging that data.
Data Analyst vs Data Scientist – Conclusion
All things considered, we’ve explored how the professions of Data scientist and Data analyst compare and contrast. We’ve reached an understanding on how they differ in programming intensity, data volumes used in modelling, sophistication in regards to automation, and the educational backgrounds required by looking at sample job advertisements. Sure, we anticipated differences. But surprisingly, we’ve also come to appreciate how similar the two kindred professions actually can be. In essence, they both seek to retrieve insights from datasets.
If you are one of those curious techies, you can try our Data analyst prep course for free!