How To Grow As A Data Scientist Part 2

Benjamin Rogojan Benjamin Rogojan
January 9, 2018 Big Data, Cloud & DevOps
Ready to learn Data Science? Browse courses like Data Science Training and Certification developed by industry thought leaders and Experfy in Harvard Innovation Lab.
We wanted to follow up our previous piece about how to grow as a data scientist with some other skills senior data scientists should have. Our hope is to bridge the gap between business managers and technical data scientists by creating clear goals senior data scientists can aim for. Both entities have to take on very different problems. Both benefit when they are on the same page. This is why the previous post focused so highly on communication. It seems simple, but the gap between technical and business continues to grow as new technologies keep getting piled on every year. Thus, we find it important that managers and data scientists have a clear path of expectations.
Both business and IT knowledge are very specialized. However, due to this specialization of skills, most businesses see a gap between the two specializations. Our role is to help fill it!
We find that is beneficial when data scientists are starting their journey that they focus heavily on the technical aspects. This means programming, queries, data cleansing, etc. However, as data scientists grow. They need to focus more on design decisions and communication with management. This will multiply the impact of the more experienced data scientists knowledge. Instead of being stuck in the day to day of coding. They can make higher level decisions and help the younger data scientists if they get stuck. More experienced data scientists benefit both themselves and their companies more when they are utilizing their experience to help make design decisions that simplify complex systems, optimize data flows, and help make decisions on what projects are most pertinent.

Being Able To Simplify The Complex

Data scientists have a tendency to want to use every technique and algorithm they know on every problem and in every solution. In turn, this creates complex systems that are difficult to maintain.
Data science does require complex and abstract modeling as well as plethora of complex technologies (from Hadoop to Tensorflow). With all the complexity that surrounds the field, it is tempting to develop systems and algorithms that are in turn complex. There is the temptation to involve 4 or 5 different technologies and utilize every new hot algorithm or framework. However, like most other fields that have some engineering involved. Reducing complexity is often better for multiple reasons.
With all the complexity that surrounds the field, it is tempting to develop systems and algorithms that are in turn complex. There is the temptation to involve 4 or 5 different technologies and utilize every new hot algorithm or framework. However, like most other fields that have some engineering involved. Reducing complexity is often better for multiple reasons.
If John Nash, Erwin Schrödinger and Albert Einstein can help us understand the complexities of their very math and physics driven fields, then we data scientists can’t hide behind complexity.
The role of an engineer is to simplify a task. If you have ever built or seen a Rube Goldberg machine you will understand the idea of over engineering a simple task. Some data scientists algorithms and data systems would look more like some crazy mouse trap held together by duct tape and gum instead of an elegant but effective solution. Making simpler systems means the systems will be easier to maintain over time as well as provide future data scientists the ability to add and take away modules as needed. The next data scientist taking your position will thank you if you create a simple framework. On the other hand, if you use 3 different languages, 2 data sources, 10 algorithms and leave no documentation, then just know the future engineer is cursing your name under his breath.
Simple algorithms and systems also allow for easier additions and subtractions to be made. Thus, as technology changes and updates are required or a module needs to be taken out. A poor future data scientist isn’t stuck with playing a game of Jenga with your code. If I remove this block of code, will everything fall apart(have you heard of technical debt?)

Knowing How To Mesh Data Without Primary Keys

One of the big values strong data experts should provide is tying together data sets that might not inherently have a primary or obvious connection. Data can represent a person or business’s day to day interactions. Having the ability to find statistical patterns in this data is what allows data scientists the ability to help decision makers make wise choices. However, the data you would like to mesh together is not always on the same system or the same granularity.
Those who have worked with data will know it is not always integrated together nicely in one database. Finance data is often kept separate from IT Service Management data, and outside data sources might not have the same level of aggregation. This is a problem because to find value in data sometimes requires data from other departments and systems.
For example, what if you are provided medical claims, credit card and criminal rates of neighborhoods and want to figure out how these socio-economic factors affect the patient?. Some sets of data might be on a person by person level while the others might be on a street or city level with no clear method to connect the the data sets. What is the best way to proceed? This becomes a design issue that one, must be recorded and two must be thought out.
Each situation is different as there are many ways to mesh data. It could be based of region, traits, spending habits, etc. This is why experience is important. An experienced data scientist will have the intuition on how the data can be joined. Mostly because they have already tried hundreds of methods that don’t work. Often times, the closer you can combine both data sets to person by person the better. So if region or city happens to be the lowest level (Lowest level refers to granularity of the data, like person level, household level, street level, city level, state level, or many other groupings ) of connection, then that would be a great place to start.

Being Able To Prioritize Projects

As a data scientist, you have to know how to explain the ROI of projects that might not turn out. This is just about good direct communication(Our team will never stop talking about communication). This is about being able to articulate value as well as prioritize long term vs short term goals(again, easier said than done).
Teams will always have more projects and project requests than they can handle. More experienced team members need to take the lead and help their managers decide which projects are actually worth taking on. There is a fine balance between quick projects that might not have the highest ROI but have a good chance of succeeding and long term projects that are more likely to fail but also provide a large ROI.
In this case it is good to have a decision matrix of sorts to help simplify the process.
One of the classical decision matrices for projects is a 2 by 2 matrix that is importance and urgency. This matrix can be found in most business courses at college and it is really simple. That is why it is great!
I have worked at companies with really smart people. Yet, every project was treated as a priority and if you haven’t heard the saying, we will say it here.
If everything is a priority then nothing is.
Many other companies have this problem. This is why it is important for the experienced members of the data science teams to be to clearly articulate which projects really should be done now, vs later. Thus, using the simple matrix will do that.
(Like we said in our last post, being concise is important. Using the matrix to help specify ROI will help).
When there is concise and straightforward communication, projects continue to move forward and trust is built.

Being Able To Develop Robust And Optimal Systems

Making an algorithm or model that operates in a controlled environment is one thing. Integrating a robust model into a system that is live and deals with massive amounts of data is a whole other thing. Depending on the company, sometimes the data scientist will just have to develop the algorithm itself. Then either a developer or machine learning engineer will be responsible for putting it into production.
However, this is not always the case. Smaller companies, and smaller teams might have the data science team put the code into production. This means the algorithm needs to be able to manage the data traffic at a reasonable speed. If your algorithm takes 3 hours to run and needs to be accessed live. It is not going into production. Thus, good system design and optimization is necessary.
Data science is complex field that requires an understanding of data, statistics, programming, and subject matters. In order to grow, data scientists need to be able to simplify and distill these complexities into algorithms. They need to be able to focus more on making design decisions. This helps maximize their knowledge and experience that they have.

Summary

Senior data experts provide the largest impact for both themselves and their companies when they go beyond their technical abilities. The value they bring to the table is their experience, it can help guide younger developers to make better design decisions, and help managers make better decisions on which projects will have the best ROI. In turn, this magnifies the impact of their involvement on the team.

Call To Action

Are you an executive or director that needs help improving your communication between your data science team and your business owners? We want to help! Our team specializes in seminars to help improve communication and output of your data driven teams. Reach out to us today!
Interested In Reading More About Being A Better Data Scientist?
How To Grow As A Data Scientist
Boosting Bagging And Building Better Algorithms
How To Survive Corporate Politics As A Data Scientist
8 Top Python Libraries For Machine Learning
What Is A Decision Tree
  • Experfy Insights

    Top articles, research, podcasts, webinars and more delivered to you monthly.

  • Benjamin Rogojan

    Tags
    Big Data & Technology
    © 2021, Experfy Inc. All rights reserved.
    Leave a Comment
    Next Post
    Tone down your AI expectations

    Tone down your AI expectations

    Comments 65

    1. Emilio says:
      5 years ago

      There’s certainly a lot to find out about this topic. I like
      all of the points you made.

      Reply
    2. Jasper says:
      5 years ago

      If some one wants to be updated with hottest technologies then he
      must be pay a quick visit this web site and be up to date daily.

      Reply
    3. Chandra says:
      5 years ago

      Hiya! I know this is kinda off topic however I’d figured
      I’d ask. Would you be interested in trading links or maybe guest authoring a blog article or vice-versa?
      My blog discusses a lot of the same topics as
      yours and I think we could greatly benefit from each other.

      If you are interested feel free to shoot me an e-mail.
      I look forward to hearing from you! Wonderful
      blog by the way!

      Reply
    4. Barb says:
      5 years ago

      What’s up, just wanted to tell you, I enjoyed this article.
      It was helpful. Keep on posting!

      Reply
    5. Dixie says:
      5 years ago

      I’ve read a few excellent stuff here. Definitely price bookmarking for revisiting.
      I surprise how so much effort you set to make this sort of wonderful informative site.

      Reply
    6. Mitch says:
      5 years ago

      You could certainly see your expertise in the article you write.

      The sector hopes for more passionate writers such as you who
      are not afraid to say how they believe. At all times
      go after your heart.

      Reply
    7. รับสอน SEO says:
      5 years ago

      I pay a quick visit day-to-day some web sites and information sites to read articles or reviews, except this weblog gives feature based articles.

      Reply
    8. اطاريح دكتوراه says:
      5 years ago

      This site was… how do you say it? Relevant!! Finally I’ve found something that helped me.
      Thanks a lot!

      Reply
    9. كلية التربية الاساسية says:
      5 years ago

      Pretty great post. I just stumbled upon your blog and wanted to mention that I
      have really loved browsing your weblog posts. In any case I’ll be subscribing on your feed and I
      hope you write again soon!

      Reply
    10. اطاريح دكتوراه says:
      5 years ago

      A fascinating discussion is worth comment.
      I believe that you ought to publish more on this subject matter, it may not be a taboo matter but usually people don’t speak about
      such subjects. To the next! All the best!!

      Reply
    11. como pregar says:
      5 years ago

      Thank you for sharing your info. I truly appreciate your efforts and I am waiting for your further post thanks once again.

      Reply
    12. Fran says:
      5 years ago

      I’m more than happy to discover this great site.

      I wanted to thank you for your time for this fantastic read!!

      I definitely liked every bit of it and I have you book-marked to look
      at new stuff in your blog.

      Reply
    13. Louie says:
      5 years ago

      Thanks very interesting blog!

      Reply
    14. Klaudia says:
      5 years ago

      Keep on working, great job!

      Reply
    15. Janelle says:
      5 years ago

      There’s certainly a great deal to know about this issue.
      I love all the points you have made.

      Reply
    16. Joe says:
      5 years ago

      I do trust all the concepts you have introduced for your post.
      They’re really convincing and can certainly work.

      Still, the posts are very quick for starters. Could you please extend
      them a bit from next time? Thanks for the post.

      Reply
    17. Ludie says:
      5 years ago

      I always spent my half an hour to read this web site’s posts everyday along with
      a mug of coffee.

      Reply
    18. Evonne says:
      5 years ago

      Greate post. Keep writing such kind of info on your blog.
      Im really impressed by it.
      Hey there, You’ve done a great job. I’ll definitely digg it and in my opinion recommend to my
      friends. I’m sure they will be benefited from this website.

      Reply
    19. Patricia says:
      5 years ago

      Good post. I definitely love this website. Stick with it!

      Reply
    20. Corina says:
      5 years ago

      Appreciate the recommendation. Let me try it out.

      Reply
    21. Aurelio says:
      5 years ago

      Awesome! Its truly awesome post, I have got much clear idea concerning
      from this article.

      Reply
    22. Greg says:
      5 years ago

      Good way of describing, and nice paragraph to get information about my presentation focus, which i am going
      to convey in university.

      Reply
    23. Sherryl says:
      5 years ago

      There’s definately a great deal to learn about this subject.
      I like all the points you’ve made.

      Reply
    24. Aundrea says:
      5 years ago

      If some one needs expert view concerning blogging and site-building afterward i suggest him/her to visit this
      blog, Keep up the nice work.

      Reply
    25. Lloyd says:
      5 years ago

      Fine way of telling, and nice piece of writing to take data regarding my presentation focus,
      which i am going to convey in academy.

      Reply
    26. Delila says:
      5 years ago

      Hey There. I found your blog using msn. This is a very well written article.
      I’ll make sure to bookmark it and return to read more of your useful
      information. Thanks for the post. I’ll certainly comeback.

      Reply
    27. Epifania says:
      5 years ago

      Hello there! Do you use Twitter? I’d like to
      follow you if that would be ok. I’m undoubtedly enjoying your blog and look
      forward to new updates.

      Reply
    28. Dwayne says:
      5 years ago

      Why users still make use of to read news papers when in this
      technological world everything is accessible
      on web?

      Reply
    29. Phillis says:
      5 years ago

      Hello to every single one, it’s really a good for me to visit this website, it consists of
      useful Information.

      Reply
    30. Magda says:
      5 years ago

      What’s up, after reading this amazing article i am as well delighted to share my
      knowledge here with colleagues.

      Reply
    31. Tami says:
      5 years ago

      Good way of describing, and nice post to obtain information concerning my presentation subject, which i am going to present in university.

      Reply
    32. Lonnie says:
      5 years ago

      Hello there! I could have sworn I’ve been to this web site before but after going through some of the articles I realized it’s new to me.
      Regardless, I’m certainly happy I found it and I’ll be bookmarking it and
      checking back regularly!

      Reply
    33. Clifford says:
      5 years ago

      My family members always say that I am killing my time here at
      net, however I know I am getting know-how all the
      time by reading thes good articles.

      Reply
    34. Homer says:
      5 years ago

      Very nice post. I just stumbled upon your weblog
      and wanted to say that I have truly enjoyed
      surfing around your blog posts. After all I’ll be subscribing
      to your rss feed and I hope you write again very soon!

      Reply
    35. Willis says:
      5 years ago

      Your method of describing all in this piece of writing is
      genuinely nice, every one be capable of without difficulty know it, Thanks a
      lot.

      Reply
    36. Reuben says:
      5 years ago

      Thank you for some other great article. The place else may just anyone get
      that type of info in such an ideal method of writing? I’ve
      a presentation next week, and I’m at the search for
      such info.

      Reply
    37. Concetta says:
      5 years ago

      Woah! I’m really loving the template/theme of this blog. It’s simple, yet effective.
      A lot of times it’s tough to get that “perfect balance” between superb usability and visual appearance.
      I must say that you’ve done a great job with this. Additionally, the blog loads very fast for me on Chrome.
      Superb Blog!

      Reply
    38. Shela says:
      5 years ago

      Hello! I just wish to offer you a big thumbs up for your great info you’ve got here
      on this post. I am coming back to your web site for more soon.

      Reply
    39. Zara says:
      5 years ago

      I’m very pleased to discover this website. I want to to thank you for ones time just for this
      wonderful read!! I definitely enjoyed every bit of it and I have you bookmarked to see new stuff in your blog.

      Reply

    Leave a Reply Cancel reply

    Your email address will not be published. Required fields are marked *

    More in Big Data, Cloud & DevOps
    Big Data, Cloud & DevOps
    Cognitive Load Of Being On Call: 6 Tips To Address It

    If you’ve ever been on call, you’ve probably experienced the pain of being woken up at 4 a.m., unactionable alerts, alerts going to the wrong team, and other unfortunate events. But, there’s an aspect of being on call that is less talked about, but even more ubiquitous – the cognitive load. “Cognitive load” has perhaps

    5 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    How To Refine 360 Customer View With Next Generation Data Matching

    Knowing your customer in the digital age Want to know more about your customers? About their demographics, personal choices, and preferable buying journey? Who do you think is the best source for such insights? You’re right. The customer. But, in a fast-paced world, it is almost impossible to extract all relevant information about a customer

    4 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    3 Ways Businesses Can Use Cloud Computing To The Fullest

    Cloud computing is the anytime, anywhere delivery of IT services like compute, storage, networking, and application software over the internet to end-users. The underlying physical resources, as well as processes, are masked to the end-user, who accesses only the files and apps they want. Companies (usually) pay for only the cloud computing services they use,

    7 MINUTES READ Continue Reading »

    About Us

    Incubated in Harvard Innovation Lab, Experfy specializes in pipelining and deploying the world's best AI and engineering talent at breakneck speed, with exceptional focus on quality and compliance. Enterprises and governments also leverage our award-winning SaaS platform to build their own customized future of work solutions such as talent clouds.

    Join Us At

    Contact Us

    1700 West Park Drive, Suite 190
    Westborough, MA 01581

    Email: [email protected]

    Toll Free: (844) EXPERFY or
    (844) 397-3739

    © 2025, Experfy Inc. All rights reserved.