r

KDnuggets™ News 20:n16, Apr 22: Scaling Pandas with Dask for Big Data; Dive Into Deep Learning: The Free eBook

4 Steps to ensure your AI/Machine Learning system survives COVID-19; State of the Machine Learning and AI Industry; A Key Missing Part of the Machine Learning Stack; 5 Papers on CNNs Every Data Scientist Should Read




r

Free High-Quality Machine Learning & Data Science Books & Courses: Quarantine Edition

If you find yourself quarantined and looking for free learning materials in the way of books and courses to sharpen your data science and machine learning skills, this collection of articles I have previously written curating such things is for you.




r

Math and Architectures of Deep Learning

This hands-on book bridges the gap between theory and practice, showing you the math of deep learning algorithms side by side with an implementation in PyTorch. You can save 40% off Math and Architectures of Deep Learning until May 13! Just enter the code nlkdarch40 at checkout when you buy from manning.com.




r

Fighting Coronavirus With AI: Improving Testing with Deep Learning and Computer Vision

This post will cover how testing is done for the coronavirus, why it's important in battling the pandemic, and how deep learning tools for medical imaging can help us improve the quality of COVID-19 testing.




r

Should Data Scientists Model COVID19 and other Biological Events

Biostatisticians use statistical techniques that your current everyday data scientists have probably never heard of. This is a great example where lack of domain knowledge exposes you as someone that does not know what they are doing and are merely hopping on a trend.




r

Data context and how to get started with understanding COVID-19 data

If you are already applying your Data Science skills or getting ready to contribute to analyzing COVID-19 data, then be sure to take sufficient time to appreciate the context of the numbers to focus on what's most important as we collaborate on this global battle.




r

Top KDnuggets tweets, Apr 15-21: 21 Techniques to Write Better #Python Code with #PyCharm examples

Also: Math for Programmers!; If #Programming languages had honest slogans #humor; 5 Papers on CNNs Every Data Scientist Should Read; Why Understanding CVEs Is Critical for Data Scientists




r

Find Your Perfect Fit: A Quick Guide for Job Roles in the Data World

Data related positions are considered the hottest in the job market during the last couple of years. While everyone wants to join the party and enter this fascinating field, it is essential to first get an understanding. In this quick guide, I’ll do my best to dispel the confusion by crystalizing the essence of the different positions.




r

3 Reasons Why We Are Far From Achieving Artificial General Intelligence

How far we are from achieving Artificial General Intelligence? We answer this through the study of three limitations of current machine learning.




r

Data Transformation: Standardization vs Normalization

Increasing accuracy in your models is often obtained through the first steps of data transformations. This guide explains the difference between the key feature scaling methods of standardization and normalization, and demonstrates when and how to apply each approach.




r

DBSCAN Clustering Algorithm in Machine Learning

An introduction to the DBSCAN algorithm and its Implementation in Python.




r

The Super Duper NLP Repo: 100 Ready-to-Run Colab Notebooks

Check out this repository of more than 100 freely-accessible NLP notebooks, curated from around the internet, and ready to launch in Colab with a single click.




r

Learning during a crisis (Data Science 90-day learning challenge)

How can you keep your focus and drive during a global crisis? Take on a 90-day learning challenge for data science and check out this list of books and courses to follow.




r

Google Open Sources SimCLR, A Framework for Self-Supervised and Semi-Supervised Image Training

The new framework uses contrastive learning to improve image analysis in unlabeled datasets.




r

Top Stories, Apr 20-26: The Super Duper NLP Repo; Free High-Quality Machine Learning & Data Science Books & Courses

Also: Should Data Scientists Model COVID19 and other Biological Events; 5 Papers on CNNs Every Data Scientist Should Read; 24 Best (and Free) Books To Understand Machine Learning; Mathematics for Machine Learning: The Free eBook; Find Your Perfect Fit: A Quick Guide for Job Roles in the Data World




r

A Concise Course in Statistical Inference: The Free eBook

Check out this freely available book, All of Statistics: A Concise Course in Statistical Inference, and learn the probability and statistics needed for success in data science.




r

LSTM for time series prediction

Learn how to develop a LSTM neural network with PyTorch on trading data to predict future prices by mimicking actual values of the time series data.




r

10 Best Machine Learning Textbooks that All Data Scientists Should Read

Check out these 10 books that can help data scientists and aspiring data scientists learn machine learning today.




r

How Data Scientists Can Train and Updates Models to Prepare for COVID-19 Recovery

The COVID-19 pandemic has affected everything, and building predictions during this time is difficult. Data science teams need to update their models to prepare for the recovery, and know how to properly train 2020 data models to learn from the coronavirus anomaly.




r

Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing

The book Trustworthy Online Controlled Experiments: A Practical Guide to A/B Testing by Ron Kohavi (Microsoft, Airbnb), Diane Tang (Google) and Ya Xu (LinkedIn) is available for purchase, with the authors proceeds from the book being donated to charity.




r

KDnuggets™ News 20:n17, Apr 29: The Super Duper NLP Repo; Free Machine Learning & Data Science Books & Courses for Quarantine

Also: Should Data Scientists Model COVID19 and other Biological Events; Learning during a crisis (Data Science 90-day learning challenge); Data Transformation: Standardization vs Normalization; DBSCAN Clustering Algorithm in Machine Learning; Find Your Perfect Fit: A Quick Guide for Job Roles in the Data World




r

Coronavirus COVID-19 Genome Analysis using Biopython

So in this article, we will interpret, analyze the COVID-19 DNA sequence data and try to get as many insights regarding the proteins that made it up. Later will compare COVID-19 DNA with MERS and SARS and we’ll understand the relationship among them.




r

Understanding the COVID-19 Pandemic Using Interactive Visualizations

Interactive visualizations are an effective method for understanding the COVID-19 pandemic. This article presents a repository filled with just such insightful interactions.




r

Introducing Brain Simulator II: A New Platform for AGI Experimentation

A growing consensus of researchers contend that new algorithms are needed to transform narrow AI to AGI. Brain Simulator II is free software for new algorithm development targeted at AGI that you can experiment with and participate in its development.




r

Top KDnuggets tweets, Apr 22-28: 24 Best (and Free) Books To Understand Machine Learning

Also: A Concise Course in Statistical Inference: The Free eBook; ML Ops: Machine Learning as an Engineering Discipline; Learning during a crisis (#DataScience 90-day learning challenge) ; Free High-Quality Machine Learning & Data Science Books & Courses: Quarantine Edition




r

Five Cool Python Libraries for Data Science

Check out these 5 cool Python libraries that the author has come across during an NLP project, and which have made their life easier.




r

Exploring the Impact of Geographic Information Systems

GIS has mostly been behind more popular buzzwords like machine learning and deep learning. GIS has always been around us in the background being used in government, business, medicine, real estate, transport, manufacturing etc.




r

Outbreak Analytics: Data Science Strategies for a Novel Problem

You walk down one aisle of the grocery store to get your favorite cereal. On the dairy aisle, someone sick from COVID-19 coughs. Did your decision to grab your cereal before your milk possibly keep you healthy? How can these unpredictable, near-random choices be included in complex models?




r

Natural Language Processing Recipes: Best Practices and Examples

Here is an overview of another great natural language processing resource, this time from Microsoft, which demonstrates best practices and implementation guidelines for a variety of tasks and scenarios.




r

Which Face is Real? Applying StyleGAN to Create Fake People

This post explains using a pre-trained GAN to generate human faces, and discusses the most common generative pitfalls associated with doing so.




r

Optimize Response Time of your Machine Learning API In Production

This article demonstrates how building a smarter API serving Deep Learning models minimizes the response time.




r

Demystifying the AI Infrastructure Stack

AI tools and services are expanding at a rapid clip, and keeping a handle on this evolving ecosystem is crucial for the success of your AI projects. This framework will help you build your technical stack to deploy AI projects faster and at scale.




r

Top Stories, Apr 27 – May 3: Five Cool Python Libraries for Data Science; Natural Language Processing Recipes: Best Practices and Examples

Also: Coronavirus COVID-19 Genome Analysis using Biopython; LSTM for time series prediction; A Concise Course in Statistical Inference: The Free eBook; Exploring the Impact of Geographic Information Systems





r

Microsoft Research Unveils Three Efforts to Advance Deep Generative Models

Optimus, FQ-GAN and Prevalent bring new ideas to apply generative models at large scale.




r

How use the Coronavirus crisis to kickstart your Data Science career

As the global economy dwindles, tech companies are hiring en masse. Now is the time to get yourself noticed as a Data Scientist and try to land your dream job.




r

Top 10 Data Visualization Tools for Every Data Scientist

At present, the data scientist is one of the most sought after professions. That’s one of the main reasons why we decided to cover the latest data visualization tools that every data scientist can use to make their work more effective.




r

Getting Started with Spectral Clustering

This post will unravel a practical example to illustrate and motivate the intuition behind each step of the spectral clustering algorithm.




r

Beginners Learning Path for Machine Learning

So, you are interested in machine learning? Here is your complete learning path to start your career in the field.




r

Statistical Thinking for Industrial Problem Solving – a free online statistics course

This online course is available – for free – to anyone interested in building practical skills in using data to solve problems better.




r

KDnuggets™ News 20:n18, May 6: Five Cool Python Libraries for Data Science; NLP Recipes: Best Practices

5 cool Python libraries for Data Science; NLP Recipes: Best Practices and Examples; Deep Learning: The Free eBook; Demystifying the AI Infrastructure Stack; and more.




r

Best Coronavirus Projections, Predictions, Dashboards and Data Resources

Check out this curated collection of coronavirus-related projections, dashboards, visualizations, and data that we have encountered on the internet.




r

Explaining “Blackbox” Machine Learning Models: Practical Application of SHAP

Train a "blackbox" GBM model on a real dataset and make it explainable with SHAP.




r

Were 21% of New York City residents really infected with the novel coronavirus?

Understanding the types of statistical bias that pop up in popular media and reporting is especially important during this pandemic where the data -- and our global response to the data -- directly impact peoples' lives.




r

Top KDnuggets tweets, Apr 29 – May 5: 24 Best (and Free) Books To Understand Machine Learning

What are Some 'Advanced ' #AI and #MachineLearning Online Courses?; 24 Best (and Free) Books To Understand Machine Learning; Top 5 must-have #DataScience skills for 2020






r

Forecasting Stories 3: Each Time-series Component Sings a Different Song

With time-series decomposition, we were able to infer that the consumers were waiting for the highest sale of the year rather than buying up-front.




r

Data Scientists, Corporate Fortune Tellers

I realized that from a corporate perspective, “fortune teller” was not entirely off from the role of a “data scientist”.




r

Will Machine Learning Engineers Exist in 10 Years?

As can be common in many technical fields, the landscape of specialized roles is evolving quickly. With more people learning at least a little machine learning, this could eventually become a common skill set for every software engineer.