Skip to content

Appendix: Additional Resources to be reviewed and read

Mikiko Bazeley edited this page Dec 27, 2019 · 19 revisions

Free books!:

Visualization: https://towardsdatascience.com/february-edition-data-visualization-ee524ca64416

NLP: https://towardsdatascience.com/intro-to-nlp-using-inaugural-speeches-of-presidents-8c7ca32cbdfe

Self tracking: https://towardsdatascience.com/what-should-you-be-tracking-in-2019-a9415a700897

Docker: https://towardsdatascience.com/learn-enough-docker-to-be-useful-b7ba70caeb4b

Interpretable Machine Learning:


Label Encoder, 1 Hot Encoding, Scaling on Continuous features, Imputation w/ flags, Using frequency or response rate to combine levels, Find & Replace, Using CategoryEncoders package for the library

https://hackernoon.com/my-self-created-ai-masters-degree-ddc7aae92d0e

Advanced Pipelines tutorial:

Rajiv Shah Video: https://twitter.com/srikanth_vikas/status/1092585613314772992


Tableau:


Python:


NLP:

Fleshed out pythone notebook in Kaggle:

Dealing with Categorical Variables:

Multi Level Classification:

https://www.dataquest.io/blog/free-books-learn-data-science/

When to use which visualization packages?:

Understanding a grammar of graphics:

Installing from Jupyter notebook:

Code Review & Linters:

https://towardsdatascience.com/automated-machine-learning-hyperparameter-tuning-in-python-dfda59b72f8a

https://blog.usejournal.com/how-to-get-your-product-and-engineering-teams-running-like-clockwork-8c5c342721df

https://towardsdatascience.com/data-science-project-flow-for-startups-282a93d4508d

https://www.dataengineeringpodcast.com/

https://www.youtube.com/watch?v=8jIgng-ViPU

https://www.youtube.com/watch?v=0cEfe9mSatM

http://www.pythontutor.com/

Regularization:

Tutorials on various items:


Hi @channel I am currently working on web scraping and am supposed to write a unit test for my scrapy spider. I am trying to understand how to use Mock classes for mocking the responses, but unfortunately haven't been able to get hold of how it is actually working and how can I write my unit test using it. If anyone of you have worked on this before, can you please share some resources which I can use to make sense of the concept and put in practice? Thanks


@channel Hi! I am looking for some refences for on the following topics related to Time Series problems- contextual anomaly detection/finding irregular data patterns imputation/smoothing for the missing data forecasting techniques I have been asked to give a short presentation on these topics during the interview. I would appreciate if you could share some articles. Mainly I am looking for the first articles related to contextual anomaly detection

Jean-Sebastien - DSDJ Team 4 days ago Here are some references that could be useful: https://arxiv.org/abs/1904.00548 https://towardsdatascience.com/a-note-about-finding-anomalies-f9cedee38f0b https://era.library.ualberta.ca/items/af6513a0-e09a-47d3-8d74-2d6862f991e4 https://journalofbigdata.springeropen.com/articles/10.1186/s40537-014-0011-y arXiv.orgarXiv.org Unsupervised Contextual Anomaly Detection using Joint Deep... A method for unsupervised contextual anomaly detection is proposed using a cross-linked pair of Variational Auto-Encoders for assigning a normality score to an observation. The method enables a... MediumMedium A note about finding anomalies Anomaly detection refers to the task of finding observations that do not conform to the normal, expected behaviour… Reading time 10 min read Apr 10th, 2018 (173 kB) https://miro.medium.com/max/1200/1*6_pj7EF6xY9dB2lgdhQ69g.png ERAERA Time Series Contextual Anomaly Detection... | ERA Anomaly detection in time series is one of the fundamental issues in data mining. It addresses various problems in different domains such... Keywords Market manipulation, Sentiment analysis, Time series, Anomaly detection, Stock market Rights This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law. Journal of Big Data Contextual anomaly detection framework for big sensor data The ability to detect and process anomalies for Big Data in real-time is a difficult task. The volume and velocity of the data within many systems makes it difficult for typical algorithms to scale and retain their real-time characteristics. The pervasiveness of data combined with the problem that many existing algorithms only consider the content of the data source; e.g. a sensor reading itself without concern for its context, leaves room for potential improvement. The proposed work defines a contextual anomaly detection framework. It is composed of two distinct steps: content detection and context detection. The content detector is used to determine anomalies in real-time, while possibly… Show more :ultra_fast_parrot: 1 :+1: 1

Chris - DSDJ Team 4 days ago For Time Series: https://dimensionless.in/beginners-guide-for-time-series-forecasting/ https://varjo.ktto.fi/files/ekonometrian-jatkokurssi-1498336030292.pdf https://www.kdnuggets.com/2018/03/time-series-dummies-3-step-process.html https://www.machinelearningplus.com/time-series/arima-model-time-series-forecasting-python/ DIMENSIONLESS TECHNOLOGIES PVT.LTD.DIMENSIONLESS TECHNOLOGIES PVT.LTD. Beginner’s Guide for Time-Series Forecasting | Blog | Dimensionless This is a biginners guide to time series forecasting. We will solve a small time series problem & learn time series forecating along the way. Feb 13th KDnuggetsKDnuggets Time Series for Dummies – The 3 Step Process - KDnuggets Time series forecasting is an easy to use, low-cost solution that can provide powerful insights. This post will walk through introduction to three fundamental steps of building a quality model. Machine Learning PlusMachine Learning Plus ARIMA Model - Complete Guide to Time Series Forecasting in Python | ML+ Using ARIMA model, you can forecast a time series using the series past values. In this post, we build an optimal ARIMA model from scratch and extend it to Seasonal ARIMA (SARIMA) and SARIMAX models. You will also see how to build autoarima models in python Feb 18th :+1: 1

Chris - DSDJ Team 4 days ago https://machinelearningmastery.com/how-to-develop-lstm-models-for-time-series-forecasting/ https://machinelearningmastery.com/lstm-model-architecture-for-rare-event-time-series-forecasting/ https://machinelearningmastery.com/findings-comparing-classical-and-machine-learning-methods-for-time-series-forecasting/ Machine Learning MasteryMachine Learning Mastery How to Develop LSTM Models for Time Series Forecasting Long Short-Term Memory networks, or LSTMs for short, can be applied to time series forecasting. There are many types of LSTM models that can be used for each specific type of time series forecasting problem. In this tutorial, you will discover how to develop a suite of LSTM models for a range of standard time … Nov 13th, 2018 (256 kB) https://3qeqpr26caki16dnhd19sv6by6v-wpengine.netdna-ssl.com/wp-content/uploads/2018/11/How-to-Develop-LSTM-Models-for-Time-Series-Forecasting.jpg Machine Learning MasteryMachine Learning Mastery LSTM Model Architecture for Rare Event Time Series Forecasting Time series forecasting with LSTMs directly has shown little success. This is surprising as neural networks are known to be able to learn complex non-linear relationships and the LSTM is perhaps the most successful type of recurrent neural network that is capable of directly supporting multivariate sequence prediction problems. A recent study performed at Uber … Nov 1st, 2018 (228 kB) https://3qeqpr26caki16dnhd19sv6by6v-wpengine.netdna-ssl.com/wp-content/uploads/2018/11/Sliding-Window-Approach-to-Modeling-Time-Series.png Machine Learning MasteryMachine Learning Mastery Comparing Classical and Machine Learning Algorithms for Time Series Forecasting Machine learning and deep learning methods are often reported to be the key solution to all predictive modeling problems. An important recent study evaluated and compared the performance of many classical and modern machine learning and deep learning methods on a large and diverse set of more than 1,000 univariate time series forecasting problems. The … Oct 30th, 2018 (299 kB) https://3qeqpr26caki16dnhd19sv6by6v-wpengine.netdna-ssl.com/wp-content/uploads/2018/10/Findings-Comparing-Classical-and-Machine-Learning-Methods-for-Time-Series-Forecasting.jpg

Chris - DSDJ Team 4 days ago https://otexts.com/fpp2/ otexts.comotexts.com Forecasting: Principles and Practice 2nd edition :ultra_fast_parrot: 1

Prashant 4 days ago @Chris - DSDJ Team @Jean-Sebastien - DSDJ Team Thank you very much


Also potentially recommend (looks interesting) -- http://www.pythonvisually.com/

pythonvisually.compythonvisually.com Learn Python VISUALLY - Learn Differently Learn Python VISUALLY


Syed Toufiq Dec 10th at 12:56 AM Does anyone has handon with attribution modelling , can you please suggest good resources to get started and get understanding for the same 3 replies

Mikiko Bazeley - DSDJ Team 9 days ago Is this a question related to a marketing project? Attribution modeling typically is the case where marketin teams get leads and want to understand the origin or "attribute" the origin to a specific marketing channel (Facebook Ads vs. Google Search, social media vs. email marketing, etc). Does that sound like the right understanding?

Syed Toufiq 9 days ago Yes its related to Marketing project , thanks for the reply sounds right i need to dive into more details , Can you recommend any good resources?

Mikiko Bazeley - DSDJ Team 8 days ago So quick note, attribution models typically tend to be rules based approaches that aren't mathematical or rigorous and are very heuristic based. And attribution modeling isn't modeling in the sense of data science or machine learning (the reality is there's nothing really intelligent about it). I've posted some resources below: https://agencyanalytics.com/blog/marketing-attribution-models <= Different rules based approaches for creating an attribution model https://www.kaushik.net/avinash/multi-channel-attribution-modeling-good-bad-ugly-models/ <= More about attribution models https://www.referralsaasquatch.com/marketing-attribution-models/ <= Things to think about when choosing attribution models https://www.ngdata.com/marketing-attribution-models-for-success/ <= More opinions about models https://cxl.com/blog/conversion-attribution-modeling/ <= Another resource If you're using google analytics: https://support.google.com/analytics/topic/1631741?hl=en&ref_topic=3544907


https://cedar.buffalo.edu/~srihari/CSE676/


https://deeplearning.mit.edu/ - Deep Learning

Clone this wiki locally