Pier Paolo Ippolito

5.1K Followers

Published in Towards Data Science

·Pinned

Feature Extraction Techniques

An end-to-end guide on how to reduce a dataset's dimensionality using Feature Extraction Techniques such as PCA, ICA, LDA, LLE, t-SNE and AE. — Introduction It is becoming quite common to work with datasets of hundreds (or even thousands) of features. If the number of features becomes comparable to (or even bigger than!) the number of observations stored in a dataset, then this can most likely lead to a Machine Learning model suffering…

Machine Learning

11 min read

Published in Towards Data Science

·Mar 8

Low Code Time Series Analysis

Using Darts to streamline your Python time series analysis development — Introduction Time Series Forecasting is a unique field in Machine Learning. In fact, when working with time series, there is an inherent time dependency between the points in the series, so the observations are highly dependent on each other. …

Data Science

6 min read

Published in Towards Data Science

·Jan 11

Apache Spark Optimization Techniques

A review of some of the most common Spark performance problems and how to address them — Introduction Apache Spark is currently one of the most popular big data technologies used in the industry, supported by companies such as Databricks and Palantir. One of the key responsibilities of Data Engineers when using Spark is to write highly optimized code in order to take full advantage of Spark's distributed…

Big Data

5 min read

Published in Towards Data Science

·Oct 14, 2022

Getting Started with Apache Spark

Exploring some of the key concepts associated with Spark, and what defined its success in the Big Data realm — Introduction One of the main issues that characterized the inception of the internet as we know it today was the inability to scale (e.g. being able to search large amounts of data in a short time, support a constantly varying number of users without any downtime, etc.). This was ultimately due…

Big Data

6 min read

Published in Towards Data Science

·Jul 8, 2022

How to Develop Online Revenue Streams as a Data Scientist

Exploring some of the different approaches that can be used to create a side income online. — Introduction Working as a Data Scientist for a company can be a really rewarding opportunity, with average salaries in the US reaching about $117,806 per year in 2021 [1]. …

Data Science

5 min read

Published in Towards Data Science

·Apr 22, 2022

Embedding Interactive Python Plots on the Web

A guide on how to use Plotly Chart Studio and Datapane to share Python plots on the web — Introduction One of the most important steps in the Data Science pipeline is Data Visualization. In fact, thanks to Data Visualization, Data Scientists can quickly gather insights about the data they have available and spot any possible anomalies. Traditionally, Data Visualization consisted of creating static images and summary statistics…

Data Visualization

5 min read

Published in Towards Data Science

·Feb 10, 2022

Artificial Intelligence for Cybersecurity

A guide to some of the key applications of AI for enhancing cybersecurity systems. — Introduction One of the most promising applications of Artificial Intelligence (AI) is Cybersecurity. In fact, managing the security of large distributed systems can easily become an exponentially complicated task, considering all the different scenarios an adversarial entity could use to take advantage of the security infrastructure in place…

Data Science

5 min read

Published in Towards Data Science

·Jan 12, 2022

Design Patterns in Machine Learning for MLOps

Outlining some of the most common design patterns encountered when creating successful Machine Learning solutions — Introduction Design Patterns are a set of best practices and reusable solutions to common problems. Data Science and other disciplines such as Software Development, Architecture, etc. …

Data Science

7 min read

Published in Towards Data Science

·Dec 8, 2021

Getting started with Feature Stores

An introduction to what Feature Stores are and how they can be used to streamline your Machine Learning processes — Introduction Creating Machine Learning models able to perform reliably in production can be a very difficult process. These models can in fact only be as good as the data used to train them. Therefore, being able to create a process capable of accurately pre-processing all the incoming data and…

Data Science

5 min read

Published in Towards Data Science

·Nov 3, 2021

Azure for Machine Learning Engineers

Getting an understanding of all the different AI services provided by Azure and when they should be used — Introduction As more and more companies decide to move their on-premises datacenters to the cloud, cloud skills are now becoming increasingly important. In 2020, Microsoft Azure was declared the fastest growing cloud provider [1] and therefore I decided to challenge myself to learn more about their Data Science services and complete…

Data Science

5 min read

Data Analytics @ Swiss Re, TDS Associate Editor and Freelancer. https://linktr.ee/pierpaolo28
