Pier Paolo Ippolito in Towards Data Science · Feature Extraction Techniques · An end to end guide on how to reduce a dataset dimensionality using Feature Extraction Techniques such as: PCA, ICA, LDA, LLE, t-SNE and… · 11 min read · Oct 10, 2019
Pier Paolo Ippolito in Towards Data Science · LLMs Pitfalls · An introduction to some of the key components surrounding LLMs to produce production-grade applications · 9 min read · May 7, 2024
Pier Paolo Ippolito in Towards Data Science · Introduction to Apache Iceberg · Exploring Apache Iceberg benefits/drawbacks and how it can be used to build your own Lakehouse. · 5 min read · Feb 29, 2024
Pier Paolo Ippolito in Towards Data Science · Introduction to Streaming Frameworks · Understanding some of the key characteristics to consider when evaluating and comparing streaming technologies. · 6 min read · Nov 8, 2023
Pier Paolo Ippolito in Towards Data Science · Python on the Web · Showcasing Python applications on the web without any server · 9 min read · Oct 11, 2023
Pier Paolo Ippolito in Towards Data Science · Getting started with JAX · Powering the future of high-performance numerical computing and ML research · 5 min read · Jul 7, 2023
Pier Paolo Ippolito in Towards Data Science · Geospatial Data Analysis in Python · Getting started with performing geographical data analysis in Python using OSMnx and Kepler.gl · 6 min read · May 3, 2023
Pier Paolo Ippolito in Towards Data Science · Low Code Time Series Analysis · Using Darts to streamline your Python time series analysis development · 6 min read · Mar 8, 2023
Pier Paolo Ippolito in Towards Data Science · Apache Spark Optimization Techniques · A review of some of the most common Spark performance problems and how to address them · 5 min read · Jan 11, 2023
Pier Paolo Ippolito in Towards Data Science · Getting Started with Apache Spark · Exploring some of the key concepts associated with Spark, and what defined its success in the Big Data realm · 6 min read · Oct 14, 2022