Luciano Nieto

Pipelines with Spark Databricks and Report

01 Oct 2023 - Author: Luciano Nieto

This pipeline provides a comprehensive demonstration of the end-to-end process, excluding the initial data source integration. It encompasses best practices in data modeling (star schema), leverages the power of Spark for distributed computation, and ultimately delivers a compelling outcome in the form of a detailed Power BI report.

Proven Practice, Using ChatGPT-3 API for Sentiment Analysis

10 Aug 2023 - Author: Luciano Nieto

Can ChatGPT Serve as an Integrated API for Multilingual Sentiment Analysis? In my pursuit of understanding sentiment analysis (since when I was in my graduation and the term “Data Mining” was hot), I encountered the challenge of effectively integrating the analysis of multiple languages simultaneously.

Providing Data with Flask Web API

24 Jul 2023 - Author: Luciano Nieto

As a data professional, understand web applications can be quite challenging due to its distinct development landscape. Despite this, I had a idea of creating a straightforward solution that demystifies the entire process and offers a clearly idea how the things works under the hood.

Using Apache Hop!

30 May 2023 - Author: Luciano Nieto

This use case was created from an idea to have an integration data tool for a small company, for instance. Therefore, it is possible to use Apache Hop to integrate different kinds of systems and data.

Streaming Twitter data

08 Mar 2023 - Author: Luciano Nieto

Intro This repository contains Python scripts that enable the processing of real-time Twitter data using Kafka and MongoDB. The scripts are designed to fetch data from the Twitter API, filter the data based on specific criteria, and store the filtered data into a MongoDB database.