PySpark
I picked up PySpark during my first big data course at university. Its similarity to Python made it relatively easy to learn, and it was quite useful when paired with Jupyter Notebook. One of the most enjoyable challenges was a large project where we had to create friend recommendations for over 50,000 people in the dataset. It was a complex task that required a different way of thinking and exposed me to new language and syntax constructs. Successfully completing it provided a great sense of accomplishment and broadened my technical skills.
PostgreSQL
PostgreSQL was the first SQL language I learned, and it has been a love-hate relationship. I find the general syntax easy to work with, but configuration files often cause trouble. Despite these challenges, PostgreSQL remains a powerful tool. Over the past two years, I've used it for various tasks, including data mining, managing databases for my APIs, and other projects.
SQLAlchemy
SQLAlchemy is a newer technology I learned this year. Its ORM model provides an efficient way to connect to SQL databases and simplifies querying. It has saved me significant time by automating migrations, queries, and database management tasks. Currently, SQLAlchemy is the backend for one of the APIs I’m building.
Power BI
Experienced with Power BI for creating interactive dashboards and reports to analyze and visualize business data.