Amruta Deshpande

Data Processing and Feature Engineering with Dask

As datasets grow from MBs → GBs → TBs, traditional tools like Pandas begin to struggle: Runs out of memory Slow execution on large datasets Single-core processing bottleneck This is where Dask becomes a game-changer. Dask allows you to: Process large datasets that don’t fit into memory Scale computations across multiple CPU cores or machines …

Data Processing and Feature Engineering with Dask Read More »

Data Processing and Feature Engineering with NumPy

Efficient Numerical Computing and Array Operations Before any Machine Learning model, dashboard, or analytics pipeline is built, data must be processed and transformed efficiently.At the heart of almost every Python-based data workflow lies NumPy. NumPy provides:  High-performance numerical computation  Powerful multi-dimensional arrays  Vectorized operations (no slow Python loops)  The foundation for Pandas, SciPy, Scikit-learn, TensorFlow, …

Data Processing and Feature Engineering with NumPy Read More »

Data Processing and Feature Engineering with Pandas

Data Manipulation and Analysis for Machine Learning & Analytics Students In real-world data science and machine learning projects, data preparation consumes 70–80% of the total effort.Raw data is often: Incomplete Inconsistent Noisy Poorly structured This is where Data Processing and Feature Engineering come into play. Pandas, Python’s most powerful data manipulation library, provides everything needed …

Data Processing and Feature Engineering with Pandas Read More »

NumPy-like ML Libraries: JAX (by Google) & Theano (Legacy Research Library)

Modern machine learning relies on fast numerical computation, automatic differentiation, and efficient use of GPUs/TPUs. While NumPy remains the foundation for numerical Python programming, it lacks several advanced capabilities needed for training today’s deep learning models. This gap led to the creation of specialized ML libraries that feel like NumPy but go far beyond it. …

NumPy-like ML Libraries: JAX (by Google) & Theano (Legacy Research Library) Read More »

Understanding JAX: High-Performance Machine Learning with NumPy-like Syntax (by Google)

Introduction Modern machine learning requires speed, scalability, and flexibility.Researchers and engineers want: Python-like simplicity GPU/TPU acceleration Automatic differentiation Parallel computation Ease of experimentation While libraries like NumPy, TensorFlow, and PyTorch are widely used, Google introduced something faster and more flexible: JAX A high-performance machine learning library that feels like NumPy, but is powered by XLA …

Understanding JAX: High-Performance Machine Learning with NumPy-like Syntax (by Google) Read More »

Deep Learning Libraries: Understanding PyTorch

In the world of deep learning, few libraries have influenced research and innovation as profoundly as PyTorch.Developed by Meta (formerly Facebook), PyTorch has become one of the most popular and flexible frameworks for building and training neural networks — trusted by researchers, developers, and leading AI companies alike. Its ease of use, dynamic computation graphs, …

Deep Learning Libraries: Understanding PyTorch Read More »

Deep Learning Libraries: Understanding Keras

 Introduction Deep learning has revolutionized how machines learn from data — powering applications like image recognition, speech translation, medical diagnostics, and autonomous systems. However, building and training deep neural networks from scratch can be complex and time-consuming. That’s where Keras comes in — a high-level deep learning API designed to make building neural networks simple, …

Deep Learning Libraries: Understanding Keras Read More »

Deep Learning Libraries: Understanding TensorFlow by Google

Deep Learning is at the core of modern Artificial Intelligence — from self-driving cars and speech assistants to image recognition and generative AI. To make deep learning accessible and efficient, several libraries have emerged — among them, TensorFlow stands out as one of the most widely used and powerful frameworks. Developed by Google Brain Team, …

Deep Learning Libraries: Understanding TensorFlow by Google Read More »

MLxtend: Useful ML Extensions for Scikit-Learn

In the world of machine learning, Scikit-learn is one of the most widely used libraries — offering clean APIs for classification, regression, clustering, and more.But as your projects grow in complexity, you might sometimes wish scikit-learn had a few more tools and utilities for tasks like model stacking, plotting decision boundaries, or performing frequent pattern …

MLxtend: Useful ML Extensions for Scikit-Learn Read More »

The Lifecycle of JIRA, Agile, and BDD (Cucumber) in Software Development

 Software development today is not just about writing code—it’s about collaborating across roles, managing changes effectively, and ensuring business needs are met with precision. Three key pillars often used together are: Agile → The methodology/framework guiding the process. JIRA → The tool for managing workflows and tracking progress. BDD (Behavior-Driven Development) with Cucumber → The …

The Lifecycle of JIRA, Agile, and BDD (Cucumber) in Software Development Read More »