Tutorials

Learn about Tutorials on our blog

String Manipulation on PySpark DataFrames

String manipulation is a common task in data processing. PySpark provides a variety of built-in functions for manipulating string columns in DataFrames. Below, we explore some of the most useful string manipulation functions and demonstrate how to use them with…

Read More

Python Code Execution- Behind the Door- What happens?

I wrote a Python code or I created a Python script, and it executed successfully So what does it Mean? This will be the most basic question a Early Python Learner can ask ! So Consider this scenario- where i executed a script in python which saves a many csv in Local…

Read More

Spark SQL Join Types- Syntax examples, Comparision

Spark SQL supports several types of joins, each suited to different use cases. Below is a detailed explanation of each join type, including syntax examples and comparisons. Types of Joins in Spark SQL Inner Join Left (Outer) Join Right (Outer) Join Full (Outer) Join…

Read More

Temporary Functions in PL/Sql Vs Spark Sql

Temporary functions allow users to define functions that are session-specific and used to encapsulate reusable logic within a database session. While both PL/SQL and Spark SQL support the concept of user-defined functions, their implementation and usage differ…

Read More

Are Dataframes in PySpark Lazy evaluated?

Yes, DataFrames in PySpark are lazily evaluated, similar to RDDs. Lazy evaluation is a key feature of Spark’s processing model, which helps optimize the execution of transformations and actions on large datasets. What is Lazy Evaluation? Lazy evaluation means…

Read More
BDL Ecosystem-HDFS and Hive Tables

BDL Ecosystem-HDFS and Hive Tables

Big Data Lake: Data Storage HDFS is a scalable storage solution designed to handle massive datasets across clusters of machines. Hive tables provide a structured approach for querying and analyzing data stored in HDFS. Understanding how these components work together…

Read More

Python input function in Detail- interesting usecases

The input() function in Python is primarily used to take input from the user through the command line. While its most common use is to receive text input, it can be used creatively for various purposes. The input() function in Python The input() function in Python is…

Read More

Python Strings Interview Questions

Python Programming Strings Interview Questions Write a Python program to remove a Specific character from string? Here’s a Python program to remove a specific character from a string: def remove_char(text, char): “”” Removes a specific character from a string….

Read More

SAS Character Functions, Date Functions

here’s a table summarizing some common SAS List Date functions with their syntax and examples: Here’s a breakdown of some key categories with representative functions, syntax, and examples:       1. Retrieving…

Read More

SAS First., Last. Syntax and uses with examples

In SAS, the FIRST. and LAST. automatic variables are used within a DATA step to identify the first and last occurrences of observations within a BY group. These variables are particularly useful when working with sorted data or when you need to perform specific…

Read More