Data Structuring, Acquisition, Cleaning, and ETL Cheatsheet

Module 1 Cheatsheet — Foundations, Acquisition, DOM/XPath, Structuring


1) What “Big Data” means:


No single size cutoff. Think:
Volume (lots of rows/large objects), Variety (heterogeneous), Velocity (fast-changing), Veracity (quality).
Implication: choose tools/storage/pipelines that match data scale and shape.

2) Three canonical encodings:


Tables / Relations / DataFrames

Heterogeneous columns; ideal for SQL/Pandas joins, filters, aggregations.
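
As a minimal sketch of the table encoding, the snippet below loads a few rows with mixed column types into an in-memory SQLite database and runs one filter-plus-aggregation query. The `orders` table, its column names, and the sample rows are hypothetical and chosen only for illustration; they do not come from the cheatsheet.

```python
import sqlite3

# Hypothetical sample data: each row mixes text, integer, and float columns,
# which is what distinguishes a table from a uniform numeric array.
rows = [
    ("alice", "NYC", 3, 19.99),
    ("bob", "LA", 1, 5.00),
    ("carol", "NYC", 2, 42.50),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (customer TEXT, city TEXT, quantity INTEGER, unit_price REAL)"
)
conn.executemany("INSERT INTO orders VALUES (?, ?, ?, ?)", rows)

# Filter (WHERE) and aggregate (GROUP BY) in a single query.
query = """
    SELECT city, SUM(quantity * unit_price) AS revenue
    FROM orders
    WHERE quantity >= 2
    GROUP BY city
    ORDER BY city
"""
revenue_by_city = conn.execute(query).fetchall()
print(revenue_by_city)
```

The same filter and aggregation could be written with a DataFrame library; SQLite is used here only because it ships with Python.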

Arrays / Matrices / Tensors

Uniform numeric cells;


Critical Perspectives on Black Diaspora Identity and Struggle

I. Maafa: The Scope of the Transatlantic Slave Trade

Gomez, Exchanging Our Country Marks, Chapter 1: “Time and Space”

Summary: Gomez explores how African identities were preserved, adapted, or transformed in the Americas. He challenges the idea that Africans lost their cultural roots during enslavement and instead argues that memory, kinship, and religion helped sustain African identities.

Key Concepts:

  • Maafa: A Kiswahili term meaning “great disaster,” used to describe the transatlantic slave trade

Philosophical Reflections on Power, Happiness, and Society

Philosophical Reflections on Personal Motivation

Hobbes, Power, and the Pursuit of Happiness

When I was a kid, my parents paid little attention to how I was doing in general; they were more focused on my older brother. In school, I came to understand that happiness was all I was looking for, and that my actions shaped and helped decide my purposes in life. Aristotle discusses the importance of virtue and happiness in daily life, and also of conducting yourself in a way that accustoms


Ancient Literary Traditions from Around the World

Understanding Myths in Literature

The word ‘myth’ often refers to a person or event that stands out for a particular reason, becoming an object of admiration and a model for others. Key features of myths include:

  • The narration of an event prior to historical time.
  • The involvement of superhuman beings.
  • Attempts to explain the fundamental issues of existence.
  • Holding religious significance.

Types of myths include cosmological myths and those detailing the practices of worship of gods and heroes.

Ancient


Spanish Post-War Literature: Novel and Theater in the 1940s

The Cultural Impact of the Spanish Civil War

The dramatic consequences of the Civil War were felt not only in the social, political, economic, and human spheres, but also, and very intensely, in the field of culture. With the end of the war, the best writers went into exile, mainly to Latin America, but they kept in touch with the writers who remained in Spain, and their books circulated widely, despite censorship.

Spanish Novel of the 1940s: Social Realism and Exile

The social theme is present throughout


Mastering SQL: Practical Database Design and Query Examples

SQL Database Management: Supplier and Product Inventory

Defining the Inventory Schema (DDL)

We begin by setting up the database structure for tracking suppliers and their products. Note the use of PRIMARY KEY, FOREIGN KEY, and CHECK constraints for ensuring data integrity.

Supplier Table Creation

CREATE TABLE Supplier (
    SupplierID INT PRIMARY KEY,
    Name VARCHAR(100) NOT NULL,
    City VARCHAR(50)
);
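
One way to see the Supplier constraints in action is to run the DDL above against an in-memory SQLite database and attempt a few inserts. This is a sketch under assumptions: SQLite is used only because it ships with Python (the source does not name a database engine), and the supplier names, cities, and the `try_insert` helper are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# The Supplier DDL from the text, unchanged.
conn.execute("""
    CREATE TABLE Supplier (
        SupplierID INT PRIMARY KEY,
        Name VARCHAR(100) NOT NULL,
        City VARCHAR(50)
    )
""")
conn.execute("INSERT INTO Supplier VALUES (1, 'Acme Parts', 'Chicago')")


def try_insert(values):
    """Hypothetical helper: return None on success, else the constraint error text."""
    try:
        conn.execute("INSERT INTO Supplier VALUES (?, ?, ?)", values)
        return None
    except sqlite3.IntegrityError as exc:
        return str(exc)


# Reusing SupplierID 1 violates PRIMARY KEY uniqueness.
duplicate_key_error = try_insert((1, "Other Co", "Boston"))
# A NULL Name violates NOT NULL.
missing_name_error = try_insert((2, None, "Boston"))
# City has no constraint, so NULL is accepted.
nullable_city_error = try_insert((3, "Bolt Supply", None))

row_count = conn.execute("SELECT COUNT(*) FROM Supplier").fetchone()[0]
print(duplicate_key_error, missing_name_error, nullable_city_error, row_count)
```

Other engines report constraint violations with different error types and messages, so only the accept/reject behaviour should be read as general.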

Product Table Creation

CREATE TABLE Product (
    ProductID INT PRIMARY KEY,
    Name VARCHAR(100)

Fundamentals of Software Development, Testing, and Modeling

What is Software and Its Common Myths?

Definition of Software

Software is a set of instructions, data, or programs used to operate computers and execute specific tasks. It is the intangible component of computers, as opposed to hardware.

Types of Software

  • System Software: e.g., Operating Systems
  • Application Software: e.g., MS Word, Web Browsers
  • Programming Software: e.g., Compilers, Editors

Common Software Myths

(Content for this section was not provided).

Core Software Engineering Concepts

Software Engineering


Econometric Methods for Establishing Causality

1. RCTs Versus Quasi-Experimental Methods

A randomized controlled trial bakes causality into the data: random assignment severs every systematic link between treatment status and unobserved factors. Consequently, the simple difference in average outcomes is unbiased, provided compliance and sample integrity hold. The four quasi-experimental tools imitate this causal magic only by leaning on single-point assumptions:

  • Ordinary Least Squares (OLS): Assumes no omitted variable is correlated with the regressor.
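
The contrast between random assignment and an omitted-variable problem can be illustrated with a small simulation. This is a sketch under stated assumptions, not an empirical result: the data-generating process, the true effect of 2.0, the sample size, and all function names are invented for illustration.

```python
import random

random.seed(0)
TRUE_EFFECT = 2.0  # assumed treatment effect in the simulated data
N = 20_000


def mean(xs):
    return sum(xs) / len(xs)


def difference_in_means(assign):
    """Simulate N units and return mean(treated outcomes) - mean(control outcomes).

    `assign` maps the unobserved factor u to a treatment indicator.
    """
    treated, control = [], []
    for _ in range(N):
        u = random.gauss(0, 1)  # unobserved factor that also affects the outcome
        d = assign(u)
        y = 1.0 + TRUE_EFFECT * d + u + random.gauss(0, 1)
        (treated if d else control).append(y)
    return mean(treated) - mean(control)


# Random assignment: treatment is independent of u, so the simple
# difference in means should land close to TRUE_EFFECT.
rct_estimate = difference_in_means(lambda u: random.random() < 0.5)

# Self-selection: units with high u take the treatment, so u is an omitted
# variable correlated with treatment and the naive difference overstates the effect.
confounded_estimate = difference_in_means(lambda u: u > 0)

print(round(rct_estimate, 2), round(confounded_estimate, 2))
```

In the second case a regression of the outcome on treatment alone would inherit the same bias, which is the OLS assumption stated in the bullet above.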

Software Maintenance, Evolution, and Documentation Best Practices

Software Maintenance Fundamentals (Week 1)

Definition of Software Maintenance (SM)

  • “The modification to a software product after delivery to correct faults.”

Maintenance vs. Software Evolution

  • Maintenance: All activities preventing failure or decay of a software system.
  • Evolution: Activities that provide new or changed functionality only.

Software Aging and Decay

  • Software ages when it does not adapt to changing technology, user needs, or competitors.

Lehman’s Laws of Software Evolution

  1. Continuing Change:
    • Real-

Aggregate Supply, Demand, and Economic Growth Factors

Aggregate Supply and Demand Dynamics

Short-Run and Long-Run Aggregate Supply

  • Determinants of Short-Run Aggregate Supply (SRAS)

    What are the determinants of short-run aggregate supply? Changes in input prices, taxes, and business and inflationary expectations.

  • Determinant of Aggregate Supply (AS)

    What is a key determinant of aggregate supply? Productivity.

  • SRAS Curve Slope

    The short-run aggregate supply curve slopes upward because profits increase at high price levels and because many input prices are slow
