Question 1

SQL vs NoSQL — when to use each?

Accepted Answer

**SQL databases** (MySQL, PostgreSQL) store data in structured **tables** with a fixed schema. They enforce relationships, support complex queries, and guarantee strong consistency through ACID transactions. Choose SQL when your data is relational and predictable — e.g., users, orders, payments.

**NoSQL databases** (MongoDB, DynamoDB, Redis) offer flexible schemas and are designed for **horizontal scaling**. They trade some consistency guarantees for speed and flexibility. Choose NoSQL when you

Question 2

Explain the types of SQL JOINs.

Accepted Answer

- **INNER JOIN** — only rows matching in both tables.
- **LEFT JOIN** — all left rows + matching right (NULLs if none).
- **RIGHT JOIN** — all right rows + matching left.
- **FULL OUTER JOIN** — all rows from both (MySQL emulates with UNION).

```sql
SELECT u.name, o.total
FROM users u
LEFT JOIN orders o ON o.user_id = u.id;
```

Question 3

What is an index? What's the trade-off?

Accepted Answer

An index is a data structure (usually a B-tree) that speeds up reads by avoiding full table scans — like a book's index. Trade-off: it speeds up `SELECT`/`WHERE`/`JOIN` but slows down `INSERT`/`UPDATE`/`DELETE` and uses storage. Index columns you frequently filter or join on.

Question 4

What is database normalization?

Accepted Answer

Normalization is the process of organizing tables to **reduce redundancy** and **prevent update anomalies**.

- **1NF (First Normal Form)** — every column holds **atomic** (indivisible) values; no repeating groups.
- **2NF** — meets 1NF and has **no partial dependency** (every non-key column depends on the *whole* primary key, not just part of it).
- **3NF** — meets 2NF and has **no transitive dependency** (non-key columns depend only on the primary key, not on other non-key columns).

**Example

Question 5

What is SQL injection and how do you prevent it?

Accepted Answer

An attack where malicious input is concatenated into a query to alter its logic. Prevent it with **parameterized queries / prepared statements** (and ORMs that use them), plus input validation and least-privilege DB accounts. Never build SQL by string concatenation.

```js
// safe — parameterized
db.query("SELECT * FROM users WHERE email = ?", [email]);
```

Question 6

Primary key vs foreign key vs unique key.

Accepted Answer

- **Primary Key** — uniquely identifies every row in a table. Only **one** per table. Cannot be `NULL`.
- **Foreign Key** — a column that **references the primary key** of another table, enforcing referential integrity (you can't insert an order for a non-existent user).
- **Unique Key** — enforces uniqueness on a column (like email), but a table can have **multiple** unique keys and they allow **one NULL** (behavior varies by RDBMS).

```sql
CREATE TABLE users (
  id    INT PRIMARY KEY,

Question 7

What are ACID properties?

Accepted Answer

ACID is a set of guarantees that database transactions provide:

- **Atomicity** — a transaction is **all-or-nothing**. If any part fails, the entire transaction rolls back.
- **Consistency** — a transaction moves the database from one **valid state** to another, respecting all constraints and rules.
- **Isolation** — concurrent transactions do not **interfere** with each other. The result is the same as if they ran sequentially (exact behavior depends on the isolation level).
- **Durability** —

Question 8

WHERE vs HAVING; difference and order of execution.

Accepted Answer

- **WHERE** filters individual rows **before** grouping.
- **HAVING** filters groups **after** `GROUP BY` has been applied.

```sql
-- WHERE: filter rows before grouping
SELECT department, COUNT(*) AS cnt
FROM employees
WHERE status = 'active'
GROUP BY department
HAVING cnt > 5;           -- HAVING: filter groups after grouping
```

**Logical order of SQL execution:**

1. `FROM` (and `JOIN`)
2. `WHERE`
3. `GROUP BY`
4. `HAVING`
5. `SELECT`
6. `ORDER BY`
7. `LIMIT`

> You **cannot** use column al

Question 9

Find the 2nd highest salary — write the query.

Accepted Answer

**Approach 1 — LIMIT / OFFSET (MySQL, PostgreSQL):**

```sql
SELECT DISTINCT salary
FROM employees
ORDER BY salary DESC
LIMIT 1 OFFSET 1;
```

**Approach 2 — Subquery:**

```sql
SELECT MAX(salary) AS second_highest
FROM employees
WHERE salary < (SELECT MAX(salary) FROM employees);
```

**Approach 3 — DENSE_RANK window function (works for Nth highest):**

```sql
SELECT salary
FROM (
  SELECT salary, DENSE_RANK() OVER (ORDER BY salary DESC) AS rnk
  FROM employees
) ranked
WHERE rnk = 2;
```

> Us

Database