What is the difference between a vibecoder and an AI maker?

A vibecoder prompts an AI tool and ships whatever comes out, without understanding why it works or how to fix it when it breaks. An AI maker understands enough of the underlying concepts to debug failures, catch security holes, make architectural decisions, and extend their product beyond what the AI can generate in one shot. You do not need a computer science degree to be an AI maker. You need a working mental model of the 50 concepts in this guide.

Do I need to learn to code to use this guide?

No. This guide is written for people who build with AI tools — Claude Code, Cursor, Codex — and want to understand what their tools are doing. You do not need to write the code yourself. But understanding what JSON is, what a 401 means, what a context window is, and why your database query is slow will make you dramatically more effective at directing AI tools and catching their mistakes.

How long does it take to learn these 50 concepts?

At one concept per day, 50 days. At ten concepts per weekend, five weekends. Most people who read this guide carefully and refer back to it when they hit problems absorb most of it within a month of active building. The goal is not memorization — it is having enough of a mental model that you know what to Google when things break.

50 Tech Concepts Every Vibecoder Must Know (2026) | explainx.ai Blog

explainx.ainewsletter3.5k

workshops ↗

50 Tech Concepts Every Vibecoder Must Know (2026) | explainx.ai Blog | explainx.ai

The "who is json" moment is not the problem. The problem is building a product on a foundation you cannot see.

When everything works, not knowing what JSON is, what a 401 means, or why your database query is slow does not matter. When things break — and they always break — that invisible foundation becomes the only thing that matters. Your AI tool will generate an error message you cannot read. It will make a security decision you cannot evaluate. It will choose a data architecture you cannot extend. And you will be stuck, dependent on the AI to diagnose a problem neither of you fully understands.

This guide is not about learning to code. It is about building the mental models that make you effective at directing AI tools, catching their mistakes, and extending what they build beyond the first working version.

Fifty concepts. Organized into eight categories. Explained at the depth you need to actually use them — not deeper, not shallower.

Category 1: Data & Formats

1. JSON

What it is: JSON (JavaScript Object Notation) is a text format for structured data. It looks like this:

json

{
  "user": "Yash",
  "plan": "pro",
  "active": true,
  "tags": ["ai", "builder"],
  "usage": { "tokens": 142000, "limit": 500000 }
}

Five data types: strings (quoted text), numbers, booleans (true/false), arrays (square brackets, ordered), objects (curly braces, key-value pairs). That is the entire format.

Why it matters: Every API call your AI product makes sends and receives JSON. Every database driver serializes to JSON. Every error message your AI tool generates arrives as JSON. When you get a "SyntaxError: Unexpected token" it means the JSON is malformed — a missing comma, an extra bracket, an unquoted key.

AI maker understanding: You can read a JSON payload in the browser's Network tab, identify the field causing a bug, and tell your AI tool exactly what is wrong rather than pasting the entire error and hoping.

2. API (Application Programming Interface)

What it is: An API is a defined interface through which one piece of software talks to another. When your app calls the Stripe API to charge a card, the weather API to get a forecast, or the OpenAI API to call a model — those are all API calls.

An API has three parts: the endpoint (the URL you call), the request (what you send), and the response (what you get back). The API documentation tells you the shape of the request and response for each operation.

Why it matters: Your entire AI product is probably a thin orchestration layer over several APIs — an LLM API, a database API, a payments API, an auth API. Every one of those APIs has rate limits, authentication requirements, and specific error formats. When any of them fails, you get an error in that API's format.

AI maker understanding: When an API call fails, you look up the status code and error message in that API's documentation before asking your AI tool to fix it. Most API errors are documented. The AI tool often does not know which version of an API you are calling.

3. REST

What it is: REST (Representational State Transfer) is the dominant style for web APIs. A REST API structures its endpoints as resources (things) and uses HTTP methods to express actions on those resources:

snippet

GET    /users          → list all users
GET    /users/42       → get user 42
POST   /users          → create a new user
PUT    /users/42       → replace user 42
PATCH  /users/42       → update specific fields of user 42
DELETE /users/42       → delete user 42

REST APIs are stateless: every request contains everything needed to fulfill it. The server does not remember your previous request.

Why it matters: Nearly every API you call — Stripe, GitHub, Anthropic, Vercel, Supabase, Resend — is a REST API. Understanding the resource model helps you understand what endpoint to call, what data to send, and why a 404 on /users/42 means "no user with ID 42" rather than "the /users endpoint doesn't exist."

4. GraphQL

What it is: GraphQL is an alternative to REST where you send a query describing exactly the data you want, and the server returns exactly that — no more, no less.

graphql

query {
  user(id: "42") {
    name
    email
    subscriptions {
      plan
      status
    }
  }
}

One endpoint (/graphql), infinite shapes of data. You decide the response structure, not the API designer.

Why it matters: GitHub's API is GraphQL. Shopify's is GraphQL. Contentful is GraphQL. If your AI tool generates code to fetch GitHub data and uses REST when it should use GraphQL (or vice versa), every request will fail. Knowing which style an API uses is step one.

Key difference from REST: In REST, the server decides the response shape. In GraphQL, you decide. In REST, you might call five endpoints to assemble one screen. In GraphQL, you call one endpoint with a shaped query.

5. Webhooks

What it is: A webhook is an API call that goes in the reverse direction — instead of your app calling the external service, the external service calls your app when something happens.

When a Stripe payment succeeds: Stripe calls POST https://yourapp.com/webhooks/stripe with the payment details. When a GitHub push happens: GitHub calls POST https://yourapp.com/webhooks/github with the commit data. Your app receives the call and reacts.

Why it matters: Real-time updates in your AI product almost always come through webhooks. Email sending (Resend, SendGrid), payments (Stripe), authentication events (Auth0, Clerk), deployment completions (Vercel) — all webhook-driven. If your webhook handler fails, events are silently dropped. There is no polling fallback.

The AI maker trap: Your AI tool can generate a webhook handler. It cannot register the webhook URL in the external service's dashboard. That step requires you. If you skip it, your handler will never receive anything and no error will tell you why.

6. WebSockets

What it is: HTTP is a request-response protocol: you send a request, you get a response, the connection closes. WebSockets are a persistent two-way connection: both sides can send messages at any time without making a new request.

Real-time chat, live collaborative editing, live price feeds, agent streaming output — all WebSocket use cases.

Why it matters: When Claude Code or Cursor streams its response word by word, that is a WebSocket (or SSE — see below) connection. When your AI product needs to push live updates to users without them refreshing, you need WebSockets or SSE.

SSE (Server-Sent Events): A simpler alternative to WebSockets for one-directional streaming (server → client). Most LLM APIs stream their output via SSE. WebSockets are bidirectional (both sides can send). If you are building a chat interface, SSE is often enough and simpler to implement.

7. CSV and Structured Data

What it is: CSV (Comma-Separated Values) is the simplest structured data format — columns separated by commas, rows separated by newlines:

snippet

name,email,plan
Yash,[email protected],pro
Alex,[email protected],free

Other common formats: TSV (tab-separated), JSONL (one JSON object per line — used for LLM training data and structured logs), Parquet (binary columnar format for analytics).

Why it matters: If your AI product handles user data exports, imports, analytics, or fine-tuning datasets, you will encounter CSV and JSONL constantly. The AI tool will generate parsers for them. Knowing the format helps you verify the parser is correct and debug encoding issues (the bane of CSV: commas inside fields, quoted strings, Unicode).

8. Binary Data and Base64

What it is: JSON, CSV, and text APIs transfer text. Images, audio, PDFs, and other binary files cannot be embedded in text directly. Base64 is an encoding that converts binary data into a string of ASCII characters — making it safe to embed in JSON, URLs, or email bodies.

A 100KB image becomes a ~133KB Base64 string (25% overhead from encoding).

Why it matters: Multimodal AI APIs (sending images to vision models) pass images as Base64 strings in the JSON payload. File uploads to S3 or Supabase Storage involve binary streams. When your AI product's image upload breaks, the failure is almost always at the boundary between binary data and the text-based transport layer.

Category 2: HTTP & Networking

9. HTTP Methods

What it is: HTTP defines a set of methods (also called verbs) that describe the intended action of a request:

Method	Intent	Body?	Safe?	Idempotent?
GET	Retrieve data	No	Yes	Yes
POST	Create / submit	Yes	No	No
PUT	Replace entirely	Yes	No	Yes
PATCH	Update partially	Yes	No	Sometimes
DELETE	Remove	Sometimes	No	Yes
OPTIONS	Describe what's allowed	No	Yes	Yes

Safe means the request has no side effects (safe to retry). Idempotent means calling it multiple times produces the same result as calling it once.

Why it matters: Sending a POST twice creates two records. Sending a PUT twice is fine. When your AI tool chooses the wrong method — using POST where PUT is correct — you get duplicate records in production. When a CORS preflight (OPTIONS) fails, your POST never fires and you get a mysterious network error.

10. HTTP Status Codes

What it is: Every HTTP response has a three-digit status code communicating the outcome:

2xx — Success:

200 OK — standard success
201 Created — resource created successfully
204 No Content — success, nothing to return (common on DELETE)

3xx — Redirect:

301 Moved Permanently — URL has changed forever
302 Found — temporary redirect
304 Not Modified — use your cached version

4xx — Client Error (you sent something wrong):

400 Bad Request — malformed request, missing fields, invalid data
401 Unauthorized — not authenticated (no token, expired token)
403 Forbidden — authenticated but not allowed (wrong permissions)
404 Not Found — resource does not exist at this URL
409 Conflict — state conflict (e.g., duplicate unique field)
422 Unprocessable Entity — syntactically valid but semantically wrong
429 Too Many Requests — rate limited

5xx — Server Error (something on their end broke):

500 Internal Server Error — unhandled exception on the server
502 Bad Gateway — upstream server is down
503 Service Unavailable — server is overloaded or in maintenance
504 Gateway Timeout — upstream took too long

Why it matters: A 401 and a 403 look similar but require completely different fixes. A 401 means add or refresh your auth token. A 403 means your auth is fine but you lack permission — a different problem entirely. Your AI tool will often conflate the two.

11. HTTP Headers

What it is: Headers are key-value metadata attached to every HTTP request and response. They carry authentication, content type, cache instructions, CORS permissions, and more.

Common request headers:

snippet

Authorization: Bearer eyJhbGciOiJI...   ← auth token
Content-Type: application/json           ← format of body you're sending
Accept: application/json                 ← format you want back
X-API-Key: sk-1234                       ← API key auth

Common response headers:

snippet

Content-Type: application/json           ← format of the response body
Cache-Control: max-age=3600              ← cache for 1 hour
Access-Control-Allow-Origin: *           ← CORS permission
X-RateLimit-Remaining: 47               ← how many requests you have left

Why it matters: Most authentication problems are header problems. Most CORS errors are missing response headers. Rate limit information is in headers. When your AI tool generates a fetch call that works locally but fails in production, check the headers first.

12. Query Parameters and URL Structure

What it is: A URL has several parts:

snippet

https://api.example.com/users?page=2&sort=name&filter=active#section
│       │               │     │                              │
scheme  host            path  query parameters               fragment

Query parameters (after ?, separated by &) are key-value pairs that filter, paginate, or configure the response. They are part of the URL — bookmarkable, shareable, cacheable.

The path identifies the resource. The query parameters modify how you want it. The fragment (#) is client-only — it is never sent to the server.

Why it matters: Pagination, filtering, and sorting in your AI product almost always go in query parameters. When your AI tool builds a paginated list and page 2 returns the same results as page 1, it is usually because the query parameter is being ignored or incorrectly encoded.

13. HTTPS and TLS

What it is: HTTP transfers data in plain text. Anyone on the same network can read it. HTTPS (HTTP Secure) wraps HTTP in TLS (Transport Layer Security), encrypting all traffic between client and server.

TLS provides:

Encryption — no one in the middle can read the content
Authentication — the server proves it is who it says it is (via certificates)
Integrity — data cannot be modified in transit without detection

Why it matters: If your AI product runs over plain HTTP in production, auth tokens, API keys, and user data are transmitted in plaintext. Any network observer can steal them. Vercel, Netlify, and most modern hosts enforce HTTPS automatically. But if you run your own server or use an internal tool, this is not automatic.

The padlock and the certificate: HTTPS does not mean the site is safe. It means the connection is encrypted. A phishing site can have HTTPS. The certificate only proves you are talking to the server you think you are — not that that server is trustworthy.

What it is: Browsers enforce a Same-Origin Policy — by default, a page at https://yourapp.com cannot make API calls to https://api.yourapp.com (different subdomain) or https://stripe.com (different domain). CORS is the mechanism that lets servers explicitly allow cross-origin requests.

The server responds with headers like:

snippet

Access-Control-Allow-Origin: https://yourapp.com
Access-Control-Allow-Methods: GET, POST, PUT, DELETE
Access-Control-Allow-Headers: Authorization, Content-Type

The preflight: For non-simple requests (POST with JSON, anything with Authorization header), the browser first sends an OPTIONS request to ask "can I make this call?" If the server does not respond with the right headers, the actual request is never sent.

Why it matters: "CORS error" in the browser console means the server is not sending the right CORS headers — not that your frontend code is wrong. The fix is always on the server side. When your AI tool's frontend calls your AI tool's backend and you get a CORS error in local development, it means the backend server is not configured to allow calls from localhost.

15. DNS (Domain Name System)

What it is: DNS translates human-readable domain names into IP addresses. When you visit explainx.ai, your computer asks a DNS server: "what IP address is explainx.ai?" The DNS server returns 76.76.21.21 (or whatever). Your computer connects to that IP.

DNS record types:

A — maps a domain to an IPv4 address
AAAA — maps a domain to an IPv6 address
CNAME — maps a domain to another domain (alias)
MX — specifies email servers for a domain
TXT — arbitrary text, used for domain verification
NS — specifies the authoritative nameservers for a domain

Why it matters: When you connect your custom domain to Vercel, you set a CNAME record pointing yourapp.com to cname.vercel-dns.com. When you add Google Workspace, you add MX records. When you verify domain ownership for any service, you add a TXT record. DNS changes propagate slowly (minutes to 48 hours) — this is why "just wait for DNS to propagate" is standard advice.

TTL: Every DNS record has a Time To Live — how long DNS resolvers cache it before asking again. A TTL of 3600 means changes take up to an hour to propagate everywhere.

16. Ports

What it is: A server at IP address 192.168.1.1 runs many services simultaneously. Ports are numbered (0–65535) channels that route network traffic to the right service.

Standard ports:

80 — HTTP
443 — HTTPS
5432 — PostgreSQL
6379 — Redis
3000 — Node.js dev server (convention)
8080 — alternative HTTP dev server
3306 — MySQL

When you visit http://localhost:3000, you are connecting to port 3000 on your local machine. When your database connection string says postgresql://localhost:5432/mydb, you are connecting to PostgreSQL on port 5432.

Why it matters: Port conflicts are common in development — two services trying to use port 3000. "Address already in use" errors mean something is already on that port. lsof -i :3000 (on Mac/Linux) shows you what is using a port.

Category 3: Client, Server & Infrastructure

17. Client vs Server

What it is: In web applications, client is the browser (or mobile app) running on the user's device. Server is a computer running your application logic, accessible over the network.

This distinction matters most for:

Security: Code running on the client is visible to anyone who opens DevTools. Never put secrets (API keys, database credentials, private logic) in client-side code. Anyone can read it.

Trust: You cannot trust client-side input. A user can modify what the client sends. All validation must happen on the server.

Performance: Network round-trips between client and server take time. The goal is to minimize them without moving secret logic to the client.

In Next.js specifically: Server Components run on the server (can access databases directly, never exposed to the browser). Client Components run in the browser (can use state, event handlers, browser APIs). Your AI tool will sometimes generate Server Component code using useState (browser-only) or Client Component code accessing the database directly (a security hole). Knowing this distinction lets you catch these mistakes.

18. Localhost and the Loopback Interface

What it is: localhost is a hostname that always resolves to your own machine — specifically to the IP address 127.0.0.1 (IPv4) or ::1 (IPv6). These are loopback addresses — traffic never leaves your machine, it loops back to itself.

This is why "there's no place like 127.0.0.1" — localhost is home.

When you run a dev server on localhost:3000, it is only accessible from your machine. No one else on the network can reach it (without a tunnel tool like ngrok or Cloudflare Tunnel).

Why it matters: Webhook development is the main pain point. Stripe, GitHub, and other services need to call a URL to deliver webhook events. They cannot call http://localhost:3000/webhooks/stripe — they have no route to your machine. You need a tunnel (ngrok, Cloudflare Tunnel) that creates a public URL forwarding to your localhost server.

19. Environment Variables

What it is: Environment variables are configuration values stored outside your code — in the shell environment, in a .env file, or in a hosting platform's settings — and injected into your application at runtime.

bash

# .env file (NEVER commit this to git)
DATABASE_URL=postgresql://user:pass@localhost:5432/mydb
OPENAI_API_KEY=sk-proj-...
STRIPE_SECRET_KEY=sk_live_...
NEXT_PUBLIC_STRIPE_PUBLISHABLE_KEY=pk_live_...

In Node.js, you access them as process.env.OPENAI_API_KEY. In Python, as os.environ.get("OPENAI_API_KEY").

The NEXT_PUBLIC_ prefix: In Next.js, environment variables are server-only by default. Prefixing with NEXT_PUBLIC_ makes them available to client-side code too — but also makes them visible to every user who opens the page. Only use NEXT_PUBLIC_ for values that are genuinely public (publishable API keys, feature flags).

Why it matters: The .env file must be in your .gitignore. One accidental commit of your .env file to a public GitHub repo will result in your API keys being scraped and misused within minutes. Automated bots monitor every public commit for credential patterns.

20. CDN (Content Delivery Network)

What it is: A CDN is a network of servers distributed globally. When you deploy your frontend to Vercel or Cloudflare, static assets (images, JS, CSS) are copied to servers in dozens of cities. When a user in Tokyo requests your image, they get it from a Tokyo server — not from your origin server in Virginia. Latency: 5ms vs 200ms.

What lives on CDN vs origin:

CDN: static files (HTML, CSS, JS bundles, images, fonts, videos)
Origin: dynamic responses (API calls, database queries, personalized content)

Why it matters: When you deploy and your new code is live but users are seeing old UI, CDN caching is the likely culprit. The CDN served a cached version of your page. Vercel's CDN invalidates automatically on deployment. If you are using Cloudflare in front of Vercel, you may need to purge the Cloudflare cache separately.

Cache-Control headers tell CDNs and browsers how long to cache a response. max-age=0, no-store disables caching entirely. max-age=31536000, immutable caches for a year and tells the CDN it will never change.

21. Serverless Functions

What it is: A serverless function (also called a Lambda, Edge Function, or Cloud Function) is a single function deployed to the cloud without a persistent server. It starts on demand, runs, and stops. You pay per invocation and execution time, not for 24/7 server uptime.

In Next.js, every file in /app/api/ becomes a serverless function. In Vercel, they run on AWS Lambda or Vercel's own edge runtime.

Limitations:

Cold starts: The first request after inactivity spins up a new container (50ms–500ms delay)
Execution timeout: Usually 10–30 seconds max (Vercel default: 10s)
No persistent memory: The function has no state between invocations
No background jobs: The function lives only for the duration of the request

Why it matters: If your AI product needs to run a long LLM inference (30+ seconds), stream it rather than waiting. If you need to run background jobs (send emails, process uploads), use a queue (Inngest, Trigger.dev) rather than trying to do it in the API route.

22. Edge vs Node.js Runtime

What it is: When you deploy to Vercel or Cloudflare, you choose where your code runs:

Node.js runtime: Standard V8 JavaScript, full Node.js APIs (filesystem, crypto, native modules). Runs in a container. Higher latency from user.
Edge runtime: Stripped-down runtime that runs at CDN nodes globally — close to your users. Ultra-low latency. But: no filesystem, limited Node.js APIs, no native modules.

Why it matters: Your AI tool may generate code using fs (filesystem), child_process, or a native npm package — none of which work on the edge runtime. If you see "the module 'fs' is not available on the Edge Runtime" — you either need to switch to the Node.js runtime or rewrite to avoid those APIs.

Geolocation-based features (show content based on user country) work best on the edge. Heavy computation (LLM inference, image processing) needs the Node.js runtime.

Category 4: Authentication & Security

23. Authentication vs Authorization

What it is: These two words are not interchangeable:

Authentication (AuthN): Who are you? Verifying identity. Logging in with email/password, Google OAuth, passkeys.
Authorization (AuthZ): What are you allowed to do? Checking permissions. Can this user delete this resource? Can a free plan user access this feature?

Why it matters: A 401 (Unauthorized) means authentication failed — the user is not logged in or their token is invalid. A 403 (Forbidden) means authorization failed — the user is logged in but does not have permission. These require completely different fixes. Conflating them is the most common auth bug.

24. JWT (JSON Web Tokens)

What it is: A JWT (pronounced "jot") is a compact, self-contained token that encodes information and can be verified without calling a database. It has three parts, Base64-encoded and joined with dots:

snippet

eyJhbGciOiJIUzI1NiJ9.eyJ1c2VySWQiOiI0MiIsInBsYW4iOiJwcm8iLCJleHAiOjE3NTAxNjAwMDB9.HMAC_SIGNATURE

Decoded: header.payload.signature

The payload contains claims (user ID, email, roles, expiration time). The signature (using a secret key) ensures the token has not been tampered with. The server can verify a JWT without a database lookup — it just checks the signature.

Why it matters: When your auth token expires and your user gets logged out unexpectedly, it is because the JWT's exp (expiration) claim was reached. The refresh token flow (exchanging a long-lived refresh token for a new short-lived access token) is the standard fix. Your AI tool will generate this correctly most of the time, but the expiry window and refresh logic are where bugs live.

Never store JWTs in localStorage. They are vulnerable to XSS attacks. Use HttpOnly cookies — the browser sends them automatically but JavaScript cannot access them.

25. OAuth 2.0

What it is: OAuth 2.0 is the protocol behind "Sign in with Google/GitHub/Apple" buttons. Instead of giving your app a user's password, the user logs in with the external service, which issues your app a token confirming the user's identity.

The flow:

User clicks "Sign in with GitHub"
Your app redirects to GitHub's authorization page
User approves
GitHub redirects back to your callback URL with an authorization code
Your server exchanges the code for an access token
Your server uses the access token to get the user's GitHub profile

Why it matters: The code in step 4 is single-use and expires in seconds. If your callback URL is wrong, the user gets a redirect to a page that does not exist. If your server takes too long to exchange the code, it expires and the login fails. When "sign in with Google" is broken, it is almost always a misconfigured redirect URI or an expired code.

26. API Keys

What it is: An API key is a secret string that authenticates requests to an external service. When your app calls the Anthropic API, it sends Authorization: Bearer sk-ant-api03-... in the header. Anthropic's server checks that key against its database, identifies your account, and either allows or rejects the request.

Types:

Secret keys: Never expose to the browser. Start with sk_ in Stripe, sk-ant- in Anthropic.
Publishable keys: Safe for browser use. Start with pk_ in Stripe.
Scoped keys: Keys with limited permissions (read-only GitHub token, specific S3 bucket access).

Why it matters: API key exposure is the most common security incident in AI products built fast. One console.log(process.env.ANTHROPIC_API_KEY) in a Client Component ships your secret to every user's browser. Rotate keys immediately if you suspect exposure. Set spending limits on all API accounts — a leaked key with no limit can run up thousands of dollars in minutes.

27. Sessions and Cookies

What it is: HTTP is stateless — every request is independent. Sessions are how servers maintain state across requests (remember that you are logged in).

A cookie is a small piece of data the server sends in a Set-Cookie response header, and the browser automatically includes in every subsequent request to that domain.

Session flow:

User logs in
Server creates a session record in the database (or generates a JWT)
Server sets a cookie: Set-Cookie: session=abc123; HttpOnly; Secure; SameSite=Strict
Browser sends Cookie: session=abc123 on every subsequent request
Server looks up abc123 in the session store to identify the user

Cookie flags that matter:

HttpOnly — JavaScript cannot read this cookie (blocks XSS)
Secure — only sent over HTTPS (blocks network interception)
SameSite=Strict — not sent on cross-site requests (blocks CSRF)

28. Rate Limiting

What it is: Rate limiting restricts how many requests a client can make in a time window. "429 Too Many Requests" is the rate limit status code. Most APIs have rate limits expressed as X requests per Y seconds/minutes/hours.

Common rate limit headers:

snippet

X-RateLimit-Limit: 100
X-RateLimit-Remaining: 47
X-RateLimit-Reset: 1750160000
Retry-After: 30

Strategies:

Fixed window: 100 requests per minute, resets on the minute
Sliding window: 100 requests in any 60-second period
Token bucket: Credits accumulate over time, burst is allowed

Why it matters: If you build a feature that calls an LLM API in a loop without rate limit handling, one bug sends thousands of API calls and triggers your rate limit — or runs up a large bill. Implement exponential backoff: wait 1 second after the first 429, 2 seconds after the second, 4 after the third, and so on.

29. SQL Injection (and Input Validation)

What it is: SQL injection is an attack where user input is interpreted as SQL code. If your query is:

javascript

const query = `SELECT * FROM users WHERE email = '${userInput}'`;

And the user enters ' OR '1'='1, the query becomes:

sql

SELECT * FROM users WHERE email = '' OR '1'='1'

Which returns every user in the database. With a destructive payload, the attacker can delete tables.

The fix: parameterized queries. Never concatenate user input into SQL strings.

javascript

// Safe — user input is a parameter, not code
const result = await db.query('SELECT * FROM users WHERE email = $1', [userInput]);

Why it matters: Your AI tool will sometimes generate string-concatenated SQL queries, especially when writing raw SQL rather than using an ORM. Any user input that touches a database query must go through parameterized queries or an ORM that handles this automatically (Prisma, Drizzle, SQLAlchemy).

Category 5: Databases

30. SQL vs NoSQL

What it is: The two major database paradigms:

SQL (relational databases): Data lives in tables with defined columns and types. Rows in different tables relate through foreign keys. Queries use SQL. Examples: PostgreSQL, MySQL, SQLite. Best for: structured data with defined relationships, financial records, user data with complex queries.

NoSQL: A catch-all for non-relational databases. Key-value stores (Redis), document stores (MongoDB — JSON documents), graph databases (Neo4j), wide-column stores (Cassandra). Best for: unstructured data, high-volume writes, flexible schemas, caching.

Why it matters: Most AI applications use both. PostgreSQL with pgvector for persistent user data and vector embeddings. Redis for caching and rate limit counters. Knowing which database type fits which problem prevents the classic mistake of trying to run complex relational queries against a document store or storing billions of events in a row-based relational database.

31. Primary Keys and UUIDs

What it is: A primary key uniquely identifies a row in a database table. Every row must have one. Two common approaches:

Auto-incrementing integers: id = 1, 2, 3, 4... Simple, compact, but exposes information (anyone can guess the next ID — if your order is #342, the competitor just placed order #341).

UUIDs (Universally Unique Identifiers): id = "f47ac10b-58cc-4372-a567-0e02b2c3d479" — a random 128-bit value. Not guessable. Can be generated client-side without a database round-trip. Larger storage size.

ULIDs and NanoIDs: Alternatives to UUID with timestamp-ordering (ULID) or shorter strings (NanoID — 21 characters). Many modern projects use these.

Why it matters: If your AI product exposes resource IDs in URLs (/orders/342), sequential integers are a security issue — a user can enumerate all orders. UUIDs in URLs prevent this. Your AI tool will often generate integer primary keys by default; explicitly ask for UUIDs in your schema.

32. Foreign Keys and Relations

What it is: A foreign key is a column in one table that references the primary key of another table, creating a relationship between them.

sql

CREATE TABLE users (id UUID PRIMARY KEY, email TEXT);
CREATE TABLE posts (
  id UUID PRIMARY KEY,
  user_id UUID REFERENCES users(id) ON DELETE CASCADE,
  content TEXT
);

posts.user_id references users.id. This relationship lets you join tables:

sql

SELECT posts.content, users.email FROM posts JOIN users ON posts.user_id = users.id;

Cascade behavior: ON DELETE CASCADE means if the user is deleted, all their posts are deleted automatically. ON DELETE RESTRICT prevents deletion if related records exist. ON DELETE SET NULL sets the foreign key to null.

Why it matters: If you delete a user without CASCADE set up, and the app tries to display their orphaned posts, it crashes. Choosing the right cascade behavior is an architectural decision your AI tool will make for you — often incorrectly for your specific use case.

33. Database Indexes

What it is: An index is a data structure that speeds up queries on specific columns. Without an index, a query like SELECT * FROM users WHERE email = '[email protected]' scans every row in the table (O(n)). With an index on email, it is a binary tree lookup (O(log n)).

The cost: indexes take disk space and slow down writes (every INSERT/UPDATE must update the index too). You trade write performance for read performance.

What to index:

Columns you filter on frequently (WHERE email = ?)
Columns you sort on (ORDER BY created_at)
Columns used in JOINs (foreign keys)
Unique constraints (automatically creates an index)

Why it matters: A table with 100 rows is fast without indexes. The same table with 10 million rows, queried without an index, will time out in production. Your AI tool will rarely add indexes to generated schemas. When your query goes from instant in development to 30 seconds in production, a missing index is the most likely cause.

34. Database Migrations

What it is: A migration is a script that describes a change to the database schema — adding a table, adding a column, removing a column, changing a data type. Migrations run in sequence and are version-controlled alongside your code.

sql

-- 0003_add_subscription_tier.sql
ALTER TABLE users ADD COLUMN subscription_tier TEXT DEFAULT 'free';
CREATE INDEX idx_users_subscription_tier ON users(subscription_tier);

Why migrations matter: When your code expects a column that does not exist in production because the migration was not run, every API call that touches that table throws a 500. When you roll back code without rolling back the migration, columns the old code does not know about cause confusion. Migrations make schema changes reproducible and reversible.

With Prisma: prisma migrate dev generates and applies migrations from your schema changes. prisma migrate deploy runs pending migrations in production. Never edit a migration file after it has been applied.

35. ORMs (Object-Relational Mappers)

What it is: An ORM maps database tables to code objects, letting you write queries in your programming language instead of SQL.

typescript

// Prisma ORM — TypeScript
const user = await prisma.user.findUnique({
  where: { email: '[email protected]' },
  include: { posts: true },
});

vs raw SQL:

sql

SELECT users.*, posts.* FROM users
LEFT JOIN posts ON posts.user_id = users.id
WHERE users.email = '[email protected]';

Common ORMs: Prisma (TypeScript), Drizzle (TypeScript, SQL-first), SQLAlchemy (Python), ActiveRecord (Ruby).

The tradeoff: ORMs make common queries fast to write and type-safe. They make complex queries harder — sometimes generating inefficient SQL. The N+1 query problem (fetching a list of users, then querying for each user's posts individually) is the classic ORM trap. Your AI tool will sometimes generate N+1 queries; recognize them by seeing hundreds of identical queries in your logs.

36. Connection Pooling

What it is: Opening a database connection is expensive — it involves network handshakes, authentication, and resource allocation. A connection pool maintains a set of pre-opened connections and reuses them across requests.

Without pooling: every request opens a connection, uses it, closes it. With a pool: connections are checked out, used, and returned.

Why it matters in serverless: Each serverless function invocation may try to open its own database connection. With thousands of concurrent invocations, you exhaust the database's connection limit (PostgreSQL default: 100). PgBouncer and Supabase Pooler solve this for PostgreSQL. Neon and PlanetScale have built-in connection pooling. Your AI tool may generate a database client that opens a new connection per request — correct for a long-running server, fatal for serverless.

Category 6: Code Fundamentals

37. Async/Await and Promises

What it is: JavaScript is single-threaded. If a database query takes 200ms and you block the thread waiting, nothing else can happen for 200ms. Promises are JavaScript's way of representing a future value. Async/await is syntax sugar over Promises.

javascript

// This blocks — bad
const user = database.query('SELECT * FROM users WHERE id = 1'); // 200ms wait

// This does not block
const user = await database.query('SELECT * FROM users WHERE id = 1');
// While waiting, other requests can be handled

await can only be used inside an async function. A function that uses await returns a Promise.

The mistake: Forgetting await:

javascript

const user = prisma.user.findUnique({ where: { id: '42' } });
// user is a Promise object, not a user record
console.log(user.email); // undefined — the query hasn't run yet

This is one of the most common bugs in AI-generated code. If you see undefined where you expect a database value, check for a missing await.

38. Error Handling (try/catch)

What it is: When code throws an error, execution stops and the error propagates up the call stack until something catches it. In async code, unhandled Promise rejections crash your server or silently fail in the browser.

javascript

try {
  const user = await prisma.user.findUnique({ where: { id } });
  if (!user) throw new Error('User not found');
  return user;
} catch (error) {
  console.error('Failed to fetch user:', error);
  throw error; // re-throw so the caller knows it failed
}

The finally block: Runs regardless of success or failure. Used for cleanup (closing connections, releasing locks).

Why it matters: Without error handling, an unhandled exception in an API route returns a 500 with no useful message. With error handling, you can return a 404 when a record is not found, a 400 when input is invalid, and a 500 only when something truly unexpected happens. Your AI tool will often generate happy-path code without error handling; add it explicitly.

39. TypeScript Types

What it is: TypeScript adds static types to JavaScript. Types describe the shape of your data and are checked at compile time — catching bugs before the code runs.

typescript

type User = {
  id: string;
  email: string;
  plan: 'free' | 'pro' | 'enterprise';
  createdAt: Date;
};

function getUserEmail(user: User): string {
  return user.email; // TypeScript knows this exists and is a string
}

Why it matters for AI products: Your AI tool generates TypeScript. The types it generates reflect its assumptions about your data. When the AI generates a type that does not match your actual database schema, TypeScript will flag it — before you ship. Treating TypeScript errors as warnings and using any everywhere defeats the purpose.

The as escape hatch: user as unknown as AdminUser tells TypeScript "trust me, I know better." Use it sparingly. Every as is a place where runtime errors can hide from the type system.

40. Null, Undefined, and Optional Chaining

What it is: JavaScript has two "nothing" values:

null — explicitly set to nothing (const user = null means "intentionally empty")
undefined — not yet set (const user; — no value assigned)

In TypeScript, ? marks a field as optional (can be undefined):

typescript

type User = {
  email: string;
  phone?: string; // might not exist
};

Optional chaining (?.): Safely access nested properties that might be null/undefined:

javascript

const city = user?.address?.city; // returns undefined instead of throwing

Nullish coalescing (??): Provide a default:

javascript

const plan = user?.plan ?? 'free'; // 'free' if user.plan is null/undefined

Why it matters: "Cannot read properties of undefined (reading 'email')" is the most common JavaScript runtime error. It means you accessed a property on something that was null or undefined. In AI-generated code, this usually means an API response was assumed to have a field that sometimes does not exist, or an async operation was not awaited.

41. State and Side Effects

What it is: In a frontend application, state is data that can change and, when changed, causes the UI to re-render. In React, useState manages local component state.

typescript

const [count, setCount] = useState(0);
// count is the current value
// setCount is the function to update it

A side effect is anything that interacts with the world outside the component — fetching data, updating the DOM directly, setting up subscriptions. useEffect in React runs side effects after render.

typescript

useEffect(() => {
  fetch('/api/user').then(r => r.json()).then(setUser);
}, []); // empty array = run once on mount

Why it matters: Infinite render loops (the most common React bug) happen when a useEffect updates state without proper dependency management. If your AI product's dashboard refreshes infinitely or a component keeps re-fetching data in a loop, the useEffect dependency array is wrong.

42. Caching

What it is: Caching stores the result of an expensive operation so subsequent requests can use the stored result instead of repeating the operation.

Layers of caching in a web app:

Browser cache — the browser stores responses according to Cache-Control headers
CDN cache — the CDN stores static assets and sometimes dynamic responses
Application cache — Redis or an in-memory store holds computed values
Database query cache — the database or ORM caches query results
Function-level memoization — caching expensive function results in memory

Cache invalidation — deciding when a cached value is stale and should be refreshed — is famously hard. "There are only two hard things in computer science: cache invalidation and naming things." Stale cache is why your AI product sometimes shows users data that changed hours ago. Time-based invalidation (TTL — cache for N seconds, then expire) is the standard solution.

Category 7: Infrastructure & DevOps

43. Environment: Development, Staging, Production

What it is: Software lives in multiple environments during its lifecycle:

Development (local): Runs on your machine. Connects to a local or dev database. Has verbose logging. Costs nothing.
Staging (preview): A production-like environment for testing before release. Same stack as production, but with test data. Vercel calls these "Preview Deployments."
Production (prod): The live environment your users interact with. Real data. Real money. Mistakes here matter.

Why it matters: An action that is safe in development (wiping the database to test a migration, logging all API payloads) is catastrophic in production. The most important practice: production database connections should never be in a .env file that anyone casually runs. Separate credentials for each environment.

The .env.local convention: Development secrets. .env.production for production secrets (managed via hosting platform environment variables, not committed to git).

44. CI/CD (Continuous Integration / Continuous Deployment)

What it is: CI/CD is the practice of automating the steps between writing code and shipping it to users.

CI (Continuous Integration): On every push, automated tests run. If tests fail, the code is not merged. This prevents broken code from reaching the main branch.

CD (Continuous Deployment): When code is merged, it automatically deploys to staging or production. Vercel does this by default — push to main, the site updates.

Why it matters: Without CI, one developer's untested code breaks production and the team spends Saturday debugging. Without CD, deployment is a manual, error-prone ceremony. Both are things your AI tool can help set up (GitHub Actions for CI, Vercel for CD) but you need to understand them to configure the right triggers and failure conditions.

45. Logs

What it is: Logs are timestamped records of events in your running application. Every console.log, console.error, and console.warn in Node.js goes to your application logs.

Log levels:

DEBUG — verbose, only in development
INFO — normal events (user logged in, payment processed)
WARN — unexpected but recoverable (retry after rate limit)
ERROR — failures that need attention (database query failed)

Why it matters: When your AI product has a bug in production that you cannot reproduce locally, logs are the only visibility you have. In Vercel: Functions → Logs. In Cloudflare: Workers → Logs. The pattern "I can't reproduce it locally" usually means an environment-specific condition (different env var, different data, race condition) that only logs can reveal.

Structured logging: Logging JSON objects rather than plain strings makes logs searchable: console.log({ event: 'payment_failed', userId, error: e.message }).

46. Docker and Containers

What it is: A Docker container packages your application with all its dependencies (runtime, libraries, configuration) into a portable unit that runs identically everywhere. "It works on my machine" becomes irrelevant — the container is the machine.

A Dockerfile describes how to build the image:

dockerfile

FROM node:20-alpine
WORKDIR /app
COPY package*.json ./
RUN npm install
COPY . .
RUN npm run build
EXPOSE 3000
CMD ["npm", "start"]

Why it matters: If your AI product has a complex backend (custom Python inference, specific system library versions, GPU requirements), Docker is how you ensure it runs the same in development and production. Vercel and most PaaS platforms handle this for you. But when you need a custom runtime or a non-standard stack, you write a Dockerfile.

Category 8: AI-Specific Concepts

47. Tokens (LLM)

What it is: Large language models do not process text character by character or word by word. They process tokens — chunks of text that roughly correspond to common word fragments.

"vibecoding" → ["vibe", "coding"] — 2 tokens "JSON" → ["JSON"] — 1 token "antidisestablishmentarianism" → ["anti", "dis", "estab", "lish", "ment", "arian", "ism"] — 7 tokens

A rough rule: 1 token ≈ ¾ of an English word. 100 tokens ≈ 75 words.

Why pricing is in tokens: Anthropic charges $3/M input tokens, $15/M output tokens (Sonnet 4.6). A typical conversation with 10 exchanges might use 2,000 tokens. A Claude Code session on a large codebase might use 200,000 tokens. A loop engineering session overnight might use 5,000,000 tokens.

Why it matters: Knowing roughly how many tokens your AI workflow uses helps you estimate cost, stay within context windows, and understand why long conversations degrade in quality (the model has less capacity to "remember" early context when the context window fills).

48. Context Window

What it is: The context window is the maximum number of tokens an LLM can process in a single call — the total of what you send it (system prompt + conversation history + retrieved documents) plus what it generates in response.

Current context windows:

Claude Opus 4.8: 200,000 tokens
GPT-5.5: 128,000 tokens
Gemini 2.0 Pro: 1,000,000 tokens

What goes in the context window:

System prompt
Conversation history
Retrieved documents (RAG)
Tool call results
The response being generated

Why it matters: When a Claude Code session gets very long and the model starts "forgetting" things you told it earlier or making contradictory decisions, it is because the context window is filling up and earlier content is being compressed or dropped. /compact in Claude Code summarizes and compresses the context. Understanding context limits helps you design your AI prompts more efficiently.

49. Embeddings and Vector Search

What it is: An embedding is a numerical representation of text (or an image, or audio) as a vector — a list of hundreds or thousands of numbers. Texts with similar meanings have vectors that are close together in this high-dimensional space.

snippet

"dog" → [0.23, -0.17, 0.84, ..., 0.02]  (1536 numbers)
"puppy" → [0.25, -0.15, 0.81, ..., 0.04] ← close to "dog"
"database" → [0.61, 0.43, -0.22, ..., 0.77] ← far from "dog"

Vector search: Store embeddings in a vector database (pgvector in PostgreSQL, Pinecone, Weaviate, Qdrant). At query time, embed the user's question and find the stored vectors closest to it. Return the documents those vectors came from.

Why it matters: Semantic search (find documents similar in meaning, not just containing the same keywords), recommendation systems, duplicate detection, and RAG all run on embeddings. If your AI product has a "search your data" feature, it probably uses vector search.

50. RAG (Retrieval-Augmented Generation)

What it is: A large language model has a knowledge cutoff — it does not know about events after its training data was collected, and it does not know about your private data. RAG is the technique of retrieving relevant documents at query time and including them in the context window, giving the model access to current, private, or domain-specific knowledge.

The RAG pipeline:

Ingest: Split your documents into chunks, embed each chunk, store vectors in a database
Query: Embed the user's question
Retrieve: Find the N most similar chunks using vector search
Generate: Pass question + retrieved chunks to the LLM: "Answer this question using the following documents: [chunks]"

Why it matters: Every AI product with a "chat with your data" feature uses RAG. Every AI assistant that needs to know about recent events uses RAG. The quality of your AI product depends critically on:

Chunk size (too small = no context; too large = irrelevant noise)
Embedding model choice (better embeddings = better retrieval)
Number of retrieved chunks (too few = miss relevant info; too many = dilute context)
Reranking (a second model that re-scores retrieved chunks for relevance)

RAG failures look like confident, wrong answers — the model retrieved a slightly irrelevant document and hallucinated the rest. When your AI product gives plausible but incorrect answers, retrieval quality is the first thing to debug.

Putting It Together: The AI Maker's Mental Model

A user opens your AI product. Here is what happens — and which of these 50 concepts is at work:

They type their URL and hit Enter → DNS (15) resolves the domain to an IP
Their browser connects over HTTPS/TLS (13)
The CDN (20) serves your static frontend from a nearby server
They click "Sign in with Google" → OAuth 2.0 (25) flow begins
Google redirects back with a code → your serverless function (21) exchanges it for a JWT (24)
The JWT is stored in an HttpOnly cookie (27)
They perform a search → REST API (3) call to your backend with JSON (1) body and Authorization header (11)
Your backend checks the JWT for authentication (23) and authorization (24)
It queries your PostgreSQL (30) database using an ORM (35) with indexes (33) on the search column
Results are cached in Redis (42) for subsequent identical queries
A matching document's text is embedded (49) and compared against your vector index via pgvector
The top matches go into a RAG (50) prompt with the user's question
The prompt is sent to the Anthropic API — your API key (26) in the header, the prompt consuming tokens (47) within the context window (48)
The response streams back via SSE (6) to the browser
The browser state (41) updates and the component re-renders
Logs (45) record the search event with user ID, latency, and token count
If the Anthropic API returns 429 (10) — rate limited — your code waits and retries with exponential backoff (28)

All 50 concepts. One user action. Now you can read the system you are building.

The Three Concepts That Pay Off Fastest

If you stop here and only learn three things:

HTTP status codes (concept 10) — read the three-digit number before reading the error message. It tells you what category of problem you have before any debugging.
Environment variables (concept 19) — your secrets live here, not in your code. One .env committed to a public repo ends the project. Check your .gitignore before the first git push.
Async/await and missing awaits (concept 37) — if a database value is undefined when it should not be, there is probably a missing await on the query. This is the single most common bug in AI-generated JavaScript/TypeScript.

Everything else compounds. But start here.

Related posts

"Who Is JSON?" — The Vibecoding Moment That Broke the Internet (And What It Actually Means)

Claude Code Artifacts: Shareable AI Sessions vs Lovable, Codex Sites, and v0

video-use: Edit Videos With Claude Code — No Premiere Pro Needed

Category 1: Data & Formats

1. JSON

2. API (Application Programming Interface)

3. REST

4. GraphQL

5. Webhooks

6. WebSockets

7. CSV and Structured Data

8. Binary Data and Base64

Category 2: HTTP & Networking

9. HTTP Methods

10. HTTP Status Codes

11. HTTP Headers

12. Query Parameters and URL Structure

13. HTTPS and TLS

14. CORS (Cross-Origin Resource Sharing)

15. DNS (Domain Name System)

16. Ports

Category 3: Client, Server & Infrastructure

17. Client vs Server

18. Localhost and the Loopback Interface

19. Environment Variables

20. CDN (Content Delivery Network)

21. Serverless Functions

22. Edge vs Node.js Runtime

Category 4: Authentication & Security

23. Authentication vs Authorization

24. JWT (JSON Web Tokens)

25. OAuth 2.0

26. API Keys

27. Sessions and Cookies

28. Rate Limiting

29. SQL Injection (and Input Validation)

Category 5: Databases

30. SQL vs NoSQL

31. Primary Keys and UUIDs

32. Foreign Keys and Relations

33. Database Indexes

34. Database Migrations

35. ORMs (Object-Relational Mappers)

36. Connection Pooling

Category 6: Code Fundamentals

37. Async/Await and Promises

38. Error Handling (try/catch)

39. TypeScript Types

40. Null, Undefined, and Optional Chaining

41. State and Side Effects

42. Caching

Category 7: Infrastructure & DevOps

43. Environment: Development, Staging, Production

44. CI/CD (Continuous Integration / Continuous Deployment)

45. Logs

46. Docker and Containers

Category 8: AI-Specific Concepts

47. Tokens (LLM)

48. Context Window

49. Embeddings and Vector Search

50. RAG (Retrieval-Augmented Generation)

Putting It Together: The AI Maker's Mental Model

The Three Concepts That Pay Off Fastest

Related Reading