Container Security: Image Scanning and Vulnerability Management

Implement comprehensive container security: from scanning images for vulnerabilities to runtime security monitoring and secrets protection.

published: March 25, 2026 reading time: 28 min read author: GeekWorkBench updated: June 17, 2026

Quick Summary

Container security means layering defenses across the whole build-deploy lifecycle. Trivy or Grype scan every image in CI before it reaches a registry, catching CVEs at build time rather than after deployment. Pin base images to digests, not tags, so :latest cannot mutate underneath you. Run containers as non-root, mount filesystems read-only, and drop all capabilities by default—Pod Security Standards enforce these baselines cluster-wide. Cosign signs images cryptically; a policy agent like Kyverno or OPA Gatekeeper rejects unsigned images at admission time. Runtime tools like Falco watch for anomalous syscalls (shell spawning, crypto mining) that no scanner catches. Secrets belong in external stores, injected at runtime—never baked into images or passed as environment variables. One tool does not cover everything: build-time scanning, runtime monitoring, and signing work together.

Introduction

Image Signing (Cosign) vs. Not

Use image signing when you deploy to production environments where you need to verify that only images you intentionally built reach your cluster. Signing matters most in multi-team environments where anyone could push to your registry, or when you pull base images from third parties.

Do not use image signing if you are the only person building and deploying images in a small team with a private registry. The operational overhead of key management exceeds the security benefit until you have multiple contributors.

Image signing works by having you generate a key pair with Cosign, sign your image digest after building, and push the signature as a separate OCI artifact alongside your image. When Kubernetes pulls the image, a policy controller like Kyverno or OPA Gatekeeper queries the registry for a valid signature before admitting the pod. If the signature is missing or invalid, the pod is rejected at admission time. An attacker who compromises your CI pipeline and pushes a modified image cannot get it to run in production without also stealing your Cosign private key.

Key management trips up most teams. If you lose the private key, you cannot sign anything new. If someone steals it, they can sign malicious images. Store the private key in a KMS (AWS KMS, Google Cloud KMS, HashiCorp Vault) and have your CI pipeline retrieve it at build time. Never commit the key to your repository. If you are just starting out, set up a hardware security module or cloud KMS integration from day one. Migrating key storage later is painful.

The failure mode that kills image signing adoption is teams signing images manually and forgetting to sign images built from CI. When a CI-built image gets rejected at admission because nobody signed it, trust in the entire system collapses. Automate signing as the last step of your build pipeline, not as an optional manual action. If your pipeline does not sign, it does not deploy.

Falco vs. Other Runtime Security Tools

Use Falco when you want open-source runtime security with an active community and Kubernetes-native integration. Falco has the largest rule set community and works well with standard Kubernetes logging.

Use alternatives like Sysdig Falco Enterprise or Aqua Security when you need commercial support, specific compliance framework integrations, or tighter integration with your SIEM.

Falco works by attaching to Linux kernel syscalls through a kernel module or eBPF probe. It evaluates system call events against a rules engine and fires alerts when behavior matches a rule. Rules are YAML files that define conditions like “bash spawned inside a container,” “writing to /etc/shadow,” or “network connection from a container to an external IP without a corresponding process.” The Falco project maintains a default rule set that covers common container escape techniques, crypto mining, and privilege escalation patterns.

The operational reality is that Falco generates noise until you tune it. Default rules fire constantly in most environments because your application legitimately reads configuration files, spawns shell processes, and makes network connections. Tuning means looking at which rules fire, identifying which are legitimate behavior for your workload, and creating suppression rules for those. Without tuning, Falco alerts become unreadable noise and security teams start ignoring them. Plan to spend the first two weeks of Falco deployment entirely on tuning.

Sysdig Falco Enterprise adds a commercial support tier, a pre-built library of application-specific rules for common middleware (Redis, PostgreSQL, nginx), and integration with Sysdig’s threat detection platform. If you are running a heterogeneous environment with many third-party services, the pre-built rules save significant tuning time. Aqua Security takes a different approach, focusing on image scanning, container firewalling, and runtime protection in an integrated platform. If you already use Aqua for image scanning, its runtime protection integrates more tightly than running Falco alongside a separate image scanning tool.

AppArmor and Seccomp: When They Are Overkill

AppArmor and Seccomp profiles are worth the effort for regulated environments (financial services, healthcare) or for workloads handling sensitive data. The performance overhead is minimal and the blast radius reduction is significant.

Do not invest in custom Seccomp profiles for stateless microservices with no external network access. The operational cost of maintaining profiles exceeds the risk reduction for low-sensitivity workloads. Use the default Docker Seccomp profile instead.

Seccomp operates at the kernel syscall level. By default, Linux allows a process to make hundreds of different syscalls. Seccomp lets you whitelist which syscalls a container can use. Docker ships with a default seccomp profile that blocks about 44 dangerous syscalls including mount, syslog, and perf_event_open. This default blocks a lot of attack surface without any configuration on your part. You hit problems when you try to build a custom restrictive profile for an application you do not fully understand: applications often need more syscalls than you expect, and a too-restrictive seccomp profile causes mysterious failures.

Building a custom seccomp profile is an iterative process. You start with a permissive profile and monitor which syscalls the application actually makes using tools like strace. Then you build a deny-by-default profile that allows only the syscalls you observed. This takes time and the profiles need updating when your application changes. For most workloads, the default Docker seccomp profile is sufficient. Custom profiles are premature optimization.

AppArmor works at a higher abstraction level than Seccomp. Instead of blocking syscalls directly, AppArmor controls access to files, capabilities, and network resources based on profiles attached to a process. A container with an AppArmor profile cannot read /etc/shadow even if it somehow obtains the capability to open files. AppArmor profiles are easier to write because they deal with application resources rather than raw syscalls, but they require the AppArmor kernel module to be loaded and profiles to be loaded into the kernel.

The real-world overhead of Seccomp and AppArmor is minimal. There is a measurable difference in microbenchmarks but real-world application performance is dominated by I/O and network wait times, not syscall filtering. The performance argument against these tools rarely holds up under actual production load. The cost is in maintenance: profiles need updates as applications evolve, and debugging a container that fails due to a missing syscall permission wastes time. Start with defaults, add AppArmor for network-level controls on sensitive workloads, and only build custom seccomp profiles when you have a specific threat model that requires it.

Image Scanning with Trivy and Grype

Scan every image before it touches your cluster. Not sometimes. Not in staging only. Every image, every push, in your CI pipeline.

Trivy is the default choice for most teams. It is fast, has a large vulnerability database, and integrates with most CI systems.

# Install Trivy
brew install trivy

# Scan an image
trivy image myregistry/myapp:latest

# Scan in CI with exit code on high vulnerabilities
trivy image --exit-code 1 --severity HIGH,CRITICAL myregistry/myapp:latest

Grype is another option, particularly if you want to scan SBOMs (Software Bills of Materials) or need a different database backend.

# Install Grype
brew install grype

# Scan with SBOM input
grype sbom:./sbom.json

# JSON output for automation
grype image myregistry/myapp:latest -o json > results.json

Both tools pull from multiple vulnerability databases including the Ubuntu, Debian, and Alpine security feeds, plus the Python Package Index and npm registry.

SBOM Generation and Vulnerability Tracking

An SBOM is a formal record of the packages and dependencies in your software. Think of it as an ingredient list for your container image.

Generate SBOMs at build time:

# Generate SBOM with Syft
syft myregistry/myapp:latest -o spdx-json > sbom.spdx.json

# Or in Dockerfile build with buildpacks
pack build --builder heroku/buildpacks:20 myregistry/myapp:latest

SBOMs serve two purposes. First, when a new vulnerability drops (like Log4Shell), you can query your SBOM database to find every image affected in minutes, not hours. Second, SBOMs give you audit trails for compliance.

Store SBOMs alongside your images in a registry that supports it, or in a separate artifact storage.

Runtime Security with Falco

Scanning images at build time catches known vulnerabilities. Falco catches anomalous behavior at runtime, things that are not in any vulnerability database because they are specific to your environment.

Falco works by monitoring system calls. You define rules for behavior you consider suspicious:

# falco_rules.yaml
- rule: Detect shell in container
  desc: A shell was spawned inside a container
  condition: >
    container and
    proc.name = bash
  output: >
    Shell spawned in container
    (user=%user.name container=%container.name
    image=%container.image.repository)
  priority: WARNING

- rule: Detect crypto mining
  desc: Detect execution of known crypto miner
  condition: >
    spawned_process and
    proc.name in (cpuminer, nanominer, ethminer)
  output: >
    Crypto miner detected
    (user=%user.name command=%proc.cmdline)
  priority: CRITICAL

Deploy Falco as a DaemonSet in your cluster. It will generate events for every suspicious behavior it sees.

Non-Root Users and Read-Only Root Filesystems

Design your containers to run as non-root by default. This is harder than it sounds because many official images run as root internally.

# Create a non-root user in your Dockerfile
RUN addgroup -S appgroup && adduser -S appuser -G appgroup
USER appuser

# If you must run as root, switch before running the app
USER root
RUN some-privileged-operation
USER appuser

Pair non-root users with read-only filesystems. If an attacker compromises your container, they cannot write to the filesystem.

# Kubernetes pod spec
securityContext:
  readOnlyRootFilesystem: true
  runAsNonRoot: true
  runAsUser: 10000

You will need to identify which directories need write access and mount them as volumes.

Seccomp and AppArmor Profiles

Seccomp (secure computing mode) restricts the system calls a container can make. By default, containers can make hundreds of system calls. Seccomp lets you whittle that down to the handful your application actually needs.

{
  "defaultAction": "SCMP_ACT_ERRNO",
  "architectures": ["SCMP_ARCH_X86_64"],
  "syscalls": [
    {
      "names": ["read", "write", "exit", "sigreturn"],
      "action": "SCMP_ACT_ALLOW"
    }
  ]
}

AppArmor works at a higher level, controlling file access, capabilities, and network access based on profiles.

# Apply an AppArmor profile to a container (in Kubernetes with containerd)
container.apparmor.security.alpha.kubernetes.io/runtimeclass: "runtime/default"

Docker applies a default seccomp profile that blocks about 44 system calls. Kubernetes does not apply any default seccomp profile, so you need to set it explicitly if you want it.

Supply Chain Security

The SolarWinds and Codecov breaches showed what happens when attackers compromise upstream supply chains. Your containers are only as secure as their dependencies.

flowchart LR
    A[Image Build] --> B[Trivy Scan]
    B --> C{ Vulnerabilities found? }
    C -->|High/Critical| D[Block Deploy]
    C -->|None/Low| E[Generate SBOM]
    E --> F[Cosign Sign]
    F --> G[Push to Registry]
    G --> H[Kyverno Policy Check]
    H --> I{ Signature Valid? }
    I -->|No| J[Reject Pod]
    I -->|Yes| K[Deploy to Cluster]
    K --> L[Falco Runtime Monitor]
    L --> M[Alert on Anomaly]

Pin base images to specific digests, not tags. Tags are mutable; a node:18-alpine today is not the same as node:18-alpine in six months.

# Pin to digest, not tag
FROM node@sha256:a1b2c3d4e5f6... as builder

Use image signing. Cosign (part of Sigstore) lets you sign images and verify signatures at runtime.

# Sign an image
cosign sign --key cosign.key myregistry/myapp:latest

# Verify in Kubernetes with Kyverno
kubectl apply -f - <<EOF
apiVersion: kyverno.io/v1
kind: ClusterPolicy
metadata:
  name: require-signed-images
spec:
  validationFailureAction: enforce
  match:
    any:
    - resources:
        kinds:
        - Pod
EOF

Production Failure Scenarios

Failure	Impact	Mitigation
Trivy blocking deployment for critical CVE with no immediate patch	Build pipeline halts, deployment delayed	Establish a vulnerability exception process with risk acceptance sign-off, prioritize CVEs by exploitability (EPSS score) over severity alone
Falco false positives causing alert fatigue	Security team ignores alerts, real threats missed	Tune Falco rules to your environment, suppress known-false positives, review rule effectiveness quarterly
Container running as root escaping to host	Attacker gains host access, full cluster compromise	Enforce `runAsNonRoot: true` in PodSecurityPolicy, fail builds that produce root containers
Supply chain compromise via malicious base image	Backdoored image deployed to production	Pin base images to digests, use Cosign signature verification, scan all third-party images in CI
`:latest` tag image mutation causing inconsistency	Different nodes run different image versions, unpredictable behavior	Always tag builds with commit SHA, never pull `:latest` in production

Trade-off Analysis

Scenario	Trivy	Grype	Notes
Vulnerability database size	Large	Large	Both cover major OS and language package feeds
SBOM generation	Via Syft	Native	Grype handles SBOMs directly; Trivy requires Syft as separate step
CI integration	Native	Native	Both exit non-zero on findings
JSON output for automation	Yes	Yes	Both produce structured output
Speed (large images)	Fast	Fast	Comparable performance

Scenario	Falco (runtime)	Prevention-only	Notes
Detects zero-days	Yes	No	Runtime monitoring catches novel attacks
Performance overhead	Low (~5%)	None	Falco adds minimal latency
Requires tuning	Yes	No	Falco needs rule customization per environment
Compliance value	Medium	Low	Falco provides audit trail for behavior

Scenario	Image signing required	Signing optional	Notes
Multi-team registry	Yes	No	Signature verification prevents unauthorized pushes
Single-person builds	No	Yes	Key management overhead exceeds risk without multiple contributors
Regulated environments	Yes	No	SOC 2, PCI-DSS often require artifact signing

Scenario	Rootless containers	Privileged containers	Notes
Security posture	Strong	Weak	Rootless significantly reduces container escape impact
Compatibility	Most apps work	Legacy apps may need root	Worth migrating legacy apps rather than running privileged
Performance	No overhead	No overhead	No reason to use privileged containers

Container Security Observability

Monitor CVE counts per image as a metric in your CI pipeline. A spike in critical CVEs for an image you have not changed means one of your dependencies released a bad update. Set up alerts when image scan results change between builds.

Falco alert volume per rule tells you which rules are worth keeping. Rules that fire hundreds of times a day are noise. Suppress or remove them so real anomalies stand out.

Track container restart rates. Containers that restart every few minutes are either crashing or being evicted repeatedly. Both are worth investigating.

Key commands:

# Trivy scan with JSON output for metrics extraction
trivy image --exit-code 1 --severity HIGH,CRITICAL --format json myregistry/myapp:latest > scan-results.json

# Count CVEs by severity
jq '[.Results[].Vulnerabilities[]?.Severity] | group_by(.) | map({severity: .[0], count: length})' scan-results.json

# Falco alert volume by rule in the last hour
kubectl logs -l app=falco -n falco --since=1h | jq -r '.rule' | sort | uniq -c | sort -rn

# List images with most critical vulnerabilities across your cluster
kubectl get pods -A -o jsonpath='{range .items[*]}{.spec.containers[*].image}{"\n"}' | tr ' ' '\n' | sort -u | while read img; do echo "$img: $(trivy image --quiet --severity CRITICAL "$img" 2>/dev/null | grep -c CRITICAL || echo 0)"; done | sort -t: -k2 -rn | head -10

Common Pitfalls / Anti-Patterns

Running containers as root. Many official images run as root internally. If your container escapes, the attacker has root on the host. Use runAsNonRoot: true and design your images with a non-root user from the start.

Not scanning images. Skipping scans to speed up builds means known vulnerabilities reach production. Block high and critical CVEs in CI. If the build cannot pass, that is the signal to fix the dependency.

Using the :latest tag. When you pull node:18-alpine, you get whatever node:18-alpine means today. Pin to digests: node:18-alpine@sha256:abc123.... Test your builds against a fixed version.

Not signing images. In any environment where untrusted parties can push to your registry, signature verification prevents unauthorized images from running. Cosign makes this straightforward.

Skipping runtime monitoring. Image scanning only catches known vulnerabilities. An attacker exploiting a misconfiguration or a zero-day will not show up in any scan. Falco closes that gap.

Interview Questions

1. A container is running as root in a production pod. What are the risks and how do you fix it?

Running as root means if an attacker escapes the container, they have root access to the host. Risks include container breakout to host filesystem, binding to privileged ports, and capability escalation. Fix by: setting runAsNonRoot: true in pod security context, using a non-root user in the Dockerfile (USER instruction), and ensuring the image builds with a non-root user. Also set allowPrivilegeEscalation: false and drop all capabilities with capDrop: ALL.

2. You discover a critical CVE in a base image used across 200 microservices. Walk through your response.

First, stop the bleeding: block the vulnerable image version in your CI/CD admission control (OPA Gatekeeper or Kyverno). Identify all affected services via your image registry tags and deployment inventory. Prioritize by exposure (internet-facing vs internal) and data sensitivity. Build and push fixed images for the highest-priority services, test, and deploy. For lower-priority services, schedule into sprint planning. Set up automatic vulnerability scanning on new image pushes to catch this earlier. Consider a "golden image" strategy where security hardens a base image centrally.

3. How do you prevent a compromised CI/CD pipeline from deploying malicious images?

Use image signing and verification: sign images with Cosign or Notary during the build pipeline, then verify signatures at admission time using a policy controller (Kyverno or OPA Gatekeeper). Store signing keys in a KMS (AWS KMS, Google Cloud KMS, HashiCorp Vault). Enable admission control to reject unsigned images. Use short-lived tokens for CI/CD authentication rather than long-lived credentials. Audit all image pull events. Implement a software supply chain bill of materials (SBOM) to track what went into each image.

4. What is the difference between seccomp, AppArmor, and SELinux in the context of container security?

Seccomp restricts syscalls a container can make at the kernel level — the most granular control but requires knowing which syscalls an application needs. AppArmor works at the application level, restricting capabilities and file access paths — easier to use for known application profiles. SELinux works at the system level, labeling files and processes — most powerful but complex to configure. In practice: Docker defaults ship with a sensible seccomp profile blocking dangerous syscalls. For Kubernetes, seccomp via securityContext.seccompProfile and AppArmor via container.apparmor.security.beta.kubernetes.io are the common paths. SELinux is typically used at the host level.

5. How do you detect that a container has been compromised at runtime?

Runtime detection tools like Falco monitor syscall behavior and flag anomalous activity: a shell spawning inside a container, unexpected network connections, writing to sensitive paths like /etc/ or /root/. Sysdig captures system calls for deeper analysis. Network monitoring detects exfiltration attempts via unusual outbound traffic. Integrate these with your SIEM or alerting system. Also monitor container restart counts, unexpected process trees (kubectl top pods showing unusual CPU), and node-level indicators like new SSH keys in /root/.ssh/.

6. What are the key differences between Trivy and Grype for vulnerability scanning, and when would you choose one over the other?

Trivy is the default choice for most teams — it has a large vulnerability database, is fast, and integrates with most CI systems with minimal configuration. Grype is stronger when you need native SBOM support or want to scan SBOMs you already generated with Syft. Grype handles SBOM inputs directly while Trivy requires Syft as a separate step for SBOM generation. If your pipeline already generates SBOMs, Grype avoids adding another tool. If you want the simplest setup with the most baked-in integrations, Trivy wins.

7. Explain how Cosign image signing works and how you would enforce signature verification in a Kubernetes cluster.

Cosign generates a key pair, signs your container image digest, and stores the signature as an OCI artifact in your registry alongside the image. To enforce verification in Kubernetes, deploy Kyverno or OPA Gatekeeper as an admission controller. Create a policy that queries the registry for a valid Cosign signature before allowing the pod to start. If the image is not signed or the signature is invalid, the admission controller rejects the pod. Store the Cosign private key in a KMS (AWS KMS, Google Cloud KMS, HashiCorp Vault) — never in the pipeline itself. Rotation involves re-signing all images with the new key and updating the policy.

8. A container needs to write to the filesystem at runtime but you also want a read-only root filesystem. How do you handle this?

Identify the specific paths the application needs to write to at runtime. Common candidates are /tmp, /var/log, and /run. Create emptyDir volumes mount at those paths in your pod spec. Set readOnlyRootFilesystem: true globally, then selectively mount writable volumes to the specific locations the application requires. For log directories, mount an emptyDir or a mounted NFS share. This approach gives you the security benefit of a read-only root while allowing legitimate writes where needed. You can also use a tmpfs mount for temporary files.

9. How do Kubernetes Pod Security Standards (PSS) and Pod Security Admissions (PSA) work, and how do they compare to Pod Security Policies?

Pod Security Standards define three levels — privileged, baseline, and restricted — that cluster operators can enforce cluster-wide. Pod Security Admissions (built into Kubernetes 1.25+) is the controller that enforces those standards at the namespace level via labels. PSPs were the previous mechanism but were deprecated because they required a mutating admission webhook and had security gaps. To enforce restricted PSS on a namespace, label it pod-security.kubernetes.io/enforce: restricted. The PSA checks runs before pods are scheduled, rejecting those that violate the policy. This replaces the old PSP webhook approach with something simpler and more maintainable.

10. What are container capabilities, why is dropping ALL capabilities important, and how do you determine which capabilities your application actually needs?

Linux capabilities split the power of root into fine-grained units — CAP_NET_RAW lets a process send raw packets, CAP_SYS_TIME lets it set the system clock. Running as root in a container does not give all capabilities by default because Docker drops many, but not all. Dropping ALL capabilities and then adding back only what your application needs follows the principle of least privilege. To determine what your app needs: run it under seccomp with a permissive profile while monitoring which syscalls fire (use strace or Falco), then build a restrictive seccomp profile from that baseline. For capabilities, start with nothing and add them one at a time while testing functionality. CAP_NET_BIND_SERVICE is commonly needed for processes binding to ports below 1024.

11. How do you handle secrets management for containers — specifically, how do you avoid putting secrets in container images or environment variables?

Never bake secrets into images or pass them as plain-text environment variables — both end up in image layers and container logs. The Kubernetes-native approach is to use a secrets management tool like HashiCorp Vault with the Vault Secrets Operator or the CSI Secret Store driver, which mounts secrets as files in the container filesystem without ever exposing them as env vars. For Azure, use Key Vault with the AKV CSI provider. Alternatively, use Kubernetes external secrets with AWS Secrets Manager or GCP Secret Manager. The key principle: secrets should be injected at runtime from an external store, never baked into the image at build time.

12. Walk through the steps you would take to perform container forensics after detecting a potential compromise.

First, isolate the affected pod — prevent it from scheduling new work while you investigate. Capture the running container state: docker inspect for the container config, docker diff to see filesystem changes from the image, and docker logs for stdout/stderr. Extract the container's process tree with docker top and network connections with docker exec netstat or similar. Take a snapshot of the container's memory with docker checkpoint if your runtime supports it. Pull the image and compare it to the expected image digest. Preserve logs and audit trails before letting the pod restart or scaling it out.

13. How do you set up Kubernetes network policies to enforce container-to-container traffic and what are common mistakes in their configuration?

Network policies in Kubernetes are namespace-scoped and act as a firewall for egress and ingress traffic per pod. Label your namespaces and pods, then create a NetworkPolicy that selectors the appropriate pods. For a frontend-backend setup: the frontend policy allows ingress from the ingress controller only, the backend policy allows ingress from the frontend only. Common mistakes: forgetting that network policies are additive within a namespace (a pod with no policy is fully accessible), not accounting for DNS resolution (pods need to communicate with kube-system for DNS), and applying policies only to named namespaces without understanding that pods in unlabeled namespaces can still reach your services.

14. What is an SBOM, why does it matter for container security, and how do you generate and use one in a CI/CD pipeline?

An SBOM (Software Bill of Materials) is a structured inventory of every package and dependency in your container image — the equivalent of an ingredient list. It matters because when a new vulnerability drops (like Log4Shell), you query your SBOM database to identify every affected image in minutes rather than scanning each one individually. To generate one: use Syft to scan your image and produce an SPDX or CycloneDX SBOM. Store the SBOM alongside the image in your registry or in a separate artifact store. In CI/CD, generate the SBOM after building the image, store it as a build artifact, and integrate it with your vulnerability scanner so Grype or Trivy can correlate CVE data with your exact package versions.

15. How does Falco rule tuning work in practice, and how do you balance catching real threats against alert fatigue?

Start with the default Falco rule set and run in audit mode — Falco logs warnings but does not block. Collect a week of alerts and identify which rules fire hundreds of times per day. Those rules are noise in your environment. Suppress them by creating exceptions in falco config. Then identify which alerts represent genuine security signals by correlating with known incidents. Keep those rules. Review quarterly — rule effectiveness changes as your workload changes. The goal: security engineers should be able to investigate every Falco alert that fires in a day. If they cannot, you have too much noise and will miss real incidents.

16. What are the security implications of using hostPath volumes in Kubernetes and what alternatives should you use instead?

hostPath volumes let a container read/write files on the host node's filesystem. An attacker who escapes the container and has access to hostPath-mounted directories can read sensitive host data, write cron jobs to the host, or modify kubelet configuration. Alternatives: use Kubernetes ConfigMaps for configuration files, Secrets for credentials (via CSI or Vault), emptyDir for temporary storage, or PersistentVolumeClaims with appropriate access modes for persistent data. If you must use hostPath for system-level access (like the node's crictl socket for a CNI plugin), restrict it with PSP or PSS to only the specific service accounts that need it, and document why it is required.

17. How do you ensure that base images pulled from public registries are not compromised before using them in your builds?

Never pull by tag alone — tag mutability means node:18-alpine today is not the same image in six months. Pin to a specific digest in your Dockerfile: FROM node@sha256:abc123.... Scan every image in CI before it is used, even if it comes from a trusted registry like Docker Hub. Use a VEX (Vulnerability Exploitability eXchange) document to communicate which CVEs in your dependencies are not exploitable in your context. For critical workloads, maintain a hardened "golden image" base that your security team audits and signs with Cosign. Pull from official sources only and verify the registry's image signature when available.

18. Explain the difference between vulnerability scanning at build time versus runtime security monitoring. When would you rely on one versus the other?

Build-time scanning (Trivy, Grype) catches known vulnerabilities in your dependencies and base image layers before they reach production. It is preventive and deterministic — given the same image, it produces the same results. Runtime monitoring (Falco) catches anomalous behavior that is not in any vulnerability database: misconfiguration attacks, zero-days, and attacker behavior specific to your environment. You need both. Build-time scanning prevents known CVEs from reaching production. Runtime monitoring catches everything else — the attacks that exploit misconfigurations or vulnerabilities that have no CVE yet. Without runtime monitoring, a zero-day exploit of a medium-severity CVE will sail through because no scanner flags it.

19. What steps do you take to ensure your container image layers do not expose sensitive information or increase the attack surface unnecessarily?

Layer ordering matters: put instructions that change most frequently at the end of the Dockerfile so that cache invalidation does not rebuild sensitive layers. Never put secrets in RUN commands — they appear in the layer history. Use multi-stage builds so the final image contains only the runtime artifacts, not the build toolchain (which may include source code or build secrets). Set appropriate file permissions in the Dockerfile (chmod only what is needed). Remove package manager caches, package lists, and temporary files in the same layer that installs them. Validate that the final image does not contain shell, package managers, or debugging tools unless explicitly needed at runtime.

20. How would you integrate container security scanning into a CI/CD pipeline, and what do you do when a build fails due to a critical vulnerability?

In your CI pipeline, run trivy image --exit-code 1 --severity HIGH,CRITICAL after building and before pushing. If it exits with code 1, the build fails and the image is not pushed. Set appropriate severity thresholds — blocking on HIGH/CRITICAL is common; blocking on LOW/MEDIUM is too noisy for most teams. When a build fails due to a critical CVE, you have a few paths: update the affected dependency to a patched version (preferred), rebuild from a patched base image, apply a vulnerability exception with risk acceptance if the CVE is not exploitable in your context, or implement a compensating control like runtime monitoring to catch exploitation attempts. Never ignore critical CVEs without documented risk acceptance.

Layer	Tool	Preventative vs Detective	CI/CD vs Runtime
Image scanning	Trivy, Grype, Snyk	Preventative	CI/CD
Sig verification	Cosign, Notary	Preventative	CI/CD + Registry
Runtime monitoring	Falco, Sysdig	Detective	Runtime
Policy enforcement	OPA Gatekeeper, Kyverno	Preventative	Admission control
User namespace remapping	—userns-remap	Preventative	Daemon config
Syscall filtering	seccomp, AppArmor, SELinux	Preventative	Daemon config
Network policies	K8s NetworkPolicy	Preventative	Runtime

Conclusion

Key Takeaways

Image scanning catches known vulnerabilities; runtime monitoring catches anomalous behavior
Pin base images to digests, not tags, to prevent supply chain drift
Run containers as non-root with read-only filesystems to limit container escape blast radius
Cosign signatures prevent unauthorized images from reaching your cluster
Falco complements scanning by detecting post-deployment anomalies

Container Security Checklist

# 1. Scan every image in CI, block on HIGH/CRITICAL
trivy image --exit-code 1 --severity HIGH,CRITICAL myregistry/myapp:$GIT_COMMIT

# 2. Pin base images to digest
FROM node@sha256:abc123... AS builder

# 3. Build as non-root
RUN addgroup -S appgroup && adduser -S appuser -G appgroup
USER appuser

# 4. Enforce read-only root filesystem
securityContext:
  readOnlyRootFilesystem: true

# 5. Sign images with Cosign
cosign sign --key cosign.key myregistry/myapp:$GIT_COMMIT

# 6. Verify signatures in Kubernetes with Kyverno
kubectl apply -f kyverno-policy-require-signed-images.yaml

# 7. Deploy Falco as DaemonSet
helm install falco falcosecurity/falco -n falco --create-namespace

For more on securing Kubernetes workloads, see Network Security. For secrets handling, see Secrets Management.