Table of Contents

steghide in CTF: How to Hide and Extract Data from Images

執筆者:

カテゴリ:

steghide in CTF has one feature that catches almost every first-time user: the error message for a wrong passphrase is identical to the error message for a file with no embedded data. If you don’t know this going in, you’ll waste time assuming the tool is broken when the real problem is either that there’s nothing there, or that you’re using the wrong passphrase. The picoCTF “Hidden in Plainsight” challenge handed me this confusion directly — and the passphrase was visible from the first command I ran.

The error message trap: steghide’s biggest diagnostic problem

Run steghide with the wrong passphrase on a file that has embedded data:

$ steghide extract -sf img.jpg
Enter passphrase:
steghide: could not extract any data with that passphrase!

Now run steghide on a JPEG with no embedded data at all:

$ steghide extract -sf clean.jpg
Enter passphrase:
steghide: could not extract any data with that passphrase!

The error is identical. steghide cannot tell you whether nothing is there or you have the wrong key — the decryption failure looks the same either way. This is by design (it avoids leaking information about whether data exists), but it means you need a different diagnostic approach. The correct tool is steghide info:

$ steghide info img.jpg
"img.jpg":
  format: jpeg
  capacity: 4.0 KB
Try to get information about embedded data ? (y/n) y
Enter passphrase:
  embedded file "flag.txt":
    size: 34.0 Byte
    encrypted: rijndael-128, cbc
    compressed: yes

A successful info response confirms both that data is present and that the passphrase is correct. If it fails, you either have the wrong passphrase or a clean file. This distinction matters: if you’re confident the file has steganographic content, keep searching for the passphrase. If you’re unsure, move on and try other tools.

picoCTF Hidden in Plainsight: the passphrase was in the first command

The challenge provided a single file: img.jpg. My first attempt was a blank passphrase — a lot of Easy-tier challenges skip passwords entirely.

$ steghide extract -sf img.jpg
Enter passphrase:
steghide: could not extract any data with that passphrase!

Then I tried “password”, “picoctf”, “flag” — standard guesses. All failed with the same message. I ran binwalk:

$ binwalk img.jpg
DECIMAL       HEXADECIMAL     DESCRIPTION
--------------------------------------------------------------------------------
0             0x0             JPEG image data, JFIF standard 1.01

Nothing. Binwalk finding only the JPEG header is expected for steghide-modified files — steghide doesn’t append data, it modifies existing DCT coefficients. Binwalk’s signature scanning finds appended data; it won’t detect steghide content.

About fifteen minutes in, I ran file out of habit:

$ file img.jpg
img.jpg: JPEG image data, JFIF standard 1.01, aspect ratio, density 1x1,
segment length 16, comment: "c3RlZ2hpZGU6Y0VGNmVuZHZjbVE9",
baseline, precision 8, 640x640, components 3

I almost skimmed past it. The file output is dense and I was reading quickly. The word comment stopped me: c3RlZ2hpZGU6Y0VGNmVuZHZjbVE9. That’s base64 — recognizable by the alphanumeric characters and the = padding at the end.

JPEG files have a comment segment in their header. Camera firmware uses it for metadata. Challenge authors use it to hide passphrases.

import base64
print(base64.b64decode("c3RlZ2hpZGU6Y0VGNmVuZHZjbVE9").decode())

steghide:cEF6endvcmQ=

The output split at a colon: the tool name on the left, another base64 string on the right. The challenge was literally naming the tool and encoding the passphrase in two layers.

print(base64.b64decode("cEF6endvcmQ=").decode())

pAzzword

Double-encoded. The passphrase was pAzzword — written to look like “password” but different enough to resist guessing. Extraction:

$ steghide extract -sf img.jpg
Enter passphrase:
wrote extracted data to "flag.txt".

$ cat flag.txt
picoCTF{h1dd3n_1n_1m4g3_67479645}

The passphrase had been visible from the moment I ran file. I spent fifteen minutes guessing passwords that were nowhere near pAzzword before reading the output that already had the answer.

When you don’t have the passphrase: Stegseek

Hidden in Plainsight had a findable passphrase in the metadata. Some challenges don’t — the passphrase is just a password and you need to brute-force it. steghide has no built-in brute force. Use Stegseek instead:

$ stegseek --crack image.jpg /usr/share/wordlists/rockyou.txt
StegSeek 0.6 - https://github.com/RickdeJager/stegseek

[i] Progress: 99.41% (133.4 MB)
[!] error: Could not find a valid passphrase.

# If found:
[i] Found passphrase: "password123"
[i] Original filename: "flag.txt"
[i] Extracting to "image.jpg.out"

Stegseek is roughly 1000x faster than Stegcracker (a common alternative) because it uses a C implementation rather than calling steghide for each attempt. On rockyou.txt with 14 million entries, Stegseek finishes in seconds to minutes. Stegcracker on the same wordlist would take hours.

The decision tree for unknown passphrases:

Check file metadata first: file, exiftool, strings. The passphrase is often hiding in plain sight (literally, in this case).
Try blank passphrase: steghide extract -sf image.jpg with empty input.
Try obvious guesses from challenge context: challenge name, competition name, year.
Stegseek with rockyou.txt if the above fails.
If Stegseek exhausts rockyou.txt: the passphrase is probably not dictionary-crackable — look for other clues in the challenge.

How steghide actually hides data in a JPEG

JPEG compression works by converting image blocks to frequency coefficients using Discrete Cosine Transform (DCT). The resulting DCT coefficient table has many near-zero values that encode fine detail. Steghide selects pairs of these coefficients and swaps them to encode bits — a swap that changes the coefficient values by ±1. At the pixel level, this produces changes of less than 0.4% of the color range, invisible to the human eye and to most statistical analysis.

This is why binwalk finds nothing: there’s no appended data, no new file signature, no structural change to detect. The content is distributed across the existing DCT table. Detecting steghide modifications requires statistical analysis of DCT coefficient distributions — tools like StegExpose do this, but it’s rarely needed in CTF contexts where the challenge tells you steganography is involved.

steghide also supports WAV files, where it modifies audio sample LSBs rather than DCT coefficients. The command syntax is identical. The capacity is determined by file size: a 1MB WAV at 44100 Hz/16-bit/stereo can hold roughly 50KB of embedded data.

Full trial process: Hidden in Plainsight

Step	Command	Result	Time	Outcome
1	Visual inspection	Nothing visible	3 min	Steghide content is invisible by design
2	`steghide extract` (blank)	Error: could not extract	30s	Not blank passphrase
3	Common guesses: “password”, “picoctf”, “flag”	Same error	5 min	Error identical for wrong passphrase AND no data — wasted time
4	`binwalk img.jpg`	Only JPEG header	30s	Expected — steghide doesn’t append data
5	`file img.jpg`	comment: “c3RlZ2hpZGU6Y0VGNmVuZHZjbVE9”	10s	Breakthrough — base64 in JPEG comment field
6	Base64 decode ×2	`pAzzword`	2 min	Double-encoded passphrase revealed
7	`steghide info img.jpg`	flag.txt confirmed, 34 bytes	30s	Verified before extracting
8	`steghide extract -sf img.jpg`	flag.txt extracted	10s	`picoCTF{h1dd3n_1n_1m4g3_67479645}`

steghide vs other stego tools: how I actually decide

Tool	File types	Use when	Don’t use when
steghide	JPEG, WAV	You have a passphrase candidate; checking if data is present	PNG or BMP files — not supported
Stegseek	JPEG	Passphrase unknown; brute force against wordlist	WAV files — JPEG only; passphrase not in rockyou.txt
zsteg	PNG, BMP	PNG LSB analysis; no passphrase needed	JPEG — wrong tool entirely
binwalk	Any	Initial recon; finding appended files with signatures	Detecting steghide content — it modifies existing data, not appends
exiftool	Any	Reading full metadata including EXIF, comments, GPS	Extracting hidden payload — metadata read-only

The format wall is the most common mistake: steghide only works on JPEG and WAV. Running it against a PNG will either fail or produce misleading output. Check the file extension and the file command output before deciding which tool to use.

My pre-extraction checklist for steganography challenges

# 1. Identify format — steghide only works on JPEG and WAV
file target.jpg

# 2. Read ALL metadata — passphrase is often hiding here
file target.jpg          # check comment field
exiftool target.jpg      # full EXIF data
strings target.jpg | grep -v "^.\{1\}$"  # non-trivial strings

# 3. Check if anything is embedded (blank passphrase)
steghide info target.jpg  # enter blank passphrase

# 4. Try context-derived passphrases
steghide extract -sf target.jpg  # enter: challenge name, competition, year

# 5. Brute force if needed
stegseek --crack target.jpg /usr/share/wordlists/rockyou.txt

# 6. If JPEG yields nothing, check for other formats
# PNG → zsteg target.png
# Any format → binwalk target.jpg (look for appended signatures)

Why JPEG metadata is a real attack surface

The Hidden in Plainsight challenge used the JPEG comment field — a segment present in virtually every JPEG file, often containing camera firmware strings, copyright notices, or software version info. In real-world scenarios, this field has been used to exfiltrate data (embedding messages in images uploaded to public servers) and to store configuration data in malware droppers that disguise payloads as image files.

The habit of reading file output carefully — not just confirming the file type but reading every field — translates directly to real forensic analysis. Most analysts run file, see “JPEG image data,” and move on. The comment field is rarely empty in CTF challenges for exactly this reason.

steghide in CTF: How to Hide and Extract Data from Images

The error message trap: steghide’s biggest diagnostic problem

picoCTF Hidden in Plainsight: the passphrase was in the first command

When you don’t have the passphrase: Stegseek

How steghide actually hides data in a JPEG

Full trial process: Hidden in Plainsight

steghide vs other stego tools: how I actually decide

My pre-extraction checklist for steganography challenges

Why JPEG metadata is a real attack surface

Further Reading

コメント

Leave a Reply Cancel reply

投稿をさらに読み込む

Local Authority picoCTF Writeup

Inspect HTML picoCTF Writeup

Wireshark doo dooo do doo picoCTF Writeup

Crack the Gate 1 picoCTF Writeup