Davide Spataro

Longest Consecutive Sequence

medium · Arrays & Hashing · 26 min read
hash-set array union-find amortized-analysis chain-head

Introduction

An unsorted array of integers arrives, and the question is: what is the longest run of consecutive values — possibly scattered throughout the array — that together cover an interval with no gaps?

The obvious move is to sort and scan. It works in O(n log n) and is the answer most candidates produce. What makes this problem an interview favorite is that there is a better answer — one with amortized O(n) time — and reaching for it requires the right move at the right time. That move is: build a hash set, then only walk forward from chain heads (values x whose predecessor x − 1 is not in the set). Skipping non-heads is what keeps the nested loop from being quadratic.

The trick isn’t the hash set. It’s knowing which elements NOT to start walking from.

If you can’t see immediately why the chain-head filter saves the asymptote, read the amortized analysis in the recommended approach — it’s the whole interview signal. If you can, skip to the union-find treatment: it’s the third lens and it matters when the input is streaming.

What the interviewer is actually testing

  1. Willingness to move past the first idea. Candidates who sort and scan, state O(n log n), and stop are giving the right answer with the wrong complexity. The test is whether you explicitly ask “can we do better?” even after finding a correct solution.
  2. The chain-head insight. The move from “walk from every element” to “walk only from chain heads” is non-obvious. Naming it out loud — “I’ll start only from values whose predecessor is absent” — is the signal.
  3. Amortized reasoning. The hash-set approach has a nested loop and looks like it could be O(n²). Explaining why it’s not — each element is visited at most twice total across all walks — separates mid-level candidates from senior ones.
  4. Tool selection. Knowing when union-find is the right tool (streaming updates, merging regions) versus when it’s overkill (static one-shot) is the senior signal.

Problem statement

Problem

Given an unsorted array of integers A, return the length of the longest subsequence of A whose values form a consecutive range (gaps of exactly 1 between sorted neighbors).

The subsequence need not appear contiguously in A; it just needs to exist as a set of values in A.

Example 1
Input A = [100, 4, 200, 1, 3, 2]
Output 4
The run {1, 2, 3, 4} is present; 100 and 200 are singletons. Longest = 4.
Example 2
Input A = [45, 31, 46, 235, 28, 30, 29, 47]
Output 4
Three disjoint runs: {28, 29, 30, 31} (4), {45, 46, 47} (3), {235} (1).
Example 3
Input A = [1, 2, 2, 3, 0, -1]
Output 5
Duplicates collapse; negatives count; the full run {-1, 0, 1, 2, 3} is present.

Clarifying questions — and why each one matters

Clarification questions

Q Can the array contain duplicates?
A Ask. Usually yes. Duplicates collapse under a set-based solution (idempotent), but they inflate a naive sorted scan if you don't dedupe. This affects which counter you reset on repeats.
Q Can values be negative, or near INT_MIN / INT_MAX?
A Usually yes to negatives. At the int-boundary, walking x + 1 could overflow if you don't guard — rare in interview settings but worth mentioning.
Q Can the input be modified?
A Matters for the sort-scan baseline: if you can't mutate, you either copy or sort indices. The hash-set solution is read-only on the input.
Q What's the upper bound on n?
A Up to 10⁶ is typical. That's well within O(n) budget but suggests the interviewer wants you to beat O(n log n). Hash set is the expected answer.
Q Is the input streaming — do numbers arrive one at a time?
A Different problem shape. Streaming favors union-find: each new number is a singleton; try to merge with x − 1 and x + 1. Keep a running maximum component size.
Q Should we return the sequence itself, or just its length?
A Default is just the length. If the sequence is requested, remember the chain head that produced the winning length, then re-walk to materialize the values. No asymptotic change.

A mental framework — sort is the obvious move; asking “can I do better” is the signal

If the array were sorted, the problem would be trivial: walk the array once, tracking a running-length counter that resets whenever the gap between consecutive sorted values isn’t exactly 1. That gives O(n log n) for the sort plus O(n) for the scan — a correct and clean solution.

The question an interviewer hopes you’ll ask yourself is: do we actually need the elements sorted, or do we just need to know which values are present? Sorting computes far more information than this problem needs; all it requires is set membership: “is value y somewhere in the input?” That’s the O(1) operation a hash set gives you.

Once you have O(1) membership, the shape of the algorithm becomes: for each starting value x, walk forward through x + 1, x + 2, ... as long as each is in the set. The longest walk wins.

The trap: a naive version runs that walk from every starting value. On the input [1, 2, 3, ..., n], starting from 1 walks through all n, starting from 2 walks through n − 1, and so on — O(n²) worst case. The fix is the chain-head filter in the recommended approach.

Discussion

Approach 01

Sort and scan — the obvious baseline

Sort, dedupe, walk the array. Reset the run counter whenever the next element isn't exactly one greater than the previous.

Time O(n log n) · Space O(1) or O(n) · baseline · sort · dedupe

The interview narration

Say this out loud:

“If the array were sorted, I could find the longest consecutive run with a single linear scan — reset a counter on every gap. So: sort first, then scan. O(n log n) for the sort dominates. Let me code it as a baseline and then try to do better.”

How it works, step by step

  1. Sort the array in place (or a copy, if the input is immutable).
  2. Dedupe. Consecutive duplicates would look like a gap of 0, not 1 — they’d reset the counter incorrectly. Removing them with std::unique after sorting keeps the scan clean. Alternatively, special-case “gap of 0 → don’t reset, don’t increment” during the scan.
  3. Linear scan with a running run counter and a best tracker. For each i > 0:
    • If nums[i] == nums[i - 1] + 1: extend the current run. Update best if needed.
    • Otherwise: reset run to 1 (the current element alone).
  4. Return best.

The code

sort-scan.cpp
#include <algorithm>
#include <vector>

int longestConsecutive(std::vector<int> nums) {
  if (nums.empty()) return 0;
  std::sort(nums.begin(), nums.end());
  nums.erase(std::unique(nums.begin(), nums.end()), nums.end());
  int best = 1;
  int run = 1;
  for (std::size_t i = 1; i < nums.size(); ++i) {
    if (nums[i] == nums[i - 1] + 1) {
      ++run;
      if (run > best) best = run;
    } else {
      run = 1;
    }
  }
  return best;
}

Sort + dedupe + single-pass scan. The `std::unique` + `erase` idiom trims consecutive duplicates in place after sort; the scan resets on every gap.

Two small C++ notes worth naming:

  • std::unique + erase. std::unique shifts distinct consecutive values to the front and returns an iterator to the new end; the erase truncates. The two must be paired — unique alone leaves the tail in an unspecified state.
  • Pass by value. Taking std::vector<int> nums by value instead of mutating the caller’s input is the idiomatic move when the algorithm needs to sort but the caller expects the input to be preserved.
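Step 2 above offered an alternative to the dedupe: special-case a gap of 0 during the scan instead of calling std::unique. A sketch of my own (the function name is invented, and the signature conventions match the baseline):

```cpp
#include <algorithm>
#include <vector>

// Variant without std::unique: a gap of 0 (a duplicate) neither resets
// nor extends the run. Same O(n log n) total; the sort still dominates.
int longestConsecutiveNoDedupe(std::vector<int> nums) {
  if (nums.empty()) return 0;
  std::sort(nums.begin(), nums.end());
  int best = 1;
  int run = 1;
  for (std::size_t i = 1; i < nums.size(); ++i) {
    if (nums[i] == nums[i - 1]) continue;  // duplicate: no reset, no increment
    if (nums[i] == nums[i - 1] + 1) {
      ++run;
      best = std::max(best, run);
    } else {
      run = 1;
    }
  }
  return best;
}
```

Either form is fine in an interview; this one trades the `unique` + `erase` idiom for one extra branch in the scan.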

Why it’s O(n log n)

The sort is the bottleneck — a standard comparison sort is Θ(n log n) worst-case. The dedupe is O(n) (one pass), and the final scan is O(n). Total: O(n log n) time. Space is O(1) auxiliary if you sort in place, or O(n) if you copy first.

Why the interviewer expects you to go further

O(n log n) with O(1) extra space is a legitimate answer. But this problem is specifically designed to have a linear-time solution, and “can we do better than sort?” is a prompt you should ask yourself without being told. Name the insight: “we don’t need the elements sorted; we just need set membership.” Then switch to the hash-set approach.

Approach 02

Hash set with chain-head filter Recommended

Put every value in a hash set. For each value x, walk forward only if x is a chain head (x − 1 not in the set). Amortized O(n).

Time O(n) amortized · Space O(n) · hash · amortized · recommended

The insight, stated cleanly

Every element participates in exactly one consecutive chain. If we walk forward from the smallest member of each chain, we traverse each chain exactly once and each element is visited at most twice total. Starting from non-smallest members of the same chain would redo work already covered.

The test for “smallest member of a chain” is stunningly simple: x is a chain head iff x − 1 is not in the set. One hash-set lookup per element decides whether to walk or skip.

[Interactive visualization · chain-head filter] Step through the scan of the input [100, 4, 200, 1, 3, 2]. The small number under each cell is its touch counter — how many times that element has been examined; the amortized O(n) argument says every counter stays ≤ 2 by the end. The widget builds a hash set from the 6 unique values, then scans each one, walking forward only from chain heads (x − 1 not present).

How it works, step by step

  1. Build the set. std::unordered_set<int> present(nums.begin(), nums.end()). This is O(n) expected and deduplicates for free — inserting a duplicate is a no-op.
  2. Scan every unique value. Iterate over the set (any order).
  3. Filter non-heads. For each x, check present.count(x - 1). If yes, this value is mid-chain; some other value in the same chain is the head, and walking from it will cover x. Skip.
  4. Walk from heads. For each chain head, increment a local length counter while x + 1, x + 2, ... are in the set.
  5. Track the best. Update the global maximum after each walk.

The code

hashset-head-filter.cpp
#include <unordered_set>
#include <vector>

int longestConsecutive(const std::vector<int>& nums) {
  std::unordered_set<int> present(nums.begin(), nums.end());
  int best = 0;
  for (const int x : present) {
    // Start a walk only from chain heads (x - 1 not present).
    // This keeps the inner loop from repeating work already covered
    // by a walk that started at a smaller member of the same chain.
    if (present.count(x - 1)) continue;
    int length = 1;
    for (int y = x + 1; present.count(y); ++y) ++length;
    if (length > best) best = length;
  }
  return best;
}

The chain-head filter in a dozen lines. The `continue` after the head check is the entire O(n²) → O(n) trick.

The one comment inside the function names the invariant at the exact line code relies on it. That’s the methodology’s rare-exception case: a subtle correctness point where removal would be a bug.

The amortized argument — why this is O(n), not O(n²)

The structure looks suspicious: an outer loop over every element, and an inner loop that walks forward. A reader who doesn’t see the filter might guess O(n²).

The filter makes it O(n):

  • The outer loop runs n times and does O(1) work per iteration (one set lookup for the head check).
  • The inner walk fires only for chain heads. A chain of length k has exactly one head; the walk from that head does k set lookups.
  • Total inner work across all chains is the sum of the chain lengths, Σ kᵢ = n (every element belongs to exactly one chain, and each gets visited once by the walk of its chain’s head).

Grand total: n outer-loop checks + n inner-walk steps = 2n operations = O(n).

The visualization above shows this via the per-element touch counters: they stay ≤ 2 by the end of every run. One touch from the outer scan, at most one touch from being the target of a forward walk. No element is ever touched more than that, and many are touched just once.
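The ≤ 2 bound is easy to check mechanically. Here is a throwaway instrumented variant of my own (the name and the touch-counting map are not part of the solution itself) that records how often each element is examined, so a caller can assert the bound:

```cpp
#include <unordered_map>
#include <unordered_set>
#include <vector>

// Instrumented walk: touches[v] counts how many times element v is
// examined, charging the outer-scan visit of x to x and each successful
// forward-walk lookup to its target y. The amortized argument predicts
// every counter ends <= 2.
int longestConsecutiveCounted(const std::vector<int>& nums,
                              std::unordered_map<int, int>& touches) {
  std::unordered_set<int> present(nums.begin(), nums.end());
  int best = 0;
  for (const int x : present) {
    ++touches[x];                        // touch 1: examined by the outer scan
    if (present.count(x - 1)) continue;  // head check: O(1), charged to x
    int length = 1;
    for (int y = x + 1; present.count(y); ++y) {
      ++touches[y];                      // touch 2: target of a forward walk
      ++length;
    }
    if (length > best) best = length;
  }
  return best;
}
```

Running it on example 1 reproduces the viz: 2, 3, and 4 end at two touches each; 1, 100, and 200 at one.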

Edge cases the mechanics handle for free

  • Empty array: the set is empty, the outer loop never fires, best stays 0. Correct.
  • Single element: the set has one value; it’s trivially a chain head (predecessor absent), walk length is 1, best = 1.
  • All duplicates: the set collapses to one value. Same as single element.
  • Already a consecutive run: one chain head at the minimum; the walk covers all n values; best = n.
  • Negatives and zero: the x - 1 check works on any int; no special handling needed. Be mindful of x = INT_MIN (computing x − 1 underflows, which is undefined behavior for signed int) — typical inputs never touch the boundary, but mention it as a corner case if pressed.
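If the interviewer does press on that last corner, hardening costs two guards. A defensive sketch of my own (the function name is invented; typical constraints don't require this):

```cpp
#include <climits>
#include <unordered_set>
#include <vector>

// Overflow-safe variant: INT_MIN is always a chain head (x - 1 would
// underflow), and the forward walk stops before y would pass INT_MAX.
int longestConsecutiveSafe(const std::vector<int>& nums) {
  std::unordered_set<int> present(nums.begin(), nums.end());
  int best = 0;
  for (const int x : present) {
    if (x != INT_MIN && present.count(x - 1)) continue;  // not a head
    int length = 1;
    int y = x;
    while (y != INT_MAX && present.count(y + 1)) {
      ++y;
      ++length;
    }
    if (length > best) best = length;
  }
  return best;
}
```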

Why iterate over the set rather than the array

Iterating over the input array means we examine duplicates multiple times (the filter and walk are idempotent, so correctness is preserved, but each duplicate wastes a constant factor). Iterating over the set deduplicates automatically — every unique value is examined exactly once. The performance difference is a small constant, but the code reads cleaner.

Complexity

  • Time: O(n) expected (amortized across the outer scan and inner walks). Set operations are O(1) expected per call.
  • Space: O(n) for the hash set.

Approach 03

Union-find over seen values

For each value x, create a singleton component. Union with x − 1 and x + 1 whenever they are already present. Track the largest component size.

Time O(n · α(n)) · Space O(n) · dsu · streaming · components

When union-find earns its keep

The hash-set solution is simpler, faster in practice, and the right default. Union-find shines in two scenarios:

  1. Streaming input. Numbers arrive one at a time; you must report the current longest run after each arrival. The hash-set approach would need to redo O(n) work per query. Union-find does O(α(n)) — essentially constant — per update.
  2. Composing with DSU-native operations. If the rest of your pipeline uses union-find for other reasons (connectivity, Kruskal-style merging, dynamic equivalence classes), reusing the structure avoids a redundant hash set.

For the static one-shot problem, it’s overkill. Include it in an interview only if asked about streaming or if you want to show a second tool.

How it works, step by step

Use a hash-backed DSU (keys are the integer values themselves; no flat index required). For each incoming value x:

  1. Add x. If x was already seen, skip (idempotent). Otherwise create a singleton component.
  2. Union with x − 1. If x − 1 is in the DSU, unite(x, x − 1) merges the two components and updates the size of the root.
  3. Union with x + 1. Same check, same merge.

After each add and each unite, the DSU tracks the running maximum component size. That’s the answer so far.

The DSU internals

  • Path compression during find (here in its path-halving form) flattens the tree: subsequent finds on any visited node become near-one-hop lookups.
  • Union by size (or by rank) keeps the tree shallow: the smaller component attaches to the larger.

Together, any k operations on an n-element DSU run in O(k · α(n)), where α is the inverse Ackermann function — effectively constant for any realistic n.

The code

union-find.cpp
#include <unordered_map>
#include <utility>
#include <vector>

class DSU {
 public:
  void add(int x) {
    if (parent_.count(x)) return;  // idempotent: duplicates are no-ops
    parent_[x] = x;
    size_[x] = 1;
    if (best_ < 1) best_ = 1;
  }

  bool contains(int x) const { return parent_.count(x) > 0; }

  int find(int x) {
    while (parent_[x] != x) {
      parent_[x] = parent_[parent_[x]];  // path halving
      x = parent_[x];
    }
    return x;
  }

  void unite(int a, int b) {
    int ra = find(a);
    int rb = find(b);
    if (ra == rb) return;
    if (size_[ra] < size_[rb]) std::swap(ra, rb);  // union by size
    parent_[rb] = ra;
    size_[ra] += size_[rb];
    if (size_[ra] > best_) best_ = size_[ra];
  }

  int best() const { return best_; }

 private:
  std::unordered_map<int, int> parent_;
  std::unordered_map<int, int> size_;
  int best_ = 0;
};

int longestConsecutive(const std::vector<int>& nums) {
  DSU dsu;
  for (const int x : nums) {
    dsu.add(x);
    if (dsu.contains(x - 1)) dsu.unite(x, x - 1);
    if (dsu.contains(x + 1)) dsu.unite(x, x + 1);
  }
  return dsu.best();
}

Hash-keyed DSU; each new value tries to merge with its numerical neighbors. The running maximum is a single field updated on every size change.
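To make the streaming claim concrete, here is a compact self-contained sketch of my own — a trimmed hash-keyed DSU with a push method that reports the longest run after every arrival, at per-update (not per-recomputation) cost:

```cpp
#include <unordered_map>
#include <utility>
#include <vector>

// Streaming sketch (mine): push(x) inserts one value and returns the
// current longest consecutive run. Duplicates are no-ops.
struct StreamDSU {
  std::unordered_map<int, int> parent, size;
  int best = 0;

  int find(int x) {
    while (parent[x] != x) {
      parent[x] = parent[parent[x]];  // path halving
      x = parent[x];
    }
    return x;
  }

  void unite(int a, int b) {
    int ra = find(a), rb = find(b);
    if (ra == rb) return;
    if (size[ra] < size[rb]) std::swap(ra, rb);
    parent[rb] = ra;
    size[ra] += size[rb];
    if (size[ra] > best) best = size[ra];
  }

  int push(int x) {
    if (!parent.count(x)) {
      parent[x] = x;
      size[x] = 1;
      if (best < 1) best = 1;
      if (parent.count(x - 1)) unite(x, x - 1);  // merge with left neighbor
      if (parent.count(x + 1)) unite(x, x + 1);  // merge with right neighbor
    }
    return best;
  }
};
```

Feeding example 1 one value at a time yields the running answers 1, 1, 1, 1, 2, 4 — the last merge (inserting 2) bridges {1} and {3, 4} into a component of size 4.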

Complexity

  • Time: O(n · α(n)) — effectively O(n), with a larger constant than the hash-set approach due to path-compression and size-tracking overhead.
  • Space: O(n) for the parent and size maps.

Why the hash set is still the default

Both approaches are asymptotically linear. The hash set has a tighter constant, simpler code (no DSU class), and is more natural to explain in an interview. Mention union-find by name as an alternative — especially for streaming — but lead with the hash-set solution.

Approach comparison

Approach | Time | Space | Pick when…
Sort + scan | O(n log n) | O(1) | Baseline / memory-constrained only.
Hash set + head filter | O(n) | O(n) | Default. Static one-shot problem.
Union-find | O(n · α(n)) | O(n) | Streaming updates, or composing with DSU.

Default answer: hash set with chain-head filter. Name the amortized argument out loud when you close.

Follow-ups to anticipate

The interviewer asks… | What to do
“Return the sequence, not just its length.” | Remember the chain head that produced the best length, then re-walk from it. Two passes, still O(n).
“Numbers arrive one at a time; report the longest run after each.” | Union-find. Each new number is an add plus up to two unite calls, each O(α(n)).
“The array has 10⁹ elements and can’t fit in memory.” | External sort + scan is the natural answer; streaming union-find if the values fit in memory even if the full multiset doesn’t.
“What if the integers are bounded to a small range, say [0, 10⁶]?” | Bit-set instead of hash set: one bit per value, O(1) membership, far better cache locality. Same algorithm.
“How would you solve this without any hash table — say, in a language without one?” | Sort and scan, or union-find with an array-backed DSU keyed on rank-compressed values.
“Multidimensional — longest consecutive run in a matrix where adjacency is 4-connectivity.” | Different problem entirely: this is flood-fill (see Number of Islands).
“Average case for the hash approach?” | Still O(n). The worst case for hash operations is O(n) per lookup on adversarial inputs, but with a well-distributed hash on integers this is vanishingly rare.
“Why not just use a sorted std::set with O(log n) lookups?” | The walk would then be O(L log n) per chain, totalling O(n log n) — no better than sort + scan. The point of the hash set is the O(1) membership check.
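For the first follow-up — materializing the winning sequence rather than its length — a sketch of my own (the function name is invented): track the head that produced the best length, then fill in the run:

```cpp
#include <unordered_set>
#include <vector>

// Return the values of the longest consecutive run, smallest first.
// Same chain-head scan as the recommended approach, plus one re-walk.
std::vector<int> longestConsecutiveRun(const std::vector<int>& nums) {
  std::unordered_set<int> present(nums.begin(), nums.end());
  int bestHead = 0;
  int bestLen = 0;
  for (const int x : present) {
    if (present.count(x - 1)) continue;  // not a chain head
    int len = 1;
    for (int y = x + 1; present.count(y); ++y) ++len;
    if (len > bestLen) {
      bestLen = len;
      bestHead = x;  // remember which head won
    }
  }
  std::vector<int> run;
  run.reserve(bestLen);
  for (int i = 0; i < bestLen; ++i) run.push_back(bestHead + i);  // re-walk
  return run;
}
```

Still two O(n) passes over set-membership checks; the re-walk adds no asymptotic cost.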

Pattern recognition — amortized walks on set membership

The move this chapter teaches — build a hash set; walk forward from filtered starting points so each element is touched a bounded number of times — is a staple of amortized linear-time algorithms. Places it reappears:

  • Next Greater Element (monotonic stack). Each element is pushed and popped at most once; total work is O(n)O(n) despite a nested loop shape.
  • Sliding Window Maximum. A monotonic deque touches each element at most twice — once on entry, once on exit.
  • Kth Largest in Stream. Amortized analysis on heap size bounds the work per insertion.
  • Longest Increasing Subsequence (patience sort). Binary search per element over a sorted pile of tails; total O(n log n).
  • Find-all-duplicates-in-O(1)-space. The index-placement trick visits each element at most twice via a cyclic-swap argument.

The reflex: when a nested loop looks quadratic but “most” inner iterations are short, ask whether each element can be bounded by a constant number of total visits. If yes, the algorithm is amortized linear even though no single iteration proves it.
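The first entry on the list, Next Greater Element, makes the reflex concrete. A sketch of my own: the nested-loop shape is there, but each index enters and leaves the stack at most once, so total work is O(n):

```cpp
#include <stack>
#include <vector>

// Next Greater Element: result[i] is the first value to the right of
// nums[i] that is strictly greater, or -1 if none. Each index is pushed
// and popped at most once, so the "nested" loop is amortized linear.
std::vector<int> nextGreater(const std::vector<int>& nums) {
  std::vector<int> result(nums.size(), -1);
  std::stack<int> waiting;  // indices whose next-greater is still unknown
  for (int i = 0; i < static_cast<int>(nums.size()); ++i) {
    while (!waiting.empty() && nums[waiting.top()] < nums[i]) {
      result[waiting.top()] = nums[i];  // nums[i] resolves everything smaller
      waiting.pop();
    }
    waiting.push(i);
  }
  return result;
}
```

The accounting is the same two-touch argument as the chain-head walk: one touch on push, at most one on pop.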

What to say out loud — the delivery script

Ten beats:

  1. “Let me restate: given an unsorted array, return the length of the longest consecutive-integer subsequence.”
  2. Ask: “duplicates allowed? how large is n? streaming or one-shot?”
  3. “The obvious solution sorts and scans — O(n log n). Let me name it, then try to beat it.”
  4. “We don’t need the elements sorted; we just need to know which values are present. That’s a hash set: O(1) membership.”
  5. “With the set, I can walk forward from each value. But walking from every value is O(n²) on a long run. I need to restrict starting points.”
  6. “Only walk from chain heads: values x where x − 1 is not in the set. That guarantees each chain is traversed once.”
  7. Write the code — about ten lines. Point at the continue and say: “this is the load-bearing line.”
  8. “Complexity argument: the outer loop is O(n); total inner-walk work sums to n across all chains; total 2n = O(n).”
  9. Trace example 1 on the whiteboard. Show that {1, 2, 3, 4} produces exactly one walk from 1, and that 2, 3, and 4 all skip.
  10. “For a streaming variant I’d switch to union-find. For a bounded-range variant I’d use a bitset. For the static problem, this hash-set approach is what I’d ship.”
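Beat 10's bounded-range variant, in sketch form (mine; it assumes values fall in [0, 10⁶] as in the follow-up table — negatives would need an offset):

```cpp
#include <bitset>
#include <vector>

// Bounded-range variant: one bit per value replaces the hash set.
// Same chain-head walk, far better cache locality.
constexpr int kMaxValue = 1'000'000;  // assumed upper bound on values

int longestConsecutiveBitset(const std::vector<int>& nums) {
  static std::bitset<kMaxValue + 2> present;  // +2 so testing v + 1 is safe
  present.reset();
  for (const int v : nums) present.set(v);
  int best = 0;
  for (const int v : nums) {
    if (!present.test(v)) continue;              // chain already consumed
    if (v > 0 && present.test(v - 1)) continue;  // not a chain head
    int length = 0;
    for (int y = v; y <= kMaxValue && present.test(y); ++y) {
      present.reset(y);  // consume, so duplicate heads don't re-walk
      ++length;
    }
    if (length > best) best = length;
  }
  return best;
}
```

Consuming bits as the walk goes keeps duplicates in the input array from re-walking a chain, preserving the amortized bound even without deduplication.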

Summary

Longest Consecutive Sequence looks like a sort-and-scan problem and turns out to have a linear-time answer that hinges on a single filter: only walk from chain heads. The insight — don’t start walking from values whose predecessor is already in the set — is what converts a naive nested loop into amortized linear work.

The wider lesson is the amortized-analysis reflex. Nested loops are not automatically quadratic. When each element is bounded in the number of total visits, the algorithm is linear no matter how the loops nest. That reflex unlocks the hash-set filter here, the monotonic-stack trick in next-greater-element, the sliding-window-max deque, and half a dozen other “O(n) in disguise” patterns across the interview canon.