How to Use a Reverse Trie for Fast Disposable Email Domain Detection
Learn how to use a reverse Trie to efficiently detect disposable email domains. Optimize your domain lookups with a scalable, memory-efficient solution tailored for fast and precise results.
Disposable emails can cause issues like fake signups and spam. The user grabs an address from one of thousands of temporary email generators and hands it over. Not even the GOAT email regex can save you here.
Personally, I find having a big list of all disposable email domains is the easiest yet most effective solution. But before you assemble that list and start a for ... of loop to check against it, think of the O(n) complexity!
A great way to identify them is by using a reverse Trie, an efficient data structure for fast lookups.
What Is a Reverse Trie?
First, let's grasp what a Trie is. It is a data structure where strings are:
- chopped up, char per char
- assembled in a tree structure
example, if we feed boa
, bro
, brie
, it would assemble them using Map as:
This approach allows direct lookups without cycling through the entire list. Each character guides the search deeper.
It trades memory for efficiency. The time it takes to find the string does not depend on the size of the list, but on the length of the string!
A reverse Trie stores strings in reverse order, ideal for domains:
mailinator.com
becomesmoc.rotanliam
trashmail.com
becomesmoc.liambhsart
Note on This Implementatin
By reversing domains, searches start at the TLD (e.g., .com
), which is shared across many domains. To optimize further, it stores TLD
s as a single key (com
), rather than splitting them into characters. The rest of the domain follows a standard Trie structure.
Reverse Trie Domains Implementation
Since this is a tree structure, each node will reference its children:
First, a utility function to split the TLD from the rest of the domain:
Using
lastIndexOf
ensures subdomains likefoo.bar.baz.com
are handled correctly.
Next, the constructor will assemble the Trie:
To check if a domain is disposable, traverse the Trie:
Conclusion
Using a reverse Trie offers several benefits:
- Fast Lookups: Traverse characters step-by-step for quick results.
- Memory Efficiency: Common suffixes like
.com
are stored only once. - Scalability: Handles large domain lists effortlessly.
If you’re dealing with disposable emails, this is a smart, scalable solution to implement.
On This Page