Skip to main content

Trie - Question 1

 Leetcode 421 Maximum XOR of Two Numbers in an Array

Given an integer array nums, return the maximum result of nums[i] XOR nums[j], where 0 <= i <= j < n.

Constraints:

1 <= nums.length <= 2 * 10^5

0 <= nums[i] <= 2^31 - 1


Analysis:

The naïve method is to use a two-layer for loop, to check every pair of numbers. The total time complexity is O(N^2), which  is too large since N could be 2*10^5.

There are methods to using the bit-wise operations directly, which will be discussed in the Bit Manipulation post. Here we will focus on the method using trie.

The key ideas are:

1. for each number, there are 32 bits. For each bit, only two possible values are available; thus we can build a binary trie to store all the numbers;

2. once the trie is built (the height is 32), we can iterate array, for each number we can check its 32 bits along with the trie layer by layer (32 layers). If the bit for the number is 1, we want to find the node with a bit value of 0 (why?  because we are search for 'exclusive or', and 1^0 gives 1. If the node does not exist, then this bit has to be 0. Similar argument can be applied to the case that the bit value is 0.

3. since we are looking for maximum, we need to start with the most significant bit.

4. since all the numbers' bits are stored in the trie,  for each number, we just need to check 32 bits of the number vs 32 layer of the trie. Or in 32 steps, we can obtain the maximum xor results between the current number with all the numbers in the array. Thus, the overall time complexity is O(32*N). 

See the code bleow:


class Solution {
struct node {
    vector<node*> ns;
    node() {
        ns.resize(2);
    }
};
public:
    int findMaximumXOR(vector<int>& nums) {
        node* root = new node();
        buildTree(root, nums);
        int res = INT_MIN;
        for(auto &a : nums) {
            res = max(res, findMax(a, root));
        }
        return res;
    }
private:
    void buildTree(node* root, vector<int>& nums) {
        for(auto &a : nums) {
            node* p = root;
            for(int i=31; i>=0; --i) {
                if(a>>i & 1) {
                    if(!p->ns[1]) p->ns[1] = new node();
                    p = p->ns[1];
                }
                else {
                    if(!p->ns[0]) p->ns[0] = new node();
                    p = p->ns[0];
                }
            }
        }
    }
    int findMax(int val, node* root) {
        int res = 0;
        node* p = root;
        for(int i=31; i>=0; --i) {
            if(val>>i & 1) {
                if(p->ns[0]) {
                    res |= 1<<i;
                    p = p->ns[0];
                }
                else p = p->ns[1];
            }
            else {
                if(p->ns[1]) {
                    res |= 1<<i;
                    p = p->ns[1];
                }
                else p = p->ns[0];
            }
        }
        return res;
    }
};


Upper Layer

Comments

Popular posts from this blog

Sweep Line

Sweep (or scanning) line algorithm is very efficient for some specific questions involving discrete intervals. The intervals could be the lasting time of events, or the width of a building or an abstract square, etc. In the scanning line algorithm, we usually need to distinguish the start and the end of an interval. After the labeling of the starts and ends, we can sort them together based on the values of the starts and ends. Thus, if there are N intervals in total, we will have 2*N data points (since each interval will contribute 2). The sorting becomes the most time-consuming step, which is O(2N*log(2N) ~ O(N*logN). After the sorting, we usually can run a linear sweep for all the data points. If the data point is labeled as a starting point, it means a new interval is in the processing; when an ending time is reached, it means one of the interval has ended. In such direct way, we can easily figure out how many intervals are in the processes. Other related information can also be obt...

Bit Manipulation - Example

  Leetcode 136 Single Number Given a non-empty array of integers nums, every element appears twice except for one. Find that single one. You must implement a solution with a linear runtime complexity and use only constant extra space. Constraints: 1 <= nums.length <= 3 * 10^4 -3 * 10^4 <= nums[i] <= 3 * 10^4 Each element in the array appears twice except for one element which appears only once. Analysis: If there is no space limitation, this question can be solved by counting easily. But counting requires additional space. Here we can use xor (^) operation based on some interesting observations:  A^A = 0, here A is any number A^0 = A, here A is any number Since all the number appears twice except one, then all the number appear even numbers will be cancelled out, and only the number appears one time is left, which is what we want. See the code below: class Solution { public: int singleNumber(vector<int>& nums) { int res = 0; for(auto ...

Algorithm Advance Outline

Who wants to read these notes? 1. the one who wants to learn algorithm; 2. the one who has a planned interview in a short amount of time, such as one month later; 3. purely out of curiosity; 4. all the rest. The primary purpose for these posts is to help anyone who wants to learn algorithm in a short amount of time and be ready for coding interviews with tech companies, such as, Amazon, Facebook, Microsoft, etc. Before you start, you are expected to already know:  1. the fundamentals of at least one programming language; 2. the fundamentals of data structures. If you do not have these basics, it is better to "google and read" some intro docs, which should NOT take large amount of time. Of course, you can always learn whenever see a "unknown" data structure or line in the codes. Remember, "google and read" is always one of keys to learning. Common algorithms: 1. Recursion 2. Binary search 3. Dynamic programming 4. Breadth-first search 5. Depth-first search...