B: Cuckoo Hashing

One of the most fundamental data structure problems is the dictionary problem: given a set D of words you want to be able to quickly determine if any given query string q is present in the dictionary D or not. Hashing is a well-known solution for the problem. The idea is to create a function h : $sum^{{ast}}_{}$ [0..n - 1] from all strings to the integer range 0, 1,..., n - 1 , i.e. you describe a fast deterministic program which takes a string as input and outputs an integer between 0 and n - 1 . Next you allocate an empty hash table T of size n and for each word w in D , you set T[h(w)] = w . Thus, given a query string q , you only need to calculate h(q) and see if T[h(q)] equals q , to determine if q is in the dictionary. Seems simple enough, but aren't we forgetting something? Of course, what if two words in D map to the same location in the table? This phenomenon, called collision, happens fairly often (remember the Birthday paradox: in a class of 24 pupils there is more than 50% chance that two of them share birthday). On average you will only be able to put roughly -sized dictionaries into the table without getting collisions, quite poor space usage!

A stronger variant is Cuckoo Hashing (Cuckoo Hashing was suggested by the danes R. Pagh and F. F. Rödler in 2001). The idea is to use two hash functions h₁ and h₂ . Thus each string maps to two positions in the table. A query string q is now handled as follows: you compute both h₁(q) and h₂(q) , and if T[h₁(q)] = q , or T[h₂(q)] = q , you conclude that q is in D . The name ``Cuckoo Hashing" stems from the process of creating the table. Initially you have an empty table. You iterate over the words d in D , and insert them one by one. If T[h₁(d )] is free, you set T[h₁(d )] = d . Otherwise if T[h₂(d )] is free, you set T[h₂(d )] = d . If both are occupied however, just like the cuckoo with other birds' eggs, you evict the word r in T[h₁(d )] and set T[h₁(d )] = d . Next you put r back into the table in its alternative place (and if that entry was already occupied you evict that word and move it to its alternative place, and so on). Of course, we may end up in an infinite loop here, in which case we need to rebuild the table with other choices of hash functions. The good news is that this will not happen with great probability even if D contains up to n/2 words!

Input

On the first line of input is a single positive integer 1t50 specifying the number of test cases to follow. Each test case begins with two positive integers 1mn10000 on a line of itself, m telling the number of words in the dictionary and n the size of the hash table in the test case. Next follow m lines of which the i :th describes the i :th word d_i in the dictionary D by two non-negative integers h₁(d_i) and h₂(d_i) less than n giving the two hash function values of the word d_i . The two values may be identical.

Output

For each test case there should be exactly one line of output either containing the string ``successful hashing" if it is possible to insert all words in the given order into the table, or the string ``rehash necessary" if it is impossible.

Sample Input

Sample Output

successful hashing 
rehash necessary

題目描述：

一個字典的 hash，每個單詞只有兩種 hash 值，問能不能得完全匹配。

題目解法：

maxflow 一直 TLE 的情況下，實施了匈牙利算法。

#include <stdio.h>
#include <string.h>
#include <math.h>
#include <algorithm>
#include <queue>
using namespace std;
struct Node {
    int y;
    int next;
} edge[100005];
int e, head[10005];
void addEdge(int x, int y) {
    edge[e].y = y;
    edge[e].next = head[x], head[x] = e++;
}
int mx[10005], my[10005], used[10005];
int dfs(int now) {
    int i, x;
    for(i = head[now]; i != -1; i = edge[i].next) {
        x = edge[i].y;
        if(!used[x]) {
            used[x] = 1;
            if(my[x] == -1 || dfs(my[x])) {
                mx[now] = x, my[x] = now;
                return 1;
            }
        }
    }
    return 0;
}
int main() {
    int testcase, n, m;
    int i, j, x, y;
    scanf("%d", &testcase);
    while(testcase--) {
        scanf("%d %d", &n, &m);
        e = 0;
        memset(head, -1, sizeof(head));
        for(i = 0; i < n; i++) {
           scanf("%d %d", &x, &y);
           addEdge(i, x);
           addEdge(i, y);
        }
        memset(mx, -1, sizeof(mx));
        memset(my, -1, sizeof(my));
        int match = 0;
        for(i = 0; i < n; i++) {
            if(mx[i] == -1) {
                memset(used, 0, sizeof(used));
                if(dfs(i))
                    match++;
                else
                    break;// cut condition.
            }
        }
        puts(match == n ? "successful hashing" : "rehash necessary");
    }
    return 0;
}

我要檢舉

#11363#Cuckoo Hashing#匈牙利

台長： Morris

您可能對以下文章有興趣

[UVA] 10371 - Time Zones

[UVA][二分貪婪] 1199 - Elevator Stopping Plan

[UVA] 957 - Popes

[UVA][搜索] 11283 - Playing Boggle

人氣(1,369) | 回應(0)| 推薦 (0)| 收藏 (0)| 轉寄
全站分類: 教育學習(進修、留學、學術研究、教育概況) | 個人分類: UVA |
此分類下一篇:[UVA][離散化、Dinic] 11358 - Faster Processing Feasibility
此分類上一篇:[UVA][SCC、暴搜] 11390 - The Sultan's Feast

回應(0)