HashTable源码阅读

环境jdk1.8.0_121

与HashMap有几点区别（不了解HashMap的具体实现，看我另个博客http://www.cnblogs.com/dj3839/p/8111675.html）

在HashMap中，冲突的值会在bucket形成链表，当达到8个，会形成红黑树，而在HashTable中，冲突的值就以链表的形式存储

    public synchronized V put(K key, V value) {
        // Make sure the value is not null
        if (value == null) {
            throw new NullPointerException();
        }

        // Makes sure the key is not already in the hashtable.
        Entry<?,?> tab[] = table;
        int hash = key.hashCode();
        int index = (hash & 0x7FFFFFFF) % tab.length;
        @SuppressWarnings("unchecked")
        Entry<K,V> entry = (Entry<K,V>)tab[index];
        for(; entry != null ; entry = entry.next) {
            if ((entry.hash == hash) && entry.key.equals(key)) {
                V old = entry.value;
                entry.value = value;
                return old;
            }
        }

        addEntry(hash, key, value, index);
        return null;
    }

会发现求索引的方式也不一样，(hash&0x7FFFFFFF)%tab.length，而在HashMap中是(hash^(hash>>>16))&(tab.length-1)，可以看出HashTable里，并没有做出相应的优化，这边解释下HashMap中的优化，(hash^(hash>>>16))这一步是其实是让一个hash值的高16位和低16位做异或，混合高位和低位，加大低位的随机性，(hash^(hash>>>16))&(tab.length-1)求与，其实就是相当于HashTable中的取模，只是在你计算机中用位预算效率比较高，当然tab.length在HashMap中其实是一个2的n次方，所以能达到这一的效果。

还有一点，可以看到HashTable中是不允许放值为Null的value，它会抛出错误。而且key值也不能为null，因为它直接拿key.hashCode()，null是拿不到hashCode也会发生错误。

继续看addEntry，开始添加元素

    private void addEntry(int hash, K key, V value, int index) {
        modCount++;

        Entry<?,?> tab[] = table;
        if (count >= threshold) {
            // Rehash the table if the threshold is exceeded
            rehash();

            tab = table;
            hash = key.hashCode();
            index = (hash & 0x7FFFFFFF) % tab.length;
        }

        // Creates the new entry.
        @SuppressWarnings("unchecked")
        Entry<K,V> e = (Entry<K,V>) tab[index];
        tab[index] = new Entry<>(hash, key, value, e);
        count++;
    }

代码非常简洁，如果数量大于限定值，就开始扩充，重新计算索引位置，然后插入

先看插入

tab[index] = new Entry<>(hash, key, value, e);

在创建entry的时候，传了个bucket的第一个entry，

        protected Entry(int hash, K key, V value, Entry<K,V> next) {
            this.hash = hash;
            this.key =  key;
            this.value = value;
            this.next = next;
        }

看构造函数其实可以看出，在这里进行指向旧的第一个entry，因此，在hashtable中其实是插入在链表的头，而在HashMap是在尾

然后我们在看它的rehash

    protected void rehash() {
        int oldCapacity = table.length;
        Entry<?,?>[] oldMap = table;

        // overflow-conscious code
        int newCapacity = (oldCapacity << 1) + 1;
        if (newCapacity - MAX_ARRAY_SIZE > 0) {
            if (oldCapacity == MAX_ARRAY_SIZE)
                // Keep running with MAX_ARRAY_SIZE buckets
                return;
            newCapacity = MAX_ARRAY_SIZE;
        }
        Entry<?,?>[] newMap = new Entry<?,?>[newCapacity];

        modCount++;
        threshold = (int)Math.min(newCapacity * loadFactor, MAX_ARRAY_SIZE + 1);
        table = newMap;

        for (int i = oldCapacity ; i-- > 0 ;) {
            for (Entry<K,V> old = (Entry<K,V>)oldMap[i] ; old != null ; ) {
                Entry<K,V> e = old;
                old = old.next;

                int index = (e.hash & 0x7FFFFFFF) % newCapacity;
                e.next = (Entry<K,V>)newMap[index];
                newMap[index] = e;
            }
        }
    }

比HashMap简单的多。。。len扩充2*len+1，然后对原来bucket中的entry重新计算索引，并赋值，不改变链表原先的顺序，在HashMap中复杂的多，可以看我另个讲HashMap的博客

而且在hashtable中，调用构造函数时，直接初始化了里面的数组table，而在hashmap中是在进行put操作时，进行初始化，这个操作也在resize中

可以看下HashTable中的初始化方法

    public Hashtable(int initialCapacity, float loadFactor) {
        if (initialCapacity < 0)
            throw new IllegalArgumentException("Illegal Capacity: "+
                                               initialCapacity);
        if (loadFactor <= 0 || Float.isNaN(loadFactor))
            throw new IllegalArgumentException("Illegal Load: "+loadFactor);

        if (initialCapacity==0)
            initialCapacity = 1;
        this.loadFactor = loadFactor;
        table = new Entry<?,?>[initialCapacity];
        threshold = (int)Math.min(initialCapacity * loadFactor, MAX_ARRAY_SIZE + 1);
    }

还有一点，HashTable它的初始化，默认容量len是为11，后面也是2*len+1扩充，而HashMap是16，以后的扩充数量都是len*2，并且我们提供容量大小时，也是会转成一个2的n次方，为什么会有这样的区分，和它计算hash有关，在前面提到了(hash^(hash>>>16))&(tab.length-1)，2^n-1，二进制就是n个1

    public Hashtable() {
        this(11, 0.75f);
    }

但相对HashMap，HashTable是线程安全的，因为在很多方法，比如get,put,equals等，都使用了synchronized同步锁。

总结：

1. HashTable的key、value不能为null

2. HashTable线程安全

3. HashTable的优化其实没有HashMap做的好，在单线程的情况，最好使用HashMap

贴上一句源码中的提示

 * Java Collections Framework</a>.  Unlike the new collection
 * implementations, {@code Hashtable} is synchronized.  If a
 * thread-safe implementation is not needed, it is recommended to use
 * {@link HashMap} in place of {@code Hashtable}.  If a thread-safe
 * highly-concurrent implementation is desired, then it is recommended
 * to use {@link java.util.concurrent.ConcurrentHashMap} in place of
 * {@code Hashtable}.

大致意思就是：不需要线程安全用HashMap，需要线程安全的高并发用ConcurrentHashMap

发表于 2018-01-17 14:47 hongdongjian 阅读(437) 评论(0) 编辑收藏举报

公告