UUID performance生成uuid的性能问题

项目中需要将原有的id统一改为uuid,于是直接用boost中的uuid库来生成uuid。

但在测试中发现程序性能严重下降,原来每分钟能处理60万条数据,使用uuid后只能处理2万条左右。

于是跟踪之,先看我们的实现方式:

1 static std::string uuid()
2 {
3     boost::uuids::uuid uuid = boost::uuids::random_generator()();
4     return boost::lexical_cast<std::string>(uuid);
5 }
6                 

实测下来,生成uuid的速度大概在1000/s,这太低效了!

 

google之,发现问题可能在于生成uuid时的随机种子:

The requirement for uuid::create to be fast conflicts with the requirement that it needs to create a "universally unique identifier", that is, a high-quality random number. Typically, high-quality randomness is more important than performance, as one rarely needs to create one million UUIDs. One could argue that the high-performance case is adequately addressed by 
the engine-taking overload.

http://boost.2283326.n4.nabble.com/uuid-uuid-create-is-extremely-slow-td2627839.html

 

最后同事改成了这样:

static std::string uuid()
{
        static time_t t = time(NULL);
        boost::mt19937 ran;
        ran.seed(t++);
        boost::uuids::basic_random_generator<boost::mt19937> gen(&ran);
        boost::uuids::uuid u = gen();
        return boost::lexical_cast<std::string>(u);
}

使用自己的随机种子,适当的降低了生成uuid的质量,最后测下来生成uuid的速度大概在 60000/s,还可以接受。

 

我另外还尝试了uuid-dev,首先安装库:

apt-get install uuid-dev

测试代码:

#include <iostream>
#include <uuid/uuid.h>
#include <stdio.h>

using namespace std;


typedef struct _GUID
{
    unsigned long Data1;
    unsigned short Data2;
    unsigned short Data3;
    unsigned char Data4[8];
} GUID, UUID;

std::string GuidToString(const GUID &guid)
{
    char buf[64] = {0};
    snprintf(
                buf,
                sizeof(buf),
                 "{%08X-%04X-%04X-%02X%02X-%02X%02X%02X%02X%02X%02X}",
                guid.Data1, guid.Data2, guid.Data3,
                guid.Data4[0], guid.Data4[1],
                guid.Data4[2], guid.Data4[3],
                guid.Data4[4], guid.Data4[5],
                guid.Data4[6], guid.Data4[7]);
        return std::string(buf);
}

int main(int argc, void** argv)
{
        GUID guid;
        time_t t = time(NULL);
        for(int i=0; i<1000000; i++)
        {
                uuid_generate(reinterpret_cast<unsigned char *>(&guid));
        }
        std::cout<<time(NULL)-t<<std::endl;
}

g++ -o test test.cpp -luuid

测试结果为20万/秒,更快了呵。

 

 -------------------------------------------------------------------------

update:

在多线程环境下,boost生成的uuid会重复,推测可能是随机种子不可重入造成的,也没有深究。

最后还是使用uuid-dev了,实测多线程无此问题。

 

 

 

 

 

 

 

posted @ 2012-08-15 17:20    阅读(3598)  评论(0编辑  收藏  举报