Summary: When I tried some SIMD optimizations in a large-scale (HPC) simulation, they proved very difficult to implement. Since there is no easy way to change the simu...
posted @ 2015-12-16 13:38 水木龙助 Views(229) Comments(0) Recommended(0)
Summary: * Stack Overflow: How does BLAS get such extreme performance? * How to optimize GEMM: http://wiki.cs.utexas.edu/rvdg/HowToOptimizeGemm/ * ulmBLAS: http://...
posted @ 2015-12-10 07:03 水木龙助 Views(217) Comments(0) Recommended(0)
Summary: Source: https://computing.llnl.gov Factors that determine a large-scale program's performance * Application-related factors: * algorith...
posted @ 2015-12-08 08:15 水木龙助 Views(452) Comments(0) Recommended(0)
Summary: Technologies: OS, CPU cache, NUMA structure, memory access
posted @ 2015-12-02 11:38 水木龙助 Views(153) Comments(0) Recommended(0)
Summary: Parallel processing shell script. Oct 19 Q: How to schedule multiple CPUs on each event?
posted @ 2015-10-20 07:01 水木龙助 Views(350) Comments(0) Recommended(0)
Summary: https://www.physics.ohio-state.edu/~wilkins/computing/HDF/hdf5tutorial/index.html
posted @ 2015-10-14 10:02 水木龙助 Views(320) Comments(0) Recommended(0)
Summary: POSIX Threads Programming; Beej's Guide to Unix IPC; The GNU C Library: Virtual Memory Allocation and Paging; The Linux Programming Guide
posted @ 2015-09-30 09:24 水木龙助 Views(135) Comments(0) Recommended(0)