This topic created in 3131 days ago, the information mentioned may be changed or developed.
因为全文的 diff 需要较大运算量,希望不遍历全文就可以估计差异百分比之类的,用来判断只是在现有文件上进行了较小的改动,还是几乎是全新的内容。有较好的算法吗?
1 replies • 2017-10-12 18:42:56 +08:00
 |
|
1
holajamc Oct 12, 2017 1
simhash minhash ?
|