版权说明 操作指南
首页 > 成果 > 成果详情

A component histogram map based text similarity detection algorithm

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Huang, Huajun;Pang, Shuang;Deng, Qiong;Qin, Jiaohua
通讯作者:
Huang, Huajun(hhj0906@163.com)
作者机构:
[Huang, Huajun; Pang, Shuang; Deng, Qiong; Qin, Jiaohua] College of Computer and Information Engineering, Central South University of Forestry and Technology, 498 Shaoshan South Road, CHangsha, Hunan Province
410004, China
[Huang, Huajun; Pang, Shuang; Deng, Qiong; Qin, Jiaohua] 410004, China
通讯机构:
College of Computer and Information Engineering, Central South University of Forestry and Technology, 498 Shaoshan South Road, CHangsha, Hunan Province, China
语种:
英文
关键词:
Algorithms;Signal detection;Characteristic vectors;Chinese characters;Distance calculation;Distance formula;Jaccard coefficients;Mathematical expressions;Text similarity;Word frequencies;Graphic methods
期刊:
International Journal of Network Security
ISSN:
1816-353X
年:
2015
卷:
17
期:
5
页码:
637-642
机构署名:
本校为第一且通讯机构
院系归属:
计算机与信息工程学院
摘要:
The conventional text similarity detection usually use word frequency vectors to represent texts. But it is high-dimensional and sparse. So in this research, a new text similarity detection algorithm using component histogram map (CHM-TSD) is proposed.This method is based on the mathematical expression of Chinese characters, with which Chinese characters can be split into components. Then each components occurrence frequency will be counted for building the component histogram map (CHM) in a text as text characteristic vector. Four distance formulas are used to find which the best distance for...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com