Deep Semantic Hashing Using Pairwise Labels

Authors
Xuan, Richeng; Shim, Junho; Lee, Sang-Goo
Issue Date
Jun-2021
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Keywords
Semantics; Neural networks; Deep learning; Binary codes; Training; Hash functions; Decoding; Information retrieval; natural language processing; semantic hashing
Citation
IEEE ACCESS, v.9, pp 91934 - 91949
Pages
16
Journal Title
IEEE ACCESS
Volume
9
Start Page
91934
End Page
91949
URI
https://scholarworks.sookmyung.ac.kr/handle/2020.sw.sookmyung/146597
DOI
10.1109/ACCESS.2021.3092150
ISSN
2169-3536
Abstract
Data hashing is widely used for approximate similarity search at large scale: hashing represents the original text data as compact binary codes. Recent advances in neural network architectures have demonstrated the effectiveness of this approach and made it possible to learn hash functions more accurately. Most previous studies have focused on encoding explicit supervised features, such as pointwise labels, and, owing to the special nature of textual data, previous semantic text hashing approaches have used only pointwise label information. The goal of learning hash codes is for similar or related texts to receive similar hash codes. Learning a separate label for each datum is the easiest means of achieving this objective, but some inconsistencies remain, whereas pairwise label information reflects similarity more intuitively than pointwise label data. This paper proposes a supervised semantic text hashing method that utilizes pairwise label information. Several different methods based on the variational auto-encoder model are employed to calculate the pairwise similarity of text pairs. Because the similarity calculation requires no additional parameters, the entire learning process is faster and more efficient than in existing methods. Experimental results on public datasets show that the proposed method exploits pairwise label information well enough to outperform previous state-of-the-art hashing approaches. The paper also describes variants that combine different techniques, analyzes their efficiency, and discusses ways to improve it.
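The idea of scoring a text pair from its two codes without any extra parameters can be illustrated with a small sketch. This is not the authors' method: it assumes relaxed codes with entries in [-1, 1] and uses a formulation common in the pairwise hashing literature (a sigmoid of half the inner product, trained with binary cross-entropy against the pairwise label). All function names are hypothetical.

```python
import math


def hamming_distance(a, b):
    """Count the bit positions where two equal-length binary codes differ."""
    if len(a) != len(b):
        raise ValueError("codes must have the same length")
    return sum(x != y for x, y in zip(a, b))


def pairwise_similarity_prob(u, v):
    """Parameter-free similarity of two relaxed codes with entries in [-1, 1].

    Assumption: sigmoid of half the inner product, a common choice in
    pairwise hashing; the paper may compute similarity differently.
    """
    inner = sum(x * y for x, y in zip(u, v))
    return 1.0 / (1.0 + math.exp(-0.5 * inner))


def pairwise_loss(u, v, label):
    """Binary cross-entropy against a pairwise label (1 = similar, 0 = dissimilar)."""
    p = pairwise_similarity_prob(u, v)
    eps = 1e-12  # guards against log(0)
    return -(label * math.log(p + eps) + (1 - label) * math.log(1.0 - p + eps))


def binarize(u):
    """Threshold a relaxed code at zero to obtain the final binary hash code."""
    return [1 if x > 0 else 0 for x in u]


if __name__ == "__main__":
    doc_a = [0.9, -0.8, 0.7, 0.6]
    doc_b = [0.8, -0.9, 0.6, -0.1]
    print(pairwise_loss(doc_a, doc_b, 1))
    print(hamming_distance(binarize(doc_a), binarize(doc_b)))
```

At retrieval time only the binarized codes and the Hamming distance are needed; the similarity probability and loss matter only during training.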
Appears in Collections
ETC > 1. Journal Articles


Related Researcher

Shim, Junho
College of Engineering (Division of Software (Advanced))
