Skip to content

HanLP语义相似度,希望可以输出句子的embedding以便做存储,提高效率 #1792

Open
@yuxulingche

Description

@yuxulingche

Describe the feature and the current behavior/state.
当前使用sts,输入两个句子,对于大量句子比较,效率太低,虽然可以batch来做,但效率还是不够

Will this change the current api? How?
可以在sts里增加一个输出

Who will benefit with this feature?
sts使用者

Are you willing to contribute it (Yes/No):
No

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04):
  • Python version:
  • HanLP version:

Any other info
HanLP语义相似度比较的效果不错,非常感谢作者的贡献,但现在有大量句子需要比较,希望HanLP能增加输出句子embedding的功能,先存储,使用时算cos距离,提高实际使用中的比较效率

  • I've carefully completed this form.

Metadata

Metadata

Assignees

Labels

feature requestSuggest an idea for this project

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions