Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about the unknown type high quality phrases. #37

Open
failable opened this issue Dec 6, 2019 · 1 comment
Open

Questions about the unknown type high quality phrases. #37

failable opened this issue Dec 6, 2019 · 1 comment

Comments

@failable
Copy link

failable commented Dec 6, 2019

Hi, the original paper says

In our AutoNER model, these “unknown” positions have undefined boundary and type losses, be- cause (1) they make the boundary labels unclear; and (2) they have no type labels. Therefore, they are skipped.

Is that mean high quality phrase should not have entity types that we are trying to identify? Otherwise, the model will predict it as Entity Type: None as shown in Figure 2 for 8GB RAM. And if AutoNER is applied to the example of Figure 1, can it and should it identify prostaglandin synthesis as a named entity?

Thanks.

@houking-can
Copy link

houking-can commented Dec 8, 2019

@Isolet
The related work says,

aspect term extraction, which can be viewed as an entity recognition task of a single type for business reviews. As shown in our experiments, our models can outperform Distant-LSTMCRF significantly on the laptop review dataset.

Because laptop review dataset has no entity type for entity mentions, I think there is only a single type for the latop review dataset, ComputerTerm or not. That's why they can be compared with the Aspect Term extraction model

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants