Implement HOMER, ERFH and MLRF algorithms #81

Merged 19 commits on Oct 16, 2023

agkphysics
Contributor

This work was completed in 2018 as a group project for the course CS760 at the University of Auckland, under the supervision of @joergwicker. I finally got around to cleaning up and fixing the code, five years later.

We implemented and tested four multi-label learning (MLL) algorithms, although only three are in this pull request because the fourth isn't working properly.

The three algorithms in the PR are HOMER [1], ERFH [2], and MLRF [3].

I have left this as a draft PR to discuss the utility of adding these algorithms before actually merging the code.

References

[1] G. Tsoumakas, I. Katakis, and I. Vlahavas, “Effective and efficient multilabel classification in domains with large number of labels,” in Proc. ECML/PKDD 2008 Workshop on Mining Multidimensional Data (MMD’08), 2008, pp. 53–59.
[2] J. Li, Y. Zheng, C. Han, Q. Wu, and J. Chen, “Extremely Randomized Forest with Hierarchy of Multi-label Classifiers,” in Intelligence Science and Big Data Engineering, Y. Sun, H. Lu, L. Zhang, J. Yang, and H. Huang, Eds., in Lecture Notes in Computer Science. Cham: Springer International Publishing, 2017, pp. 450–460. doi: 10.1007/978-3-319-67777-4_40.
[3] H. Elghazel, A. Aussem, O. Gharroudi, and W. Saadaoui, “Ensemble multi-label text categorization based on rotation forest and latent semantic indexing,” Expert Systems with Applications, vol. 57, pp. 1–11, Sep. 2016, doi: 10.1016/j.eswa.2016.03.041.

agkphysics and others added 19 commits October 9, 2023 19:02
…R. Changed subset clustering algorithm to use culled instances at each node.
…red to labels. Changed params of RandomForest for ERFH.
@fracpete
Member

Haven't seen any new methods for a while, so fine by me.

@fracpete fracpete marked this pull request as ready for review October 16, 2023 01:41
@fracpete fracpete merged commit f6d1bca into Waikato:master Oct 16, 2023
1 check failed
@fracpete
Member

Reformatted the code and added JUnit tests. MLRF is still failing its unit tests. You can reproduce this by running the default classifier setup on the Music dataset. It fails at line 200 with java.lang.ArrayIndexOutOfBoundsException: Index 7 out of bounds for length 7.

fracpete added a commit that referenced this pull request Aug 25, 2024
Fix MLRF parameters. Follow up to #81