Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle the multi-label task? #7

Open
hsiangsky opened this issue Dec 27, 2014 · 9 comments
Open

Handle the multi-label task? #7

hsiangsky opened this issue Dec 27, 2014 · 9 comments

Comments

@hsiangsky
Copy link

Hi Chou,

It seems that the libdnn couldn't handle the multi-label task.
Would you like to add this new feature in the future?

@poweic
Copy link
Owner

poweic commented Jan 3, 2015

Sounds like a good idea !!
I'll add it in the future release. Thanks : )

@supergrover
Copy link

I'm interested in this too. Have you already worked on this? If not, I forked your repository and am working on it right now.

@poweic
Copy link
Owner

poweic commented Jan 4, 2015

That's great !! I haven't started yet.

Here's my idea. It's rough and maybe you can help me with this.
Originally, the label comes along with the feature. Like this:

12 1:1 2:0.5 10:0.7 ...

where 12 is the label, and the rest 1:1 2:0.5 10:0.7 ... represent the feature.

For multi-labeled feature, I was thinking about providing another label file. Like this:

# feature file
1:1 2:0.5 10:0.7 ...

where 12 is missing and 1:1 2:0.5 10:0.7 ... still represent the feature.

# label file
12 15 17

where the above feature not only labeled 12, but also 15 and 17.

Besides, nn-train and class BasicStream should tell the difference between them and ask user to provide an additional label file if it's multi-labeled.

Any suggestion?

@supergrover
Copy link

That was pretty much what I had in mind. I will work out some stuff (still reading in on your code), and will come back on this.

@poweic
Copy link
Owner

poweic commented Jan 8, 2015

Hi~ @supergrover, @hsiangsky
Have you guys started yet ?

I almost finish the support of multi-label.
Lots of refactoring in src/data-io.cpp, src/dataset.cpp, include/data-io.h and include/dataset.h.
Guess it's going to be cleaner.

@supergrover
Copy link

@botonchou working on it now

@poweic
Copy link
Owner

poweic commented Feb 1, 2015

Because my data IO class sucks, I decided to refactor it first.
it's on another unpublished branch. (it's more readable and cleaner now)

Most of the functions needed in multi-class classification are done.
(Except for the measurement of multiclass accuracy.)
I'm going to publish it in about a week.

Do you want to start from that branch ?

@supergrover
Copy link

Yes, please push. I will play around with it, although for consistency I think you should finish it

@poweic
Copy link
Owner

poweic commented Feb 1, 2015

It's on branch feature/multi-label. Thanks : )

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants