Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sga() and create_binning() have different default nbins #69

Open
jbao opened this issue Feb 7, 2017 · 7 comments
Open

sga() and create_binning() have different default nbins #69

jbao opened this issue Feb 7, 2017 · 7 comments

Comments

@jbao
Copy link

jbao commented Feb 7, 2017

Also, use different defaults for categorical (e.g. 4) and numerical (e.g. 8).

@aravs16
Copy link

aravs16 commented Feb 22, 2017

Hi I can work on this. This is my first contribution. A little guidance would be helpful.

@jbao
Copy link
Author

jbao commented Feb 22, 2017

Hi @aravs16 , thanks for volunteering, this issue is mostly about syncing up the default arguments in

https://github.com/zalando/expan/blob/dev/expan/core/experiment.py#L715

and

https://github.com/zalando/expan/blob/dev/expan/core/binning.py#L732

Ideally we could somehow distinguish between categorical and numerical bins, but if that's not possible, we may also completely leave out the argument nbins, since a random number probably would not make sense. Let us know if anything is unclear;-)

@aravs16
Copy link

aravs16 commented Feb 23, 2017

@jbao Thank you!

Can we make nbins = 8 in both places and then pass nbins//2 on this line:
https://github.com/zalando/expan/blob/dev/expan/core/binning.py#L768 ?

@jbao
Copy link
Author

jbao commented Feb 24, 2017

hmm, I think we should make nbins transparent to the user, and not doing any post-processing within the function itself, what do you think?

@yagudin
Copy link

yagudin commented Mar 17, 2017

Hi @jbao, in my opinion, we should set nbins=None in definitions of sga() and creat_binning(). And add post-processing ≈ if nbins == None: nbins = 8 if is_numeric else 4 followed with a warning.

@gbordyugov
Copy link
Contributor

Hi everyone,

I found using None as a flag for default (as opposite to undefined) value quite pervasive in ExpAn (for example, None in place of KPI names or feature names implies "do all"). Was it a deliberate design decision? What happens to those options where we want None to signalize the absence of information, rather than the intention to use default values?

@jbao
Copy link
Author

jbao commented Mar 20, 2017

I think we should specify the default value explicitly in the argument if we have one (e.g. in the case of nbins), and use None to imply that the function will take all possible input values (e.g. in the case of kpi_subset). What do you think?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants