Should the faster countmap for CategoricalArray sit in this repo? #340
Comments
That's a difficult question. I don't think CategoricalArrays should depend on StatsBase given that StatsBase depends on other packages like SpecialFunctions, which isn't pure Julia. OTOH StatsBase could depend on CategoricalArrays since it doesn't add new dependencies (except that we are about to add a dependency on JSON, which is unfortunate but needed for now, see JuliaData/CategoricalArrays.jl#96). I'm not sure how others feel about this. See also #228. For now my approach has been to support CategoricalArrays in FreqTables. Maybe we should move counting functionality there, and include FreqTables in the Stats meta-package we've been talking about for a long time. The advantage of that approach is that people wouldn't be led to believe we only provide very basic functions like |
How about a CategoricalArraysTools.jl package where things like faster sort and countmap can go? |
Well, these functions are not only for |
I mean there are going to be sort and countmap methods that are specialised for CategoricalArrays |
Yes, but since these functions also support other types it would be weird to have to load a package which has "CategoricalArray" in its name. |
I don't mean all fast sorting methods. Just the methods that have signature That would work I think. |
Then you'd have the slow methods by default, unless you do |
That's a good point. So we should have them in CategoricalArrays |
For reference, optimized methods can now be implemented in StatsBase using |
Should the faster countmap for CategoricalArray sit in this repo? Or in CategoricalArrays.jl? Arguments can be made for both. But I am leaning towards CategoricalArrays.jl, so that CategoricalArrays.jl should not impact StatsBase.jl.