-
@fwmeng88 my understanding is that I'm working on the distance metrics but I'm not sure where to start or what to do. should I work on the |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 2 replies
-
Thank you for the question! The first actual question in our discussion board for this project. Can you please check #4 (comment) and #7 ? These two issues list several distance/similarity/dismimilairty/diversity measurements. This part is also listed in the slides that I have shared (page 9 and 10). You should put your codes in https://github.com/theochem/DiverseSelector/blob/main/DiverseSelector/metric.py. Here is a list of potential useful implementations,
These resources at least can help confirm that some of your implementations are doing the right thing. It is up to you how to set up this module, a function or a class. If you find that are too many small pieces, it's also OK to create a subfolder called metric with an empty Hope this explains things a little. Please feel free to keep posting questions here and tag me. |
Beta Was this translation helpful? Give feedback.
-
Change log: I modified the title a little to avoid some ambiguity. |
Beta Was this translation helpful? Give feedback.
-
@fwmeng88 , looking at https://github.com/theochem/DiverseSelector/blob/1883213331c0782ba2534457a719b6598ade6680/DiverseSelector/metric.py#L41 Is there a reason to use cdist instead of pdist? I think it makes sense to use pdist where we can, as it allows constructing the whole distance matrix in a single shot, which will be useful in many cases. But maybe we should support both "computing the whole distance matrix" and not doing so, (so that kd-tree or similar can be used). Looking at Scipy documentation, it seems like we could probably support all the scipy distance functions as one type of distance, and then support additional distance functions with a somewhat different package. If we use the same interface, when the keyword was a scipy keyword we'd default to scipy, and otherwise we'd default to our own code. (Or we could contribute to scipy, but that tends to be pretty involved.) |
Beta Was this translation helpful? Give feedback.
-
Thank you for the question. I will try to divide them into smaller pieces. why we use
|
Beta Was this translation helpful? Give feedback.
Thank you for the question! The first actual question in our discussion board for this project.
Can you please check #4 (comment) and #7 ? These two issues list several distance/similarity/dismimilairty/diversity measurements. This part is also listed in the slides that I have shared (page 9 and 10).
You should put your codes in https://github.com/theochem/DiverseSelector/blob/main/DiverseSelector/metric.py.
Here is a list of potential useful implementations,
These resources at least can help confirm that some of your implementations ar…