Sebastian Raschka
2016-02-19 16:36:29 UTC
Hi, Stelios,
I am wondering, how did you implement this tweak? Just a thought, but instead of adding extra functionality inside the GridSearch class, what about using a random training data selector (transformer) as a pipeline object? Something along the lines of
class RandomRowSelector(object):
def __init__(self):
pass
def _some_random_sampling_function(self, X, y)
def transform(self, X, y):
sampled_rows = self.some_random_sampling_function(self, X, y)
return X[sampled_rows, :], y[sampled_rows, :]
def fit(self, X, y=None):
return self
Best,
Sebastian
I am wondering, how did you implement this tweak? Just a thought, but instead of adding extra functionality inside the GridSearch class, what about using a random training data selector (transformer) as a pipeline object? Something along the lines of
class RandomRowSelector(object):
def __init__(self):
pass
def _some_random_sampling_function(self, X, y)
def transform(self, X, y):
sampled_rows = self.some_random_sampling_function(self, X, y)
return X[sampled_rows, :], y[sampled_rows, :]
def fit(self, X, y=None):
return self
Best,
Sebastian
Hi everyone,
I was thinking to implement a tweak where it is possible to sample randomly from a dataset when using grid search. This would particularly useful for big datasets. The sampling takes place during each round of grid search.
Does anyone think this would be worthy submitting to scikit-learn?
Best regards,
Stelios
------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140_______________________________________________
Scikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
I was thinking to implement a tweak where it is possible to sample randomly from a dataset when using grid search. This would particularly useful for big datasets. The sampling takes place during each round of grid search.
Does anyone think this would be worthy submitting to scikit-learn?
Best regards,
Stelios
------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140_______________________________________________
Scikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general