Any RL package will have be heavily focused on non-iid data (timeseries,
effecting/interacting with the environment it is operating in. I agree with
data/these models (RNNs specifically) all that well. The models used are
connections, losses, and so on. All the things that make it really hard to
just load an object, fit, and predict.
It is a hard problem, and a good RL package would be useful. PyRL (
to do specifically, hard to do generically". The generic case usually blows
are already dealing with at least one DSL (Theano/Tensorflow) if not more.
Post by Gael VaroquauxPardon me if I am saying something stupid, but isn't Theano/Tensorflow
about deep learning and not reinforcement learning. RL can be done with
deep learning, but it's more than that, and I suspect that it requires a
different API, in particular with the notion of actions.
G
You mean a scikit-like interface to Theano/Tensorflow? Thatâs actually
what skflow intends to do.
Post by Nadim FarhatI was just thinking the same but , how about just making pipelines to
Theano , TensorFlow ?
Post by Nadim FarhatI am not a core developer and thus really canât comment about the
scope of scikit-learn here :P. But I am a curious about how to implement it
in scikit-learn efficiently. I think an implementation based on Theano or
TensorFlow may be a better place for such a module (maybe skflow, which has
a scikit-like API https://github.com/tensorflow/skflow?)
Post by Nadim FarhatOn Mar 2, 2016, at 2:21 PM, MichaÅ Koziarski <
Hello everyone,
As far as I can tell, except PyBrain (which doesn't seem to be
actively developed) there are no reinforcement learning libraries in
Python. I was wondering if community would be interested in using one and
making it a part of scikit-learn. Does it lie within the scope of the
project?
Post by Nadim Farhat- to design common interface, similar to what is used in other parts
of scikit-sklearn;
Post by Nadim Farhat- to implement established RL algorithms, reliant heavily on
estimators available in scikit-learn;
Post by Nadim Farhat- and to prepare practical examples of what RL can be used for, to
both supplement documentation and encourage people not yet familiar with RL
to experiment with it in their own projects.
Post by Nadim FarhatOnce again, I would mostly like to know whether it event lies within
the scope of the project, or if it just won't be added because of project
philosophy. Other than that, I would obviously appreciate any feedback.
Post by Nadim FarhatAbout me: I am a last master's CS student. My research interests
involve machine learning in general and reinforcement learning in
particular; this year I hope to start my PhD on the latter. My master's
thesis revolves around transfer learning in RL. I have experience with
programming in industry and on large projects.
------------------------------------------------------------------------------
Post by Nadim FarhatSite24x7 APM Insight: Get Deep Visibility into Application
Performance
Post by Nadim FarhatAPM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140_______________________________________________
Post by Nadim FarhatScikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
------------------------------------------------------------------------------
Post by Nadim FarhatSite24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140
_______________________________________________
Scikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
------------------------------------------------------------------------------
Post by Nadim FarhatSite24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140_______________________________________________
Post by Nadim FarhatScikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140
_______________________________________________
Scikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
--
Gael Varoquaux
Researcher, INRIA Parietal
NeuroSpin/CEA Saclay , Bat 145, 91191 Gif-sur-Yvette France
Phone: ++ 33-1-69-08-79-68
http://gael-varoquaux.info http://twitter.com/GaelVaroquaux
------------------------------------------------------------------------------
Site24x7 APM Insight: Get Deep Visibility into Application Performance
APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month
Monitor end-to-end web transactions and take corrective actions now
Troubleshoot faster and improve end-user experience. Signup Now!
http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140
_______________________________________________
Scikit-learn-general mailing list
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general