Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The Future #13

Open
schorrm opened this issue Jul 23, 2020 · 11 comments
Open

The Future #13

schorrm opened this issue Jul 23, 2020 · 11 comments
Labels
governance Project governance and management stuff

Comments

@schorrm
Copy link
Owner

schorrm commented Jul 23, 2020

So, a few things I want to ask the community here:

  1. Do we want to go back on to PyPI? If so, what should the new name be?
  2. Do we have particular goals to work on? I've recently been doing a lot of stuff with retrosheet, mainly with the prebuilt parquet databases from boxball, that could be something, absolutely open to ideas. Do we want to add modelling / projections beyond scraping? We could do something like integrate Marcel the Monkey or something similar (perhaps something more advanced?). Hidden Markov?
  3. Any new datasets that have come out that we don't have yet?
@andersonfrailey
Copy link
Contributor

  1. I'd love for this to be up on PyPI. I don't have a name off the top of my head, but I can try and think of something and will post if inspiration strikes.
  2. Adding projections would be a great feature. It's possible there are already some libraries out there we could integrate. I'm personally interested in creating more visualizations as well.

@tqwhitetech
Copy link

1 ) Agree with the person above, definitely should be on PyPi, but I too don't have a cool name.

Off topic a bit, but do we know where we stand with the issues from the original repo? If there are still unresolved issues in there should we get them brought over here so we can start working through them? Its tough to tell what issues are stuck in pull requests and what just never got done.

@schorrm
Copy link
Owner Author

schorrm commented Jul 31, 2020

I think we have almost all of them covered

@schorrm
Copy link
Owner Author

schorrm commented Aug 3, 2020

Given that nobody has a good name idea, what if we did a Pillow kind of thing? Like if we listed it under pybaseball2 or something, so it'd be pip install pybaseball2, but then import pybaseball?

@andersonfrailey
Copy link
Contributor

That works for me

@schorrm schorrm added the governance Project governance and management stuff label Aug 18, 2020
@schorrm schorrm linked a pull request Aug 18, 2020 that will close this issue
@schorrm
Copy link
Owner Author

schorrm commented Aug 20, 2020

This is up as pybaseball2 on pip, the package is still used as pybaseball. I have the CI set up for publishing on a GH Release.

@schorrm
Copy link
Owner Author

schorrm commented Aug 20, 2020

And another idea: what about implementing OpenWAR here?

@andersonfrailey
Copy link
Contributor

I think that'd be a cool feature to have. I'd offer to help, but we're starting a new semester on Tuesday and my free time will be fairly limited.

@schorrm
Copy link
Owner Author

schorrm commented Aug 23, 2020

A major update: James LeDoux reached out to me, and offered to have me take over the original repo.
I think it would be better to do that, we'll maintain the stars etc, and back under the original name.

@schorrm
Copy link
Owner Author

schorrm commented Aug 23, 2020

Obviously, I would migrate all of our improvements over too.

@TAThor
Copy link
Contributor

TAThor commented Oct 24, 2022

During this off-season, should I have the time, I would like to work on some baseball-related machine learning projects, but I am not sure if adding ML functionality directly to this repo makes sense, or if it would make more sense to create a new repo (e.g., pybaseballML) which implements pybaseball but is dedicated to machine learning projects. I'd like some input on which option you guys think is more preferable. I give a brief example project below.

Pitch Prediction
Given the entire pitch-by-pitch history (potentially alongside other informative data) of a pitcher, train a neural network that attempts to predict the next pitch based on the prior pitch sequence (e.g., in the current at-bat, the pitcher has thrown the sequence [1. Fastball 2. Changeup 3. Changeup]. The neural network gives a prediction of what it thinks will be the next pitch).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
governance Project governance and management stuff
Projects
None yet
Development

No branches or pull requests

4 participants