• 1 Post
  • 24 Comments
Joined 4 months ago
Cake day: July 14th, 2024

  • The actual license text in question:

    Data Information: Sufficiently detailed information about the data used to train the system so that a skilled person can build a substantially equivalent system. Data Information shall be made available under OSI-approved terms.

    In particular, this must include: (1) the complete description of all data used for training, including (if used) of unshareable data, disclosing the provenance of the data, its scope and characteristics, how the data was obtained and selected, the labeling procedures, and data processing and filtering methodologies; (2) a listing of all publicly available training data and where to obtain it; and (3) a listing of all training data obtainable from third parties and where to obtain it, including for fee.

    (The rest of the license goes on to talk about weights, etc.)

    I agree with you somewhat. I’m glad that each source does need to be listed and described. I’m less thrilled to see “unshareable” data and data that costs money in there, since I think these could effectively make a model impossible for a “skilled person” to retrain.

    It’s a cheap way to make an AI license without requiring all the training data to be open source (and it dodges the legalities of doing that).


  • Yes, Oracle will reclaim your server if its usage falls under certain thresholds for the resources you’ve signed up for. So it might be better to request fewer resources than you need, but this will somewhat complicate things if you want more resources in the future, since iirc you can’t simply resize.

    One way to get around all of this, though, is to convert to pay as you go (PAYG). PAYG gets the same always free allocations and you only pay for use above that, and Oracle won’t reclaim PAYG instances (at least it hasn’t reclaimed my server in ~4 years). Just set up a budget of $1 and then an alert to email you if you reach 1% of your budget. If you somehow go over your free resources, it’ll tell you.

    Lastly, in some cases Oracle just straight up loses your data or disables your account. As always, practice 3-2-1 backups (don’t rely on the free rotating backups on their servers as your only backup).

    It’s some hoops to jump through, but I was paying $5/month for a DigitalOcean droplet, and the Oracle server has been running for 4 years now. I’ve also scaled up one project and started a few others that wouldn’t have all fit on my droplet. Other than the threat of having my resources reclaimed before I switched to PAYG, I’ve been pretty happy with it.
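
To make the budget alert and the 3-2-1 rule from the comment above concrete, here is a rough Python sketch. It does not call any Oracle API; `alert_amount` and `satisfies_3_2_1` are hypothetical helper names, and the dictionary keys `medium` and `offsite` are assumptions made up for this illustration.

```python
from decimal import Decimal


def alert_amount(budget_usd, threshold_percent):
    """Spend (in USD) at which a percentage-based budget alert would fire."""
    return budget_usd * threshold_percent / Decimal("100")


def satisfies_3_2_1(copies):
    """3-2-1 rule: at least 3 copies, on at least 2 media types, at least 1 offsite.

    Each copy is a dict with hypothetical keys "medium" (str) and "offsite" (bool).
    """
    media = {c["medium"] for c in copies}
    has_offsite = any(c["offsite"] for c in copies)
    return len(copies) >= 3 and len(media) >= 2 and has_offsite


# A $1 budget with a 1% alert fires at one cent of paid usage.
print(alert_amount(Decimal("1"), Decimal("1")))  # 0.01

# The provider's rotating backup alone is a single copy, so it fails the rule.
print(satisfies_3_2_1([{"medium": "cloud", "offsite": True}]))  # False
```

With a budget that small, the alert is effectively a tripwire: any paid usage at all should trigger the email.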


  • The first part of the article talks about how to use git notes and has an example commit, followed by adding the note, and then viewing the note. This is all native git.

    The “problem” is that we have centralized discussions in GitHub/GitLab comments, and if we want to retain that data then we need to convert the comments into git notes. The CLI part is a specific discussion of how Symfony uses git notes to store GitHub comments. It references an internal CLI but then goes through an example of how to use the GitHub API to fetch the comments, create git notes, and then push those git notes to GitHub. So while the Symfony CLI is internal, it looks like we’re given an example of how to do this for GitHub.
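
The workflow described above (fetch comments, write them as a git note, push the notes ref) might look roughly like this in Python. This is a sketch under assumptions, not the article's or Symfony's actual tooling: the notes ref name `github-comments` and all function names are hypothetical, only issue-style comments are fetched (review comments on a pull request live under a different endpoint), and pagination is ignored.

```python
import json
import subprocess
import urllib.request

NOTES_REF = "github-comments"  # hypothetical notes ref name, chosen for this sketch


def fetch_comments(owner, repo, number, token=None):
    """Fetch the first page of issue/PR conversation comments from the GitHub REST API."""
    url = f"https://api.github.com/repos/{owner}/{repo}/issues/{number}/comments"
    req = urllib.request.Request(url, headers={"Accept": "application/vnd.github+json"})
    if token:
        req.add_header("Authorization", f"Bearer {token}")
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def format_note(comments):
    """Render the comments as plain text suitable for a note body."""
    return "\n\n".join(
        f"{c['user']['login']} ({c['created_at']}):\n{c['body']}" for c in comments
    )


def add_note_cmd(commit, text):
    """Native git: attach (or overwrite, via -f) a note on a commit."""
    return ["git", "notes", "--ref", NOTES_REF, "add", "-f", "-m", text, commit]


def show_note_cmd(commit):
    """Native git: print the note attached to a commit."""
    return ["git", "notes", "--ref", NOTES_REF, "show", commit]


def push_notes_cmd(remote="origin"):
    """Notes are not pushed by default, so the ref is named explicitly."""
    return ["git", "push", remote, f"refs/notes/{NOTES_REF}"]


def run(cmd):
    """Execute one of the commands above inside the current repository."""
    subprocess.run(cmd, check=True)
```

A possible usage, inside a clone, would be `run(add_note_cmd(sha, format_note(fetch_comments(owner, repo, number))))` followed by `run(push_notes_cmd())`, where `sha`, `owner`, `repo`, and `number` are values you supply yourself.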