Initial Data Producer Guide for VEDA (seeking feedback) #188
siddharth0248 wants to merge 2 commits into main
@siddharth0248 Thank you for contributing! A lot of this guidance is specific to adding data to VEDA instances, especially regarding the workflow. Any VEDA-specific content should go in the VEDA docs. I think much of the data preparation content could be helpful as part of #183. I think we should make a branch for the decision framework, add that content to it, and then iterate on the decision framework from there. How does that sound to you?
Thanks for the PR, @siddharth0248. As currently written, this reads more like documentation of how VEDA makes decisions and of VEDA-specific requirements related to data cataloging. I would lean towards this material, without changes, belonging in https://docs.openveda.cloud/ with a link from our Cookbooks section to it as an example of how one organization makes decisions. The other approach I might suggest is to move this PR to be a cookbook and rework some of the text to explain that this is how VEDA decided to do things and why. I will also note that some of this material is clearly a precursor to #183, but we would need to make the choices more generic and describe more scenarios in depth to explain why one might pick one format over another. For example, there are several non-cloud-optimized formats mentioned here, and there might be some significant disagreement over their inclusion.
Thanks @abarciauskas-bgse @wildintellect, that sounds like a good plan. I agree that the VEDA-specific workflow content should live in VEDA docs, and we can keep the more general data preparation guidance here. For #183, I’ll create a branch (if you want) focused on the decision framework and move over the relevant sections (formats, chunking, compression, etc.) so we can iterate there. I can also start structuring it as a decision guide to align with the goal of that issue. Let me know if that direction works or if you have something different in mind for the framework.
Summary
This PR adds an initial version of a Data Producer Guide for VEDA instances.
It includes guidance on:
Notes
Goal
Provide a clear, consistent starting point for onboarding datasets into VEDA.