-
Notifications
You must be signed in to change notification settings - Fork 289
Function to create initial data docs #1681
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Would you like to make a PR for consideration? Full disclosure: I'm not 100% convinced that usethis should do this. But this is the topic of an issue I recently closed in R Packages, which contains some concrete ideas to start with. |
As I continue to work on R Packages, I've learned there are packages that already offer functions to do this. One example is sinew (https://cran.r-project.org/web/packages/sinew/index.html). So given that there are solutions out there already, I don't think it's a priority for us to add this to usethis. |
Just wanted to add to this since I had the same thought. |
Hi, based on @jennybc suggestions, I created this function to help with the documentation. This is under construction, so any comments will be constructive. I like your idea of use_data(., doc = TRUE), so maybe they will re-open this issue in the future. I am also a big fan of use_this. |
OK, we'll reconsider. |
We probably also need to think a little about how we organise data documentation files. We currently tend to dump all data docs into a single |
This comment was marked as outdated.
This comment was marked as outdated.
This comment was marked as outdated.
This comment was marked as outdated.
The FAIR framework can be a helpful resource for organizing dataset documentation files. From my experience, most researchers only describe the variables in the dataset, but including good metadata can make the datasets more valuable and useful. Thank you for considering this issue as a potential feature. |
@focardozom I'm not familiar with FAIR. Can you please summarise how it might inform a function that automatically creates a documentation template? |
@hadley FAIR can be used as a checklist to decide what information should be included in the template created by the function. Following the FAIR, the template should include two categories: (1) metadata, which includes information that helps others find, access, and use the data, such as details about how the data was gathered, licensing, file size, format, etc. Some of this information can be automatically extracted from the data object and included in the template, while other information should be suggested to the user to fill in. (2) The template should also include spaces to describe the variables. Users can use this template to ensure that they at least include basic elements recommended by guides like FAIR. |
I am talking with my mentor @RaymondBalise. We looked at how you documented datasets in I have been reviewing https://design.tidyverse.org and I would love to apply what I have learned. Can I see how you are coding this or can I help? |
Labelling for tidyverse dev day. Overall advice: start small, aim for an MVP (so: probably not everything you see discussed above). |
Hi! document data is always tricky, so having a function to help people document the dataset would be fantastic. This function could create an .R file in the R folder containing information gathered from the data set. For example, the information can be inserted in a roxygen template, and the @Items values can be filled using
glue()
. The template could also include some descriptive information to help users understand the dataset better.The text was updated successfully, but these errors were encountered: