What is a Community Garden in the Digital Age? – Data Management Plan | DHUM 70002 Digital Humanities: Methods and Practices (Spring 2022)

What are the types of data that may be produced as part of this project?
- This data will be collected via already publicly available data from websites such as NYC Opendata, GreenThumb Gardens website, GrowNYC among other individual community Garden websites. We will also focus on completely different sets of available datasets such as income by location in NYC, food deserts in NYC, among other ideas that will come up. In the following weeks, one of our team members will start to reach out to the community and begin an ethnographic study. This data will be new and be produced, edited by the team as a whole. It will consist of some recorded interviews as audio-file, transcription of the interviews and handwritten notes from visiting different community gardens. We plan to use information that the participants allow us to share. Information such as quotes, pseudonyms and overall group trends will be shared, and each interview will be stored on our shared google drive. This data is completely static and as long as the original hosting website is available, there should be a minimal risk of losing data. The only new data that will be produced is our cross analysis of any trends we see among different data sets. Extrapolation would be our “new data.” This would be stored in our shared google drive as our main document management system. This data would be static and this can evolve over time but we do not expect this to evolve during the next few months. We are planning to use free available software such as Tableau online and possible wordpress websites. TBD on the amount of data we are able to present. During this project we are using Google drive that is linked to three of our google accounts. If one of us loses access, then one of the other two can download any past data. We can also email or message each other with documents so we can have different avenues of sharing data.

What standards will you be using for data collection, documentation, description, and metadata?
- We will continue to use google drive for our dataset storage. We will aim to download easy to open files formats such as Excel, CVS, Word Documents. We will document any findings in a shared MS Word document file. Our Ethnographer will determine if any information is deemed private and will determine how to store that data in a private separate file. We will all review the way we store the documents, we will constantly vet and modify the files as we see fit and bring up any data that seems confidential to the team. We should all be responsible for informing each other with anything we see. We will have folders with file hierarchy such as organizing per week or per location. We will name each file with a main topic on the title and update it as we add on to the document. We will use standard data naming conventions as “Final project” or “interviews on xx/xx”

What steps will you take to protect your or your participant’s security, privacy/confidentiality, intellectual property, or other rights? (Check current university policies for requirements.)
- We will have complete ownership of interview data, we will determine if the interview data should be kept confidential in the most ethical way. Each visit and interview we make with community gardens, we will make it clear if the wish to be private and to what degree. Additionally will we make it clear to them how we are gonna use the interviews and they are not obligated to answer and can step out of the project if they wish. Since we are using already available and public data, there should be minimal reason to restrict or review this data. However, as Data Humanists we would need to be careful on determining if this data can be used for political and Nefarious reasons. If so, we would try our best to broaden the datasets and naming conventions.

If you allow others to reuse your data, how will the data be accessed and shared?
- The data we create and publish will be under standard sharing protocols. The data will not be restricted nor behind a paywall. We expect a large audience to use this information, one being community leaders themselves using this information to enhance their connection to their gardens. As we are extrapolating data as well, we expect future researchers to further this work or use it as a foundation for their research. We plan to export and publish this data on wordpress and/or CUNY commons. We hope this data will live as a historical preservation on data collected after the pandemic of 2020.

How will the data be archived for preservation and long-term access?
- Our internal data collection and method of collection will be stored in our personal google drive accounts. The external data will be available until CUNY common is no longer available to the public. We will stick to simple, ample and easy to access formats like Microsoft Office.