Issues working with GenAI
Data Privacy
Best practices for data privacy
  • Check the data retention policies in the terms and conditions (T&C) of the tools you use
  • Use a private server or work through an API
  • Is the information yours to share? (Drafting your own text vs. reviewing someone else's)
  • Can you redact the content?
  • Check whether an enterprise version is available, and what its data terms are
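One way to act on the redaction question above is to strip obvious identifiers before pasting text into a tool. The following is a minimal sketch, not a complete solution: the two regular expressions and the `redact` helper are hypothetical names chosen for illustration, and they only catch e-mail addresses and phone-like numbers. Names, addresses, and project details still need a manual check.

```python
import re

# Illustrative patterns only (assumptions, not an exhaustive PII list).
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE = re.compile(r"\+?\d[\d\s().-]{7,}\d")

def redact(text: str) -> str:
    """Replace e-mail addresses and phone-like numbers with placeholders."""
    text = EMAIL.sub("[EMAIL]", text)
    text = PHONE.sub("[PHONE]", text)
    return text

sample = "Contact Jane at jane.doe@example.org or +1 (555) 010-9999."
print(redact(sample))  # the name "Jane" is NOT caught by these patterns
```

Pattern-based redaction is a first pass at best; anything the patterns miss still leaves your machine.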
Does the tool keep my data?

Attachment: AI_Tools_Data_Retention_Summary_Colored.pdf (PDF, 4.2 KB)

Will my personal information show up online?
  • LLM companies are interested in big data, not specific bits of information
  • To date, there are few documented cases of private data resurfacing elsewhere online
  • Hyper-niche areas may be at slightly higher risk
  • Consider what you would stand to lose if the information became public
Is your data secure elsewhere?
Copyright Issues
Training Data on Copyrighted Material
Ownership/Licensing Restrictions on AI-Generated Content
Use of Your Data for AI Training by Publishers
GPT for Universities
Research Integrity
Infiltrating Peer Review
Ethical Writing Checklist

Ethical Checklist for AI in EU Grant Writing: aieugwethics.netlify.app

Changing how we write
Bias Amplification
Example: text generated by GPT-3 from single-word prompts: “Jew”, “Black”, “Women”, “Holocaust”
The Challenge of Overcoming Image Bias
Citations
AI Detection Tools
  • Can AI be trusted to weed out AI?
  • Easy to circumvent through paraphrasing
  • Examples: Originality.AI, Turnitin, iThenticate, Grammarly
AI Detection Techniques
Feature-based: Analyzes statistical properties of text, such as perplexity (how unpredictable the word choices are to a language model) and burstiness (how much sentence length and structure vary). AI-generated text tends to have lower perplexity and more uniform burstiness over spans of ~15 words, while human writing shows more variability.
Example: "I was drinking coffee" has lower perplexity than "I was drinking blood" because "coffee" is the more predictable continuation.
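The two features can be sketched in a few lines of Python. This is a toy illustration under loud assumptions: the reference corpus is made up, the language model is a simple add-one-smoothed bigram model rather than anything a real detector uses, and burstiness is approximated as the standard deviation of sentence lengths, which is only one possible proxy.

```python
import math
from collections import Counter
from statistics import pstdev

# Toy reference corpus (made up for illustration only).
CORPUS = (
    "i was drinking coffee . i was drinking tea . "
    "she was drinking coffee . he was drinking water . "
    "the vampire wanted blood ."
).split()

unigrams = Counter(CORPUS)
bigrams = Counter(zip(CORPUS, CORPUS[1:]))
VOCAB = len(unigrams)

def bigram_prob(prev: str, word: str) -> float:
    """Add-one smoothed estimate of P(word | prev)."""
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + VOCAB)

def perplexity(sentence: str) -> float:
    """Exponential of the average negative log-probability per bigram."""
    words = sentence.lower().split()
    log_prob = sum(math.log(bigram_prob(p, w)) for p, w in zip(words, words[1:]))
    return math.exp(-log_prob / (len(words) - 1))

def burstiness(text: str) -> float:
    """Standard deviation of sentence lengths in words (a simple proxy)."""
    lengths = [len(s.split()) for s in text.split(".") if s.strip()]
    return pstdev(lengths)

# The more predictable sentence scores lower perplexity under this toy model.
print(perplexity("I was drinking coffee") < perplexity("I was drinking blood"))

uniform = "One two three four. Five six seven eight."
varied = "Short. This one is a much longer sentence indeed."
print(burstiness(uniform), burstiness(varied))
```

Real detectors compute perplexity with a large language model instead of bigram counts, but the comparison works the same way: predictable continuations lower the score.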
Model-based: Trains a machine-learning classifier on labeled AI-generated and human-written texts so that it learns patterns characteristic of each.
Because these detectors are trained on text from existing AI models, they tend to lag behind the newest models.
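To make the model-based idea concrete, here is a minimal sketch of a word-count Naive Bayes classifier. Everything in it is an assumption for illustration: the four training sentences are invented, the labels are hypothetical, and commercial detectors use far larger datasets and different models.

```python
import math
from collections import Counter

# Made-up training examples, for illustration only.
TRAIN = [
    ("ai", "in conclusion it is important to note that the results are significant"),
    ("ai", "it is important to note that overall the findings are significant"),
    ("human", "honestly the results surprised us and we still argue about why"),
    ("human", "we messed up the first run so the numbers looked weird"),
]

def train(examples):
    """Count words per label and collect the shared vocabulary."""
    word_counts = {"ai": Counter(), "human": Counter()}
    doc_counts = Counter()
    for label, text in examples:
        doc_counts[label] += 1
        word_counts[label].update(text.split())
    vocab = set(word_counts["ai"]) | set(word_counts["human"])
    return word_counts, doc_counts, vocab

def classify(text, model):
    """Return the label with the highest add-one smoothed log-probability."""
    word_counts, doc_counts, vocab = model
    scores = {}
    for label, counts in word_counts.items():
        total = sum(counts.values())
        score = math.log(doc_counts[label] / sum(doc_counts.values()))
        for word in text.split():
            score += math.log((counts[word] + 1) / (total + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)

model = train(TRAIN)
print(classify("it is important to note the findings", model))
```

The sketch also shows why such detectors lag: the classifier only knows the phrasing present in its training data, so text from a newer model with different habits falls outside what it has learned.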
Does Detection Work?
Is Using AI Plagiarism?
Plagiarism detection: compares text against existing sources
AI detection: analyzes the structure of the text itself
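The source-comparison approach can be illustrated with word n-gram overlap. This is a simplified sketch, assuming a single known source and hypothetical helper names (`shingles`, `overlap`); real plagiarism checkers search large databases and use more robust matching.

```python
def shingles(text: str, n: int = 3) -> set:
    """Set of consecutive word n-grams in the text."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def overlap(submission: str, source: str, n: int = 3) -> float:
    """Share of the submission's word n-grams that also occur in the source."""
    sub = shingles(submission, n)
    if not sub:
        return 0.0
    return len(sub & shingles(source, n)) / len(sub)

source = "the quick brown fox jumps over the lazy dog"
copied = "the quick brown fox jumps"
reworded = "a fast brown fox leaps across a sleepy dog"

print(overlap(copied, source))    # every n-gram is found in the source
print(overlap(reworded, source))  # new wording shares no n-grams
```

Freshly generated or reworded text shares no phrases with the source, so a source-comparison check has nothing to match. That is the gap AI detection tries to fill by looking at the text's own structure.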
GPT in the Classroom