Web/Data Scraping Specialist

Folio

Folio

United States
Posted on May 9, 2025

We’re looking for a Web/Data Scraping Specialist to help identify and collect data on digital

communities where students and emerging professionals are active online.

You’ll use tools like Python, BeautifulSoup, Scrapy, and/or Selenium to programmatically

discover and extract publicly available information (e.g., Discord servers, subreddit names,

Instagram handles, and more). The goal is to build a clean, well-structured database of outreach

targets to fuel our growth campaigns. This is a paid internship ideal for a technically-inclined student or freelancer who enjoys open-ended discovery, internet research, and building useful tools from scratch.

What You'll Do:

  • Use Python (or similar) to identify and extract publicly available data from online platforms such as Discord, Reddit (e.g., relevant subreddits), Instagram, GroupMe, ZeeMee, Facebook Groups, and other niche or school-specific digital communities
  • Develop scrapers or scripts using libraries like BeautifulSoup, Scrapy, or Selenium depending on platform constraints
  • Build and maintain a structured outreach directory (e.g., CSV, Airtable, or Google Sheets format) with metadata including platform, audience type, school focus, and engagement notes
  • Ensure ethical scraping practices and compliance with platform terms of service
  • Collaborate closely with the team to hand off clean, actionable data for engagement

You Might Be A Good Fit If You:

  • Have experience with Python and web scraping tools (e.g., BeautifulSoup, Scrapy, Selenium)
  • Are familiar with REST APIs or browser automation a plus
  • Are comfortable navigating and researching across a wide range of online platforms
  • Can structure and clean messy or unstructured data
  • Are a self-starter who thrives with minimal supervision
  • Bonus: Experience organizing datasets for use by non-technical teams

What You'll Gain:

  • Help build a startup from the ground up with a clear social mission
  • Direct impact: your work powers outreach to caregivers who support families in need
  • Flexible, remote-friendly, results-driven environment
  • Work directly with the founder and growth team
  • Opportunity for contract extension or more technical work in the future