Senior Data Scientist
18-Dec-2024
Senior Data Scientist
Harvard University Information Technology
67656BR
Position Description
This is a fully benefited, full-time Harvard University position that has been funded through 7/31/2026. There is the possibility of renewal, contingent on funding, university priorities and satisfactory job performance.
To support researchers and faculty members at the Harvard Data Science Initiative (HDSI), University Research Computing and Data Services (Univ. RCD) is partnering with HDSI to hire a Senior Data Scientist. The position will be instrumental in supporting the computational needs of Harvard faculty-led projects and must be able to understand complex research problems at a fundamental level. Broadly, this individual will contribute to data science-driven projects by constructing outputs intended for broad open access by different stakeholders. Furthermore, this role is pivotal in supporting research activities at HDSI by developing advanced data models, managing architectures, utilizing cloud services for scalable data processing, employing state-of-the art statistical techniques, managing and analyzing large datasets, and applying machine learning algorithms and large language models to derive meaningful insights. Interactions with industry partners such as Amazon Web Services will provide valuable exposure to advanced high-performance cloud computing capabilities.
Univ. RCD within the central Harvard University Information Technology organization maintains a dynamic and diverse community of engineers dedicated to evolving the University’s research computing services. They proactively identify gaps and emerging needs, providing solutions to address them. HDSI supports research in data science methodology and applications through multiple programs, including industry-sponsored research collaborations that align with research interests across Harvard schools, with a focus on understanding, mitigating, and finding solutions to global health, environmental, food, and social crises.
This position operates within a team of senior engineers and reports to the head of Research Software Engineering in Univ. RCD, with the HDSI Scientific Director providing strategic oversight. The ideal candidate will demonstrate a passion for pushing the boundaries of Data Science technologies to solve urgent global challenges, enthusiasm for contributing to a collaborative and innovative environment, and an affinity for joining teams of academic researchers to advance knowledge for society’s benefit. Harvard University encourages candidates with diverse backgrounds and fresh perspectives to join Univ. RCD and HDSI.
Typical Core Duties:
- Advise researchers in the design, planning and implementation of data science workflows, tools and pipelines that enriches research productivity and reliability.
- Design and implement data set processes that allow for data modeling and mining.
- Assist with cloud computing-based infrastructure needs, data processing and analytics.
- Work closely with multiple stakeholders to define an overall multi-project plan that is continually revised, incorporating various inputs including dependencies, project contractual milestones, and priorities, for efficiency, transparency and managing stakeholder expectations.
- Build;
- and maintain aspects of custom data science tools, data processing pipelines, and software code for complex environments.
- internal code design and development guides for future contributors.
- advanced curriculum and teach workshops for researchers on sustainable data science workflows, software, and data management practices.
- Apply firm understanding of specific technology to develop custom solutions to meet researcher’s needs.
- Work in a team of developers and researchers in collaboration with research computing professionals.
- Provide regular communications to PI’s/stakeholders with project updates.
- Abide by and follow the Harvard University IT technical standards, policies and Code of Conduct.
Basic Qualifications
- Minimum of seven years’ post-secondary education or relevant work experience
Additional Qualifications and Skills
- Bachelor's or Master’s Degree in Statistics, Data Science, Computer Science, Mathematics, Informatics, or other health data related field.
- Prior work within a research environment is essential, including familiarity with the research pipeline and the process of conducting research employing accepted scientific experimental practices.
- Knowledgeable of;
- data engineering, data architecture, database management, and data visualization techniques, with high proficiency in data extraction and wrangling.
- numerical methods, statistical analysis, and machine learning. More specifically, experience fitting and interpreting a range of models, including at least some of: GLM, GLMM, SEM, econometric models, machine learning models.
- The ability to create and maintain databases using libraries from Python, R in Linux environment.
- Expertise in leveraging cloud platforms, especially Amazon Web Services (AWS), for scalable data processing and analytics (EC2, S3, Redshift, Lambda), and machine learning tools (SageMaker, Glue, Athena).
- Experience with;
- data warehousing and ETL/ELT processes. Skilled in SQL, NoSQL databases, and data modeling techniques. Experience with big data technologies and ecosystems (e.g. Hadoop, Spark).
- CI/CD pipelines for data science projects and their reliable deployment. Assisting with release of models/products to the proper platform (e.g., a website, an interactive API, etc.), including infrastructure design.
- Background in scientific programming/scripting (Python, R, Stata, and C++);
- 3+ years of experience using either Python or R in a data science and/or research context required.
- 5+ years of this experience preferred. More specifically, advanced skills in Python libraries for data science (Pandas, NumPy, sci-kit learn, TensorFlow/PyTorch) or experience using object-oriented programming systems in R (e.g., S3, S4, RC, R6).
- Adherence to best practices in scientific programming, including version control (Git), code review, unit testing, and documentation to ensure reproducibility and maintainability of data science projects.
- Proven track record of success in working in a cross-functional team in an agile environment.
- Excellent communication skills; able to simplify complex technical concepts to stakeholders.
- Detail-oriented expertise, with strong problem-solving skills to support research.
- Strong team player with a service mindset, able to guide researchers and is customer focused.
- Awareness of and aptitude to appropriately and effectively understand, respect, and adapt to cultural and identity‐based difference within group environments, and experience fostering and reinforcing an environment that values unique experiences, cultures, backgrounds, and goals.
Certificates and Licenses
- Completion of Harvard IT Academy specified foundational courses (or external equivalent) preferred
Working Conditions
- Work is performed in an office setting
- Occasionally required to work outside of normal business hours, and may be contacted during off hours
Additional Information
Please provide a cover letter with your application.
Please note:
- Harvard University requires pre-employment reference and background screening.
- We are unable to provide work authorization and/or visa sponsorship.
- This position has a 180-day orientation and review period.
The health of our workforce is a priority for Harvard University. With that in mind, we strongly encourage all employees to be up-to-date on CDC-recommended vaccines.
Accessibility:
Harvard University welcomes individuals with disabilities to apply for positions and participate in its programs and activities. If you would like to request an accommodation or have questions about the physical access provided, please contact our University Disability Resources Department.
Work Format Details
HUIT actively supports hybrid work where business needs allow. This position has been designated as a Hybrid position. While this position is Hybrid, travel to campus may be necessary based on business needs and the nature of work. Examples include bi-annual or quarterly Town Halls, critical business meetings or other work events. Additional details will be discussed during the interview process. All remote work must be performed within one of the Harvard Registered Payroll States, which currently includes Massachusetts, Connecticut, Maine, New Hampshire, Rhode Island, Vermont, Georgia, Illinois, Maryland, New Jersey, New York, Virginia, Washington, and California (CA for exempt positions only). Certain visa types and funding sources may limit work location. Individuals must meet work location sponsorship requirements prior to employment.
About Us
More About HUIT:
Our Mission: huit.harvard.edu/about
We empower the Harvard community with essential and transformative technologies to advance education, knowledge, and discovery.
HUIT’s core values are:
- Human-centered
- University-focused
- Innovation-driven
- Team-oriented
IT Academy (designed for IT Staff):
HUIT’s IT Academy aims to enable each IT staff person to grow professionally and become a trusted partner to her or his team. The IT Academy is built on the belief that every IT staff member across the University (including technology employees at each school and campus) can grow in her or his area of expertise as well as building strong people and project management skills. Learn more here: https://itacademy.harvard.edu/
Benefits
We invite you to visit Harvard's Total Rewards website (https://hr.harvard.edu/totalrewards) to learn more about our outstanding benefits package, which may include:
- Paid Time Off: 3-4 weeks of accrued vacation time per year (3 weeks for support staff and 4 weeks for administrative/professional staff), 12 accrued sick days per year, 12.5 holidays plus a Winter Recess in December/January, 3 personal days per year (prorated based on date of hire), and up to 12 weeks of paid leave for new parents who are primary care givers.
- Health and Welfare: Comprehensive medical, dental, and vision benefits, disability and life insurance programs, along with voluntary benefits. Most coverage begins as of your start date.
- Work/Life and Wellness: Child and elder/adult care resources including on campus childcare centers, Employee Assistance Program, and wellness programs related to stress management, nutrition, meditation, and more.
- Retirement: University-funded retirement plan with contributions from 5% to 15% of eligible compensation, based on age and earnings with full vesting after 3 years of service.
- Tuition Assistance Program: Competitive program including $40 per class at the Harvard Extension School and reduced tuition through other participating Harvard graduate schools.
- Tuition Reimbursement: Program that provides 75% to 90% reimbursement up to $5,250 per calendar year for eligible courses taken at other accredited institutions.
- Professional Development: Programs and classes at little or no cost, including through the Harvard Center for Workplace Development and LinkedIn Learning.
- Commuting and Transportation: Various commuter options handled through the Parking Office, including discounted parking, half-priced public transportation passes and pre-tax transit passes, biking benefits, and more.
- Harvard Facilities Access, Discounts and Perks: Access to Harvard athletic and fitness facilities, libraries, campus events, credit union, and more, as well as discounts to various types of services (legal, financial, etc.) and cultural and leisure activities throughout metro-Boston.
Job Function
Information Technology
Department Office Location
USA - MA - Cambridge
Job Code
I1258P IT RC Software/Data Prof IV
Work Format
Hybrid (partially on-site, partially remote)
Sub-Unit
------------
Salary Grade
058
Department
University Research Computing and Data
Union
00 - Non Union, Exempt or Temporary
Time Status
Full-time
Appointment End Date
31-Jul-2026
Pre-Employment Screening
Identity
Schedule
- Occasionally required to work outside of normal business hours, and may be contacted during off hours
Commitment to Equity, Diversity, Inclusion, and Belonging
Harvard University views equity, diversity, inclusion, and belonging as the pathway to achieving inclusive excellence and fostering a campus culture where everyone can thrive. We strive to create a community that draws upon the widest possible pool of talent to unify excellence and diversity while fully embracing individuals from varied backgrounds, cultures, races, identities, life experiences, perspectives, beliefs, and values.
EEO Statement
We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, gender identity, sexual orientation, pregnancy and pregnancy-related conditions, or any other characteristic protected by law.
LinkedIn Recruiter Tag (for internal use only)
#LI-BT1