You are viewing a preview of this job. Log in or register to view more details about this job.

PREP0004601 GenAI Robotics Researcher

GenAI Robotics Researcher

Project Description:

This position involves providing software and web development support for evaluation activities in the Artificial Intelligence Group (774.02) at NIST. Specifically, the work will support NIST GenAI evaluation series (https://ai-challenges.nist.gov/genai).  

The candidate will assist in developing cutting-edge human assessment infrastructure and social cognition assessments that support evaluations in various modalities (such as text, voice, image, and video). The infrastructure includes a web platform, evaluation pipeline (backend/frontend), database schema, and other related tasks. The development includes datasets using frontier AI/LLM tools, conducting experiments, performing statistical analysis, and writing technical reports. The candidate will work alongside NIST researchers to advance measurement science through cutting-edge research and evaluations in the field of Generative AI.

Key Responsibilities: 

  • Evaluate Agentic AI applications in robotics and drones within the manufacturing sector
  • Analyze AI‑generated content by extracting linguistic, syntactic, semantic, and structural features, and using computational methods to compare LLM judgments with human perception and behavioral responses
  • Engineer prompt pipelines to generate controlled AI-generated content across models and other experimental conditions
  • Apply pre/post-processing techniques for adversarial attacks on signals from multiple modalities (Audio, Image, and Text)
  • Apply signal analysis methods (information theory, topological data analysis, spectral methods) on high-dimensional representation spaces
  • Upgrade, maintain, and develop NIST GenAI software and evaluation pipelines
  • Develop baseline systems to benchmark AI models on synthetic content generation and detection
  • Curate datasets for synthetic content detection, aligned with real-world forensic applications
  • Manage data, such as querying and translating data formats/specifications

Required Qualifications: 

  • Proficiency in shell scripts (Bash, etc.) and coding (e.g., Python, Java, JavaScript, R, C, and C++).
  • Proficiency in database management (e.g., PostgreSQL, MongoDB)
  • Background in robotics engineering or AI evaluations involving LLM as judge, study design, and data analysis
  • Real-time code development using a distributed computing environment
  • Experience in applying machine learning and deep learning methods to audio and image processing tasks
  • Experience with Computer Vision and Array Signal Processing (adaptive and non-adaptive filtering)
  • Experience with open-source and API-based AI models in the domain of synthetic data
  • Experience in statistical analysis methods to test and compare AI models on forensic tasks
  • Experience with signal processing techniques like Cosine, Fourier, and Wavelet transforms for feature extraction of digital and time-series data
  • Experience using GPU-accelerated clusters and CUDA for deploying open-source models
  • Experience using cloud computing (such as Azure and GCP) and integrating cloud-based APIs
  • Mastery of the principles, practices, and methods in a specialty of a technical professional field (in science, AI/ML engineering, computer science, robotics engineering, information technology, or mathematics, or related field)

Desired Qualifications: 

  • U.S. Citizen Preferred
  • Previous Experience as a full-stack web developer with tools potentially including using Ruby on Rails: Rails5, Postgres, HTML/CSS, JavaScript
  • Experience with Jupyter notebooks, R, Shiny, and interactive plots 
  • Experience in cross-platform software development (Linux, Mac OS X, Windows) 
  • Experience/interest in machine learning and AI test and evaluation

Other Details:

  • Full-time: the participant is expected to work 40 hours a week
  • Location: the participant will work at the NIST Gaithersburg Campus and telework
  • Duration: this is expected to be a ten-month position. Extensions are sometimes granted depending on the availability of funds.