NASA Enterprise Directory

Names and contact information of NASA employees and contractors: 102,615 entries, each with name, email, and phone fields.

Data and Resources

- 51949EC4-BBD5-4170-A209-17782B54DB3F.zip: Archive containing all captured raw data, scripts used for data extraction,...
- NASA_Directory.csv (CSV): List of names and contact information of NASA employees and contractors in...
- bag-info.txt
- bagit.txt
- manifest-md5.txt
- tagmanifest-md5.txt
- 51949EC4-BBD5-4170-A209-17782B54DB3F.html (HTML): NASA directory search
- 51949EC4-BBD5-4170-A209-17782B54DB3F.json (JSON)
- raw_pages_email.zip (ZIP)
- raw_pages_first_name.zip (ZIP)
- raw_pages_last_name.zip (ZIP)
- raw_pages_phone.zip (ZIP)
- track.gif (GIF)
- people.nasa.gov.1
- index.html.1
- base.css (text/css)
- ned.css (text/css)
- reset-fonts-grids.css (text/css)
- nasa_header_161616.png (PNG)
- nebula.jpg (JPEG)
- ned_logo_161616.png (PNG)
- search
- people.nasa.gov-2017-02-28-aa1b78a0-00000.warc
- people.nasa.gov-2017-02-28-aa1b78a0-00000.warc.gz
- 01_scrape_script_email.py (text/x-python)
- 02_scrape_script_last_name.py (text/x-python)
- 03_scrape_script_first_name.py (text/x-python)
- 04_scrape_script_phone.py (text/x-python)
- 05_extracting_table_data.py (text/x-python)

Additional Info

Field	Value
Source	https://people.nasa.gov
Version
Author
Author Email
Maintainer
Maintainer Email
Shared (this field will be removed in the future)	Open
IB1 Sensitivity Class
IB1 Trust Framework
IB1 Dataset Assurance
Free text description of capture process	Tools: Python with Selenium and PhantomJS for scraping, lxml for parsing. I ran an exhaustive series of searches by constructing URLs. I began by searching the email field for every valid two-character combination followed by the wildcard '*'. If a search returned too many results to display on one page (more than 100), I appended each possible additional character in the next round, and so on. The process ended when no search returned too many results to display on a single page. To find directory listings without email addresses, I repeated the process for last names, first names, and phone numbers. If a field included more than 100 identical entries, I constructed additional search loops on a case-by-case basis, all of which are included in the attached scripts. Because pages were rendered using JavaScript, I used a headless browser (PhantomJS, driven by Selenium from Python) to convert pages to static HTML. I parsed the resulting HTML files with lxml, then wrote all data to a comma-delimited CSV file using the unicodecsv package.
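
The prefix-refinement loop described above can be sketched roughly as follows. This is an illustration only, not the attached scripts: the function names, the `alphabet` and `max_len` parameters, and the in-memory fake directory are all assumptions made for the example. A real run would replace `search` with a function that loads the constructed search URL in a headless browser and reports the hits.

```python
from itertools import product
from typing import Callable, List, Set, Tuple

PAGE_LIMIT = 100  # from the description: more than 100 hits cannot be shown on one page


def exhaustive_prefix_search(
    search: Callable[[str], List[str]],  # hypothetical: returns the hits for "<prefix>*"
    alphabet: str,                        # assumed set of characters valid in the field
    start_len: int = 2,
    max_len: int = 8,                     # assumed cut-off so the loop always terminates
    page_limit: int = PAGE_LIMIT,
) -> Tuple[Set[str], List[str]]:
    """Collect every value reachable through prefix queries.

    Returns the values found and the prefixes that still overflowed at
    max_len (for example, more than 100 identical entries), which would
    need case-by-case handling.
    """
    found: Set[str] = set()
    unresolved: List[str] = []
    # Round one: every combination of start_len characters.
    frontier = ["".join(chars) for chars in product(alphabet, repeat=start_len)]
    while frontier:
        next_round: List[str] = []
        for prefix in frontier:
            hits = search(prefix)
            if len(hits) <= page_limit:
                found.update(hits)  # the whole result set fits on one page
            elif len(prefix) < max_len:
                # Too many results: retry with each possible next character.
                next_round.extend(prefix + ch for ch in alphabet)
            else:
                unresolved.append(prefix)
        frontier = next_round
    return found, unresolved


def make_fake_directory(values: List[str]) -> Callable[[str], List[str]]:
    """Stand-in for the real site: a prefix match over an in-memory list."""
    def search(prefix: str) -> List[str]:
        return [v for v in values if v.startswith(prefix)]
    return search


# Made-up addresses, used only to exercise the loop.
emails = [f"ab{i:03d}@example.org" for i in range(250)] + ["cd@example.org"]
found, unresolved = exhaustive_prefix_search(
    make_fake_directory(emails), alphabet="abcd0123456789"
)
print(len(found), unresolved)  # prints: 251 []
```

In this sketch a value that exactly equals an overflowing prefix would be missed once the prefix is extended, so a real scrape would need a separate exact-match query for such cases.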