Data Scientist
Avantus Federal - Ashburn, VA

Job Description

Company Overview

Avantus Federal, recently acquired by QinetiQ US, is a mission-focused data, cyber and space services, and solutions company. As a mid-market powerhouse with an intentional blend of elite talent, infrastructure, and speed to impact, Avantus leads with technical and domain expertise for its Defense, Intelligence, Homeland Security and Federal Civilian customers.

Helping to solve some of the toughest national security problems and government missions, Avantus’ offerings enable services at scale, including cyber technologies and operations, data and software solutions, digital engineering and integration, intelligence analysis and operations, transformation, and advisory services and more.

Position Overview

REMOTE/HYBRID Opportunity.

Each day U.S. Customs and Border Protection (CBP) oversees the massive flow of people, capital, and products that enter and depart the United States via air, land, sea, and cyberspace. The volume and complexity of both physical and virtual border crossings require the application of “big data” solutions to promote efficient trade and travel. Further, effective “big data” solutions help CBP ensure the movement of people, capital, and products is legal, safe, and secure.

In response to this challenge, Avantus Federal, as a trusted mission partner of CBP, seeks capable, qualified, and versatile data scientists to help lead the development and delivery of high-quality predictive modelling solutions. Successful applicants will serve as recognized subject matter experts in the application of quantitative methods, machine learning algorithms, and predictive models to address complex national and homeland security challenges. They will help our team to leverage large structured and unstructured datasets to develop and operationalize models, tools, and applications that drive optimized decision making. Project tasks include data collection, mining, data and text analytics, clustering analysis, pattern recognition and extraction, automated classification and categorization, and entity resolution to implement and enhance automated risk assessment. The products we develop provide actionable insight with real and immediate impact on the safety and security of the United States, its citizens, visitors, and economy.

Responsibilities
  • Lead and perform hands-on analysis and modeling involving the creation of intervention hypotheses and experiments, assessment of data needs and available sources, determination of optimal analytical approaches, performance of exploratory data analysis, and feature generation (e.g., identification, derivation, aggregation).
  • Collaborate with mission stakeholders to define, frame, and scope mission challenges where big data interventions may offer important mitigations and develop robust project plans with key milestones, detailed deliverables, robust work tracking protocols, and risk mitigation strategies.
  • Demonstrate proficiency in extracting, cleaning, and transforming CBP transactional and mission data associated with an identified problem space to build predictive models and develop appropriate supporting documentation.
  • Leverage expert knowledge of a variety of statistical and machine learning techniques and methods to define and develop programming algorithms; train, evaluate, and deploy predictive analytics models that directly inform mission decisions.
  • Execute projects including those intended to identify patterns and/or anomalies in large datasets; perform automated text/data classification and categorization as well as entity recognition, resolution and extraction; and named entity matching.
  • Brief project management, technical design, and outcomes to both technical and non-technical audiences, including senior government stakeholders, throughout the model development and project lifecycle through written and in-person reporting.
Required Qualifications
  • 7-12 years of relevant experience
  • Experience in applying advanced analytics solutions to solve complex business problems
  • Experience with programming languages including: R, Python, JavaScript, Visual Basic
  • Experience with creating VBA applications and macros to structure, manage, and wrangle key datasets
  • Experience with core data science libraries – Pandas, NumPy, Matplotlib, Plotly, etc.
  • Experience with Anaconda distribution of Python for package management and deployment
  • Familiarity with command-line shell programming (PowerShell, cmd, etc.)
  • Proficiency with SQL programming
  • Familiarity with RESTful APIs, web scraping, and processing unstructured data
  • Knowledge of visualization and presentation techniques including Tableau, Power BI, Jupyter Notebooks, etc.
  • Knowledge of cloud technologies such as AWS or Google
  • Proficiency using git for version control, collaboration, and code review
  • Familiarity with software organization tools and frameworks (Docker, virtual environments, etc.)
  • Experience with engineering and development collaboration tools such as Jira and Confluence
  • Experience with Natural Language Processing (NLP) and computational linguistics, including entity extraction, named entity recognition (NER), name matching, and disambiguation
  • Experience constructing and executing queries to extract data in support of EDA and model development
  • Experience with unsupervised and supervised machine learning techniques and methods
  • Experience working with large-scale (e.g., terabyte and petabyte) unstructured and structured data sets and databases
  • Experience performing data mining, analysis, and training set construction
  • Selected applicants must be U.S. citizens and able to obtain and maintain a U.S. Customs and Border Protection (CBP) suitability.
  • The status of applicable COVID-19 vaccination requirements under Executive Order 14042 is subject to change depending on applicable court orders and the course of ongoing litigation. Candidates may be required to show proof of COVID-19 vaccination or have an approved exemption.
Preferred Qualifications
  • Proficiency with Unsupervised Machine Learning methods including Cluster Analysis (e.g., K-means, K-nearest Neighbor, Hierarchical, Deep Belief Networks, Principal Component Analysis), Segmentation, etc.
  • Proficiency with Supervised Machine Learning methods including Decision Trees, Support Vector Machines, Logistic Regression, Random/Rotation Forests, Categorization/Classification, Neural Nets, Bayesian Networks, etc.
  • Experience with pattern recognition and extraction, automated classification, and categorization
  • Experience with entity resolution (e.g., record linking, named-entity matching, deduplication/disambiguation)
  • Experience with visualization tools and techniques (e.g., Periscope, Business Objects, D3, ggplot, Tableau, SAS Visual Analytics, PowerBI)
  • Experience with big data technologies (e.g., Hadoop, HIVE, HDFS, HBase, MapReduce, Spark, Kafka, Sqoop)
  • Master’s Degree in mathematics, statistics, computer science/engineering, or other related technical fields with equivalent practical experience
  • Active CBP Background Investigation
  • Active Top Secret
Company EEO Statement

Avantus Federal is an equal opportunity workplace and a Vietnam Era Veterans Readjustment Assistance Act (VEVRAA) federal contractor. All qualified applicants receive consideration for employment without regard to race, religion, color, age, gender identity, sexual orientation, national origin, ancestry, citizenship status, physical or mental disability, medical condition, pregnancy, marital or veteran status, as protected by applicable law. If you have a disability or special need that requires accommodation, please let us know by requesting an accommodations application. Avantus encourages members of historically underrepresented communities to apply and hires individuals solely based on their qualifications for the role. We strongly commit to embracing diversity and ensuring equal employment opportunities for all.

This job was posted on Sun Jan 15 2023 and expired on Mon Jan 30 2023.

Data Scientist Interview Questions & Answers

What is the difference between supervised and unsupervised learning?

Answer

Supervised learning uses labeled data to make predictions, while unsupervised learning finds patterns in unlabeled data.
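To make the distinction concrete, here is a minimal sketch in plain Python. The toy data, the function names (`fit_centroids`, `predict`, `kmeans_1d`), and the choice of nearest-centroid classification and k-means as representatives of each family are assumptions made for illustration only. The supervised function needs the labels to learn; the unsupervised one sees only the raw points.

```python
from statistics import mean

# Toy 1-D data; values and labels are made up for illustration.
points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9]
labels = ["low", "low", "low", "high", "high", "high"]

def fit_centroids(points, labels):
    """Supervised: use the known labels to compute one centroid per class."""
    groups = {}
    for x, y in zip(points, labels):
        groups.setdefault(y, []).append(x)
    return {label: mean(xs) for label, xs in groups.items()}

def predict(centroids, x):
    """Assign x to the class whose centroid is nearest."""
    return min(centroids, key=lambda label: abs(centroids[label] - x))

def kmeans_1d(points, k=2, iterations=10):
    """Unsupervised: group unlabeled points into k clusters (k >= 2)."""
    lo, hi = min(points), max(points)
    # Start with centers spread evenly between the smallest and largest point.
    centers = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for x in points:
            nearest = min(range(k), key=lambda i: abs(centers[i] - x))
            clusters[nearest].append(x)
        # Move each center to the mean of its cluster; keep it if the cluster is empty.
        centers = [mean(c) if c else centers[i] for i, c in enumerate(clusters)]
    return centers, clusters

# Supervised: labels are required for training, then used to predict a new point.
centroids = fit_centroids(points, labels)
prediction = predict(centroids, 4.5)

# Unsupervised: only the points are passed in; the groups are discovered.
centers, clusters = kmeans_1d(points, k=2)
```

Note that the k-means clusters come back without names: the algorithm finds two groups, but deciding that one means "low" and the other "high" is left to the analyst.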


About the Data Scientist role

As data scientists, we find and correct errors in data analytics to help our organization succeed. We gather large sets of structured and unstructured data from different sources and check the collected data for validity, correctness, and completeness. We analyze the data to find solutions to persistent problems and to identify new opportunities for the organization. We also communicate our findings to the people who need them for decision-making, and we create clear reports on how customers interact with the business.

Core tasks:

  • applying models to large data sets
  • analyzing data to identify current trends
  • simplifying data problems