Back to Jobs

VP of AI

NanoNets
Palo Alto, CA, US, Remote (Palo Alto, CA, US)
Full-time
$125K–$157K
Estimated
Remote
Apply Now

Required Skills

Machine Learning
Deep Learning
Computer Vision
Transformers
Python
R
Typescript
Go
Tensorflow
Pytorch
Keras
React
Statistics
Sql
Mysql
Cassandra
Aws
Gcp
Kubernetes
Jenkins

Job Description

Nanonets is automating document information extraction using AI. We are headquartered in San Francisco. We are backed by prestigious investors from bay area like Y-Combinator, SV Angels, Sound Ventures by Ashton Kutcher. We are currently profitable and growing at a fast pace and looking to expand our team. We are building a product that lets companies automate extracting key information from documents like invoices, receipts, or any other kind of document and integrate it into their workflows saving manual work. We need to keep building features that will let users automate millions of documents of different kinds every day, feed them to our AI for learning, plug our API to external systems like salesforce, quickbooks, RPA providers etc. You should check it out at https://app.nanonets.com Nanonets has a vision to automate the most complex, time-consuming processes inside companies which take away hours of valuation time from employees. Our workflow automation platform is powered with the latest models that can make sense of even unstructured data like documents. Our client footprint spans across brands such as Toyota, Boston Scientific, Bill.com and Entergy to name a few enabling businesses across a myriad of industries to unlock the potential of their visual and textual data. Some of the technical challenges we deal with are fine-tuning SOTA VLM’s for workflow automation, using generating architectures for their superior emergent behaviour to understand the world but also constraining those architecture to reduce hallucination and generate structured workflows which can be used in completely automated manner. We recently announced a series B round of $29 million in funding by Accel and are backed by the likes of existing investors including Elevation Capital & YCombinator. This infusion of capital underscores our commitment to driving innovation and expanding our reach in delivering cutting-edge AI solutions to businesses worldwide. Read about the release here: https://www.forbes.com/sites/davidprosser/2024/03/12/why-enterprises-are-learning-to-love-nanonets-automation/?sh=6d79ec8f3ca1 https://techcrunch.com/2024/03/12/nanonets-funding-accel-india/amp/ What We Expect From You Strong Machine Learning & DL concepts. Strong command in low-level operations involved in building architectures like Transformers, Efficientnet, ViT etc., and experience in implementing those in pytorch/jax/tensorflow. Experience with the latest semi-supervised, unsupervised and few shot architectures in Deep Learning methods in NLP/CV domain Strong command in probability and statistics. Strong programming skills. Have previously shipped something of significance, either implemented some paper or made significant changes in an existing architecture etc Strong Machine Learning & DL concepts. Strong command in low-level operations involved in building architectures like Transformers, Efficientnet, ViT etc., and experience in implementing those in pytorch/jax/tensorflow. Experience with the latest semi-supervised, unsupervised and few shot architectures in Deep Learning methods in NLP/CV domain Strong command in probability and statistics. Strong programming skills. Have previously shipped something of significance, either implemented some paper or made significant changes in an existing architecture etc Ideal candidate should have the following skillset Experience building and deploying systems Experience with Theano/Torch/Caffe/Keras/Python/Tensorflow all useful Experience writing production software would be a plus The ideal candidate should have developed their own DL architectures apart from using open source architectures. Ideal candidate would have extensive experience with computer vision applications. Experience building and deploying systems Experience with Theano/Torch/Caffe/Keras/Python/Tensorflow all useful Experience writing production software would be a plus The ideal candidate should have developed their own DL architectures apart from using open source architectures. Ideal candidate would have extensive experience with computer vision applications. Interesting Projects Other Senior DL Engineers Have Completed Deployed large scale multi-modal architectures that can understand both text and images really well. Built an auto-ML platform that can automatically select best architecture, fine-tuning method based on type and amount of data. Best in the world models to process documents like invoices, receipts, passports, driving licenses, etc Hierarchical information extraction from documents. Robust modeling for the tree-like structure of sections inside sections in documents. Extracting complex tables — wrapped around tables, multiple fields in a single column, cells spanning multiple columns, tables in warped images, etc. Enabling few-shots learning by SOTA finetuning techniques. Deployed large scale multi-modal architectures that can understand both text and images really well. Built an auto-ML platform that can automatically select best architecture, fine-tuning method based on type and amount of data. Best in the world models to process documents like invoices, receipts, passports, driving licenses, etc Hierarchical information extraction from documents. Robust modeling for the tree-like structure of sections inside sections in documents. Extracting complex tables — wrapped around tables, multiple fields in a single column, cells spanning multiple columns, tables in warped images, etc. Enabling few-shots learning by SOTA finetuning techniques. Candidate should have experience working on Deep Learning with an engineering degree from a top tier institute Thinking of applying? Try our resume builder— it's free, fast, and tailored to help you stand out. Some of the interesting things our backend team has shipped Compile python code into C which could be imported into golang and then shipped as binary for on premise systems Autoscale GPU dependent services with kubernetes with a custom metric Displaying machine learning metrics in simplified ways to end users so they can act based on those metrics Building large number and variety of integrations with relatively generic interface like salesforce, quickbooks, RPA's, external databases Process large number of files in highly distributed manner in golang Compile python code into C which could be imported into golang and then shipped as binary for on premise systems Autoscale GPU dependent services with kubernetes with a custom metric Displaying machine learning metrics in simplified ways to end users so they can act based on those metrics Building large number and variety of integrations with relatively generic interface like salesforce, quickbooks, RPA's, external databases Process large number of files in highly distributed manner in golang Some of the interesting things our frontend team has shipped Ability for users to annotate documents so AI can learn which fields to extract Displaying machine learning metrics in simplified ways to end users so they can act based on those metrics Letting users build complex visual workflows around our API in our product. Let users visualize complex ML metrics in a very simple and intuitive way Ability for users to annotate documents so AI can learn which fields to extract Letting users build complex visual workflows around our API in our product. Let users visualize complex ML metrics in a very simple and intuitive way Our stack: Databases Cassandra DB Postgres/MySQL Backend Golang for API and other microservices Python for Machine learning (Tensorflow, Pytorch) Frontend React, Typescript Mobx Cloud Providers AWS GCP for ML heavy workload Monitoring/Alerting ELK for logging Prometheus for Monitoring Graphana for dashboards Orchestration Kubernetes DevOps Jenkins for CI/CD Databases Cassandra DB Postgres/MySQL Cassandra DB Postgres/MySQL Cassandra DB Postgres/MySQL Backend Golang for API and other microservices Python for Machine learning (Tensorflow, Pytorch) Golang for API and other microservices Python for Machine learning (Tensorflow, Pytorch) Golang for API and other microservices Python for Machine learning (Tensorflow, Pytorch) Frontend React, Typescript Mobx React, Typescript Mobx React, Typescript Mobx Cloud Providers AWS GCP for ML heavy workload AWS GCP for ML heavy workload AWS GCP for ML heavy workload Monitoring/Alerting ELK for logging Prometheus for Monitoring Graphana for dashboards ELK for logging Prometheus for Monitoring Graphana for dashboards ELK for logging Prometheus for Monitoring Graphana for dashboards Orchestration Kubernetes Kubernetes DevOps Jenkins for CI/CD Jenkins for CI/CD

Job Details

Employment Type

Full-time

Salary Range

$125K–$157K

Estimated

Location

Palo Alto, CA, US, Remote (Palo Alto, CA, US)

Remote Work

Remote Friendly