About Nexusflow.ai
Modern enterprise copilots & agents call for last-mile quality, enterprise-grade robustness, and scalable operating costs, beyond the simplified programming interfaces for generative AI. Nexusflow tackles this challenge by enabling enterprises to own workflow copilots & agents built on powerful yet cost-effective, compact LLMs. We train large language models and build last-mile quality dev tooling for copilots & agents on your enterprise workflows. Our team built the open-source LLM NexusRaven-V2, which rivals GPT-4 in function calling at a 100X smaller model size. Our team members are also behind Starling, the #1 ranked compact 7B chat model based on human evaluation in Chatbot Arena.
Position: Backend Engineer
Nexusflow is currently adding Backend Engineers to our team. Our Backend Engineers package our technology into models and last-mile quality tooling. They will be the driving force in building our products and solutions, in close collaboration with our ML Engineers and Front-end Engineers.
Responsibilities
API system development for copilot & agent quality tooling
API system development for copilot serving and integration, with a focus on enterprise-grade requirements in the following areas:
Integration with on-prem & cloud compute vendors
Integration with software tools required in customer-oriented solutions
Distributed-system and, optionally, GPU performance optimization
Wear many hats and collaborate with the whole team for product development, deployment and customer success
Qualifications
Experience in ML model or ML data pipeline deployment (on-prem or on cloud)
Experience in building backends for application or platform API systems
Experience in using or contributing to modern compute frameworks for LLMs (e.g. DeepSpeed, Hugging Face TGI, FSDP)
Experience in projects involving LLMs