Machine Learning Engineer - Language Models

Melwy-company-logo
Machine Learning Engineer - Language Models
Melwy
Software Engineer
Central and Western, Hong Kong
7 days ago
Full Time
Onsite
Technology, Information and Media
Job Description
16 days ago
About Us
We're an agile team pushing the boundaries of AI technology. Our mission is to revolutionize the field of large language models (LLMs) and make advanced AI more open and accessible to all.

- We're passionate, innovative, and thrive on collaborative problem-solving
- We embrace a culture of continuous learning and intellectual curiosity
- Our team consists of diverse, talented individuals from around the globe
- We offer a fully remote work environment, allowing you to work from anywhere

Role Summary
As an LLM Research Engineer, you'll be at the forefront of our AI development efforts:

- Drive the development and optimization of our cutting-edge LLM systems
- Collaborate closely with our research team to accelerate breakthroughs
- Contribute to the entire LLM lifecycle, from conceptualization to deployment

Key Responsibilities
- Develop and implement state-of-the-art LLM architectures and training methods
- Design and execute experiments to push the boundaries of LLM capabilities
- Create efficient, scalable code for LLM training and inference
- Build tools and infrastructure to support rapid prototyping and research
- Bridge the gap between research concepts and practical applications
- Stay current with the latest advancements in LLM research and contribute to the field

Qualifications & Profile
- PhD or Postdocs in Machine Learning, Computer Sciences, Numerical Analysis, Functional Analysis, Signal Processing, Control Theory, Statistics, Dynamical Systems, Mathematics, Statistical Physics, Neurosciences, or other quantitative fields, or equivalent experiences in industry
- Big Tech experience is a plus (Google, Microsoft, Meta, Baidu...)
- Strong background in deep learning, natural language processing, and LLMs
- Experience with LLM frameworks (e.g., PyTorch, TensorFlow)
- Proficiency in CUDA (for Nvidia GPUs), XLA (for Google TPUs), or AWS Neuron (for AWS Inferentia/Trainium)
- Familiarity with distributed computing and large-scale model training
- Track record of contributions to research projects or publications in the field of AI/ML
- Self-motivated with the ability to work independently in a remote environment with minimal supervision
- Strong online communication skills and a collaborative mindset
- Passion for pushing the boundaries of AI technology

What We Offer
- Opportunity to work on groundbreaking LLM technology
- Fully remote work environment with flexible hours
- Competitive salary and equity package
- Regular virtual team-building events and knowledge-sharing sessions
- Support for continued learning and professional development
- Chance to make a significant impact in a rapidly growing field

Interested candidates should submit their resume, which includes their project portfolio and publication list.

We are an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Share to
More jobs like this
Goodnotes-company-logo
Machine Learning Engineer, LLM
Goodnotes
Central and Western, Hong Kong
BIGO-company-logo
Large Language Model Algorithm Engineer
BIGO
Central and Western, Hong Kong
SPRINGER Professional Group Limited-company-logo
Data engineering, Director / Sr Engineer / Engineer, Multiple openings – New Team
SPRINGER Professional Group Limited
Central and Western, Hong Kong
G2i Inc.-company-logo
Software Developer for Training AI Data (Python)
G2i Inc.
Central and Western, Hong Kong
Jane Street-company-logo
**Senior Machine Learning Architect**
Jane Street
Central and Western, Hong Kong