Is Melwy hiring Machine Learning Engineer - Language Models?

Yes, Melwy is hiring Machine Learning Engineer - Language Models. Apply now at TechJobAsia

Machine Learning Engineer - Language Models Opening at Melwy in Central and Western, Hong Kong, China

Find Jobs

News & Insight

Machine Learning Engineer - Language Models

Melwy

Software Engineer

Central and Western, Hong Kong, China

7 days ago

Full Time

Onsite

Retail

Job Description

7 days ago

About Us
We're an agile team pushing the boundaries of AI technology. Our mission is to revolutionize the field of large language models (LLMs) and make advanced AI more open and accessible to all.

- We're passionate, innovative, and thrive on collaborative problem-solving
- We embrace a culture of continuous learning and intellectual curiosity
- Our team consists of diverse, talented individuals from around the globe
- We offer a fully remote work environment, allowing you to work from anywhere

Role Summary
As an LLM Research Engineer, you'll be at the forefront of our AI development efforts:

- Drive the development and optimization of our cutting-edge LLM systems
- Collaborate closely with our research team to accelerate breakthroughs
- Contribute to the entire LLM lifecycle, from conceptualization to deployment

Key Responsibilities
- Develop and implement state-of-the-art LLM architectures and training methods
- Design and execute experiments to push the boundaries of LLM capabilities
- Create efficient, scalable code for LLM training and inference
- Build tools and infrastructure to support rapid prototyping and research
- Bridge the gap between research concepts and practical applications
- Stay current with the latest advancements in LLM research and contribute to the field

Qualifications & Profile
- PhD or Postdocs in Machine Learning, Computer Sciences, Numerical Analysis, Functional Analysis, Signal Processing, Control Theory, Statistics, Dynamical Systems, Mathematics, Statistical Physics, Neurosciences, or other quantitative fields, or equivalent experiences in industry
- Big Tech experience is a plus (Google, Microsoft, Meta, Baidu...)
- Strong background in deep learning, natural language processing, and LLMs
- Experience with LLM frameworks (e.g., PyTorch, TensorFlow)
- Proficiency in CUDA (for Nvidia GPUs), XLA (for Google TPUs), or AWS Neuron (for AWS Inferentia/Trainium)
- Familiarity with distributed computing and large-scale model training
- Track record of contributions to research projects or publications in the field of AI/ML
- Self-motivated with the ability to work independently in a remote environment with minimal supervision
- Strong online communication skills and a collaborative mindset
- Passion for pushing the boundaries of AI technology

What We Offer
- Opportunity to work on groundbreaking LLM technology
- Fully remote work environment with flexible hours
- Competitive salary and equity package
- Regular virtual team-building events and knowledge-sharing sessions
- Support for continued learning and professional development
- Chance to make a significant impact in a rapidly growing field

Interested candidates should submit their resume, which includes their project portfolio and publication list.

We are an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Share to