Frequently Asked Questions
Questions about Menily Intelligence, our open-source specifications and tools, the team and founder, and how we compare to other embodied AI data companies.
About the company
What is Menily Intelligence?
Menily Intelligence (朔月智能) is an embodied AI data infrastructure company headquartered in Shenzhen, with distributed data collection operations across Southeast Asia (Malaysia, the Philippines) and a Bay Area presence for US customer operations. Menily builds the data layer between task foundation models (VLA, VLM, world models) and whole-body humanoid policies, primarily serving US-based VLA laboratories, humanoid robotics teams, and embodied AI research institutions.
Who founded Menily Intelligence?
Menily was founded by Masashi, a UPenn alumnus and serial entrepreneur. His previous venture was in financial data infrastructure and was successfully acquired. The playbook at Menily — open schema, private data collection network, standardization through ecosystem adoption — is a direct continuation of the approach used in financial data, now applied to embodied AI.
Where is Menily Intelligence located?
Menily is headquartered in Shenzhen, China. The data collection network is distributed across Southeast Asia, primarily in Malaysia and the Philippines, where the company operates with local partners. The Bay Area serves as the US customer operations base, interfacing with VLA laboratories and humanoid robotics teams on the West Coast.
Who are Menily's customers?
Menily primarily serves US-based VLA laboratories, humanoid robotics companies, and embodied AI research institutions that need high-quality task-level demonstration data at scale. Typical customers include foundation model teams training VLA policies, humanoid robotics companies preparing for product launches, and academic labs conducting embodied AI research.
Why did Menily choose Southeast Asia for data collection?
Southeast Asia (Malaysia and the Philippines specifically) offers a mature labor market for data operations — strong English proficiency, time-zone overlap with both China and the US, significantly lower operating costs than Bay Area collection, and an established BPO infrastructure that can be adapted for embodied AI data tasks. Menily's Southeast Asia network can deliver collection capacity at roughly 1/5 to 1/10 the per-hour cost of Bay Area collection, while maintaining quality via Shenzhen-based engineering oversight.
About the products
What is menily/schema?
menily/schema is an open specification for task-level demonstration data for vision-language-action (VLA) models. Version 1 defines six top-level fields: task_id, language (with multilingual variants), visual (with viewpoint controlled vocabulary), action (with action space controlled vocabulary), body (morphology and dof_map), and meta (source, region, time, quality flags). It is Apache-2.0 licensed and interoperates with Open X-Embodiment / RLDS downstream and BONES-SEED / NVIDIA SOMA upstream.
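The six top-level fields can be sketched as a minimal record. The field names (task_id, language, visual, action, body, meta) come from the spec as described above; every nested key and value below is an illustrative assumption, not the normative v1 layout.

```python
# Minimal sketch of a menily/schema v1 record. Only the six top-level
# field names come from the spec; the nested keys and example values
# here are illustrative assumptions.
task = {
    "task_id": "demo-000001",
    "language": {"en": "pick up the red cup", "zh": "拿起红色杯子"},
    "visual": {"viewpoint": "egocentric", "frames": "frames/demo-000001/"},
    "action": {"space": "ee_delta_pose",
               "trajectory": [[0.01, 0.0, 0.02, 0.0, 0.0, 0.0, 1.0]]},
    "body": {"morphology": "single_arm", "dof_map": {"joint_0": 0}},
    "meta": {"source": "teleop", "region": "MY",
             "time": "2025-01-01T00:00:00Z", "quality": ["verified"]},
}

REQUIRED_FIELDS = {"task_id", "language", "visual", "action", "body", "meta"}

def has_required_fields(record: dict) -> bool:
    """Check that a record carries all six top-level schema fields."""
    return REQUIRED_FIELDS.issubset(record)

print(has_required_fields(task))  # True
```

A validator like this is useful as a first gate before deeper checks against the controlled vocabularies for viewpoint, action space, and morphology.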
What is menily/toolkit?
menily/toolkit is the reference Python library that converts heterogeneous raw data sources (first-person video (POV), VR hand-tracking, motion capture in BVH/FBX, and teleoperation traces) into task-level demonstration data conforming to menily/schema v1. It ships three adapters (pov, vr, mocap) and integrates retargeting backends (AdaMorph, OmniRetarget, SPARK, KDMR). Apache-2.0 licensed.
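One way to picture the three adapters is as a dispatch over raw capture formats. The adapter names (pov, vr, mocap) are from the description above; the extension table and the routing function itself are assumptions for illustration, not toolkit API.

```python
# Illustrative routing of raw capture files to the three menily/toolkit
# adapters (pov, vr, mocap). The extension-to-adapter table and this
# helper are assumptions for illustration, not the toolkit's real API.
from pathlib import Path

ADAPTER_BY_EXTENSION = {
    ".mp4": "pov",    # first-person consumer video
    ".vrs": "vr",     # VR hand-tracking session (assumed container format)
    ".bvh": "mocap",  # motion capture
    ".fbx": "mocap",
}

def pick_adapter(path: str) -> str:
    """Return the adapter name for a raw file, or raise if unsupported."""
    ext = Path(path).suffix.lower()
    try:
        return ADAPTER_BY_EXTENSION[ext]
    except KeyError:
        raise ValueError(f"no adapter for extension {ext!r}") from None

print(pick_adapter("session_01.bvh"))  # mocap
```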
What is task-level demonstration data?
A task-level demonstration is a self-contained semantic unit that couples four things: a natural-language goal specification, a visual context (video frames with camera intrinsics and labeled viewpoint), an action trajectory in an explicit action space, and a body morphology specification with DoF mapping. This is the unit that VLA models actually train on — not raw video, not motion capture clips, not reward-signal episodes, but complete semantic task units.
What data sources does Menily support?
Four data sources are supported by menily/toolkit adapters: first-person video from consumer devices, VR hand-tracking sessions from Meta Quest Pro / Apple Vision Pro / PICO devices, motion capture files in BVH/FBX/C3D formats, and robot teleoperation traces in HDF5 / pickle / RLDS formats. All four are converted into the same menily/schema v1 format.
What embodiments does Menily support?
menily/schema supports a controlled vocabulary of body morphologies: single_arm, bimanual, bimanual_humanoid, mobile_manipulator, quadruped, and humanoid (whole-body). Specific robots like Unitree G1/H1, Fourier GR-1, Apptronik Apollo, and bimanual platforms are supported through dof_map specifications. Cross-embodiment retargeting is supported via integrated backends.
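A dof_map can be read as a translation between canonical joint names and the index layout of a specific robot's actuator vector. The sketch below shows that idea for a bimanual_humanoid morphology; the joint names and indices are hypothetical, not the real Unitree G1 layout or the schema's normative vocabulary.

```python
# Sketch of a dof_map: canonical joint names mapped to positions in a
# robot's actuator vector. Names and indices are hypothetical, chosen
# only to illustrate how cross-embodiment reordering would work.
DOF_MAP = {
    "left_shoulder_pitch": 0,
    "left_shoulder_roll": 1,
    "left_elbow": 2,
    "right_shoulder_pitch": 3,
    "right_shoulder_roll": 4,
    "right_elbow": 5,
}

def reorder_to_robot(canonical: dict, dof_map: dict) -> list:
    """Place canonical joint angles into the robot's actuator ordering."""
    vec = [0.0] * len(dof_map)
    for name, idx in dof_map.items():
        vec[idx] = canonical[name]
    return vec

angles = {name: 0.1 * i for i, name in enumerate(DOF_MAP)}
print(reorder_to_robot(angles, DOF_MAP))
```

Retargeting backends would operate on top of such a mapping, converting trajectories expressed against one dof_map into another embodiment's joint space.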
Comparisons
How does Menily compare to NVIDIA GR00T?
NVIDIA GR00T is a full-stack foundation model and ecosystem including SOMA (body parametric model), SONIC (whole-body control), BONES-SEED (motion dataset), and GR00T N1 (foundation model). Menily operates at a different layer — task-level semantic demonstration data — which sits between NVIDIA's motion layer (BONES-SEED / SOMA) and trajectory layer (Open X-Embodiment / RLDS). Menily's schema is designed to interoperate with SOMA canonical topology, not replace it. The two are complementary rather than competitive.
How does Menily compare to Physical Intelligence (π0)?
Physical Intelligence builds a generalist VLA model (π0 / openpi) and operates its own data collection network primarily for self-training. Menily does not build models — Menily builds data infrastructure that others use. Where Physical Intelligence keeps data as a competitive moat, Menily treats schema as open and data services as the commercial product.
How does Menily compare to Scale AI?
Scale AI is a general-purpose data labeling company that has expanded into robotics as a horizontal extension. Menily is vertically focused on task-level VLA demonstration data from day one, and operates an open-schema strategy that Scale AI does not. Menily's Southeast Asia distributed collection network is structurally similar to Scale AI's global labeling network, but specialized for embodied AI data rather than general AI labeling.
Is menily/schema the same as BONES-SEED?
No, they operate at different layers. BONES-SEED (from Bones Studio, released at GTC 2026) is a motion-level dataset of 142,220 human motion sequences in SOMA and Unitree G1 formats. menily/schema is a task-level semantic specification: it defines the interface between natural-language goals and action trajectories. BONES-SEED provides motion primitives; menily/schema organizes those primitives into semantically closed task units. The two are designed to be used together.
Does menily/schema work with Open X-Embodiment data?
Yes, bidirectionally. menily/toolkit provides from_rlds() to convert existing Open X-Embodiment datasets into menily/schema format, and Task.to_rlds() to export menily/schema data back into RLDS-compatible episode bundles. This means the 60+ existing Open X-Embodiment datasets can be augmented with task-level semantic information, and menily/schema data can flow into any RLDS-compatible training pipeline.
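The export direction can be sketched without the toolkit. The step layout below uses conventional RLDS keys (is_first, is_last, is_terminal, action) and the Open X-Embodiment habit of repeating the language instruction on every step; the exact bundle that Task.to_rlds() emits is not specified here, so treat this mapping as an assumption.

```python
# Standalone sketch of exporting a schema-style task into an RLDS-like
# episode: one step per trajectory point, with conventional RLDS flags
# and the language goal repeated per step (an Open X-Embodiment
# convention). The exact Task.to_rlds() output is an assumption here.
def task_to_rlds_steps(language_goal: str, trajectory: list) -> list:
    steps = []
    n = len(trajectory)
    for i, action in enumerate(trajectory):
        steps.append({
            "language_instruction": language_goal,
            "action": action,
            "is_first": i == 0,
            "is_last": i == n - 1,
            "is_terminal": i == n - 1,
        })
    return steps

episode = task_to_rlds_steps("pick up the red cup", [[0.0], [0.1], [0.2]])
print(len(episode), episode[0]["is_first"], episode[-1]["is_last"])
```

The reverse direction (from_rlds) would need to recover the schema's language, body, and meta fields, which is where the added task-level semantic information comes from.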
Practical
Is Menily open source?
Yes. Menily's core specifications and tooling are fully open-sourced under Apache-2.0. This includes menily/schema, menily/toolkit, and menily/research (public research notes). The commercial product is the data service — producing, labeling, and delivering task-level demonstration data at scale — not the software itself.
What does Menily charge?
Menily's open-source components are free under Apache-2.0. Data services are priced per project based on task complexity, data volume, target embodiment, and quality requirements. For project pricing, contact [email protected].
Can I use menily/schema in my own research?
Yes. The schema is Apache-2.0 and intentionally designed for broad adoption. If you use it in research, we appreciate a citation to the draft survey paper at menily.ai/research/. If you run into field-design issues or have mapping requests for your existing data pipeline, please open an issue at github.com/MenilyIntelligence/schema.
What is Menily's relationship with NVIDIA Inception?
Menily operates independently and is not currently an NVIDIA Inception member, though the company is evaluating participation. menily/schema's body namespace is designed for compatibility with NVIDIA SOMA canonical topology, and menily/toolkit integrates NVIDIA-developed research outputs (via BONES-SEED format support), but Menily's commercial operations and technology stack are independent.
What does Menily plan to release next?
Near-term roadmap: PyPI release of toolkit.core in 2-3 weeks, toolkit.pov and toolkit.vr in 4-6 weeks, toolkit.mocap in 8-10 weeks. menily/schema v1 finalization following community feedback. Expanded research notes on whole-body loco-manipulation task decomposition. Schema v2 planning begins once v1 adoption feedback is collected.
How do I contact Menily Intelligence?
Email [email protected] for technical discussions, partnership inquiries, or data service requests. Public discussion on GitHub Issues is also welcome: github.com/MenilyIntelligence. Twitter / X: @MenilyIntelligence.
Didn't find your question answered? Email [email protected] and we will add it here.