METHOD OF PROVIDING MODEL SERVICES

Number of patents in Portfolio can not be more than 2000

United States of America

APP PUB NO 20240419991A1
SERIAL NO

18747725

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method is provided that includes: creating a plurality of first model instances of a first service model to be deployed; allocating an inference service for each of a plurality of first model instances from the plurality of inference services; calling, for each first model instance, a loading interface of the inference service allocated for the first model instance to mount a weight file; determining, in response to a user request for a target service model, a target model instance from a plurality of model instances of the target service model to respond to the user request; and calling a target inference service allocated for the target model instance to use computing resources configured for the target inference service to run, in the target model instance, a base model mounted with a target weight file, and obtain a request result of the user request.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO LTD2/F BAIDU CAMPUS NO 10 SHANGDI 10TH STREET HAIDIAN DISTRICT BEIJING 100085 100085

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
CHU, Zhenfang BEIJING, CN 5 0
HU, Mingren BEIJING, CN 8 5
HUANG, Yue BEIJING, CN 82 604
LI, Jinqi BEIJING, CN 24 3
LUO, Yang BEIJING, CN 93 348
QIAN, Yang BEIJING, CN 6 10
QIAN, Zhengyu BEIJING, CN 10 4
SHI, En BEIJING, CN 20 1
WANG, Guobin BEIJING, CN 7 14
WANG, Kuan BEIJING, CN 10 29
YUAN, Zhengxiong BEIJING, CN 5 0

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation