1. 03 Dec, 2025 1 commit
    • multi-person & animate & podcast (#554) · f7cdbcb5
      LiangLiu authored
      
      
      - New service features (frontend + backend):
      1. The seko-talk model supports multi-person input
      2. Podcast synthesis and management
      3. Support for the wan2.2 animate model
      
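      - New backend APIs:
      1. A podcast synthesis WebSocket API based on Volcano (火山), which supports listening while synthesis is still in progress
      2. Podcast query and management APIs
      3. A YOLO-based multi-person face detection API
      4. A multi-speaker audio splitting API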
      
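      - Invasive changes to the inference code:
      1. Moved the animate-related input file paths (mask/image/pose, etc.) out of the hard-coded config and into the variable input_info
      2. Wrapped the animate preprocessing code into an interface for use by the service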
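The first inference-code change above (paths moved from a fixed config into a per-request input_info) can be pictured with a minimal sketch. Every name here (`AnimateInputInfo`, `build_input_info`, `STATIC_CONFIG`, and the field and key names) is hypothetical and not taken from the repository. The sketch only illustrates that per-request file paths travel with the request while the model config stays shared.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AnimateInputInfo:
    """Per-request inputs (hypothetical field names, for illustration only)."""

    image_path: str
    pose_path: Optional[str] = None
    mask_path: Optional[str] = None


def build_input_info(request: dict) -> AnimateInputInfo:
    """Build input info from a request payload rather than from a static config."""
    if "image_path" not in request:
        raise ValueError("image_path is required")
    return AnimateInputInfo(
        image_path=request["image_path"],
        pose_path=request.get("pose_path"),
        mask_path=request.get("mask_path"),
    )


# The model config stays static and is shared by all requests.
# The key and value below are assumptions made for this sketch.
STATIC_CONFIG = {"model_cls": "wan2.2_animate"}

# Two requests carry different paths without touching the shared config.
info_a = build_input_info({"image_path": "a.png", "pose_path": "a_pose.mp4"})
info_b = build_input_info({"image_path": "b.png", "mask_path": "b_mask.mp4"})
```

With the paths on the request object, one long-running worker can serve many tasks without its config being rewritten between them.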
      
      @xinyiqin
      
      ---------
      Co-authored-by: qinxinyi <qxy118045534@163.com>
  2. 10 Oct, 2025 1 commit
    • deploy update (#355) · 5ffdbeb6
      LiangLiu authored
      1. Frontend: Vue + Vite
      2. Share tasks & templates
      3. x264 RTC stream push
  3. 22 Sep, 2025 1 commit
  4. 16 Sep, 2025 1 commit
  5. 03 Sep, 2025 1 commit
    • Deploy server and worker (#284) · ab1b2790
      LiangLiu authored
      
      
      * Init deploy: not ok
      
      * Test data_manager & task_manager
      
      * pipeline is not needed for worker
      
      * Update worker text_encoder
      
      * deploy: submit task
      
      * add apis
      
      * Test pipelineRunner
      
      * Fix pipeline
      
      * Tidy worker & test PipelineWorker ok
      
      * Tidy code
      
      * Fix multi_stage for wan2.1 t2v & i2v
      
      * api query task, get result & report subtasks failed when workers stop
      
      * Add model list functionality to Pipeline and API
      
      * Add task cancel and task resume to API
      
      * Add RabbitMQ queue manager
      
      * update local task manager atomic
      
      * support PostgreSQL task manager, add lifespan async init
      
      * worker print -> logger
      
      * Add S3 data manager, delete temp objects after finished.
      
      * fix worker
      
      * release fetch queue msg when closed, run stuck worker in another thread, stop worker when process down.
      
      * DiTWorker run with thread & tidy logger print
      
      * Init monitor without test
      
      * fix monitor
      
      * github OAuth and jwt token access & static demo html page
      
      * Add user to task, ok for local task manager & update demo ui
      
      * sql task manager support users
      
      * task list with pages
      
      * merge main fix
      
      * Add proxy for auth request
      
      * support wan audio
      
      * worker ping subtask and ping life, fix rabbitmq async get
      
      * s3 data manager with async api & tidy monitor config
      
      * fix merge main & update req.txt & fix html view video error
      
      * Fix distributed worker
      
      * Limit user visit freq
      
      * Tidy
      
      * Fix only rank save
      
      * Fix audio input
      
      * Fix worker fetch None
      
      * index.html abs path to rel path
      
      * Fix dist worker stuck
      
      * support publish output video to rtmp & graceful stop running dit step or segment step
      
      * Add VAReader
      
      * Enhance VAReader with torch dist
      
      * Fix audio stream input
      
      * fix merge refactor main, support stream input_audio and output_video
      
      * fix audio read with prev frames & fix end take frames & tidy worker end
      
      * split audio model to 4 workers & fix audio end frame
      
      * fix ping subtask with queue
      
      * Fix audio worker put block & add whep, whip without test ok
      
      * Tidy va recorder & va reader log, thread cancel within 30s
      
      * Fix dist worker stuck: broadcast stop signal
      
      * Tidy
      
      * record task active_elapse & subtask status_elapse
      
      * Design prometheus metrics
      
      * Tidy prometheus metrics
      
      * Fix merge main
      
      * send sigint to ffmpeg process
      
      * Fix gstreamer pull audio by whep & Dockerfile for gstreamer & check params when submitting
      
      * Fix merge main
      
      * Query task with more info & va_reader buffer size = 1
      
      * Fix va_recorder
      
      * Add config for prev_frames
      
      * update frontend
      
      * update frontend
      
      * update frontend
      
      * update frontend
      merge
      
      * update frontend & partial backend
      
      * Different rank for va_recorder and va_reader
      
      * Fix mem leak: only one rank publish video, other rank should pop gen vids
      
      * fix task category
      
      * va_reader pre-alloc tensor & va_recorder send frames all & fix dist cancel infer
      
      * Fix prev_frame_length
      
      * Tidy
      
      * Tidy
      
      * update frontend & backend
      
      * Fix lint error
      
      * recover some files
      
      * Tidy
      
      * lint code
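One item in the list above limits how often a user may call the service. The commit message does not say how that limit is implemented. The sketch below assumes a per-user sliding-window counter, which is one common approach, and every name in it (`SlidingWindowLimiter`, `allow`, the parameters) is hypothetical.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most max_requests per user within window_seconds (illustrative only)."""

    def __init__(self, max_requests, window_seconds, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the limiter can be tested without sleeping
        self._hits = defaultdict(deque)  # user_id -> timestamps of accepted requests

    def allow(self, user_id):
        """Return True and record the request if the user is under the limit."""
        now = self.clock()
        hits = self._hits[user_id]
        # Drop timestamps that have fallen out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False
        hits.append(now)
        return True
```

This version keeps its state in process memory, so it only holds for a single server process. A deployment with several API servers would need shared storage for the counters.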
      
      ---------
      Co-authored-by: liuliang1 <liuliang1@sensetime.com>
      Co-authored-by: unknown <qinxinyi@sensetime.com>