Skip to content

zhixin612/papernotes-scheduling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

59 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Paper notes - Scheduling & Networked Systems

Materials

To read (learn)


2022.11

2022.10

  • InferLine: latency-aware provisioning and scaling for prediction serving pipelines

    • Problem

    • Insight

    • Solution

    • Other

  • [Spot instance] Tributary: spot-dancing for elastic services with latency SLOs

    • Transient Instance (AWS Spot Instance)
    • Trace: ClarkNet & WITS & ...
  • [Spot instance] Cocktail: A Multidimensional Optimization for Model Serving in Cloud

    • Ensemble Learning
    • Transient Instance
    • "DeepAR-estimator"
    • Trace: Wikipedia & tweet

2022.09

  • Twine: A Unified Cluster Management System for Shared Infrastructure
  • Shard Manager: A Generic Shard Management Framework for Geo-distributed Applications
  • Autopilot: workload autoscaling at Google
  • Piccolo: ---

2022


Template:

About

Summaries and notes on GPU Scheduling research papers

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published