537
edits
Line 32: | Line 32: | ||
==List of Papers == | ==List of Papers == | ||
* | *2. Network for Distributed Learning | ||
**(1) Is Network the Bottleneck of Distributed Training? [https://dl.acm.org/doi/pdf/10.1145/3405671.3405810] Sigcomm workshop 20 | **(1) Is Network the Bottleneck of Distributed Training? [https://dl.acm.org/doi/pdf/10.1145/3405671.3405810] Sigcomm workshop 20 | ||
**(2) Domain-specific Communication Optimization for Distributed DNN Training [https://arxiv.org/pdf/2008.08445.pdf] arxiv | **(2) Domain-specific Communication Optimization for Distributed DNN Training [https://arxiv.org/pdf/2008.08445.pdf] arxiv | ||
**(3) PipeDream: generalized pipeline parallelism for DNN training[https://dl-1acm-1org-12xpm0nrgacbe.han.sub.uni-goettingen.de/doi/pdf/10.1145/3341301.3359646] SOSP ’19 | **(3) PipeDream: generalized pipeline parallelism for DNN training[https://dl-1acm-1org-12xpm0nrgacbe.han.sub.uni-goettingen.de/doi/pdf/10.1145/3341301.3359646] SOSP ’19 | ||
* | *3. Network | ||
**(1) Neural Packet Routing [https://dl.acm.org/doi/pdf/10.1145/3405671.3405813] Sigcomm workshop 20 | **(1) Neural Packet Routing [https://dl.acm.org/doi/pdf/10.1145/3405671.3405813] Sigcomm workshop 20 | ||
**(2) OmniMon: Re-architecting Network Telemetry with Resource Efficiency and Full Accuracy [https://dl.acm.org/doi/pdf/10.1145/3387514.3405877] Sigcomm 20 | **(2) OmniMon: Re-architecting Network Telemetry with Resource Efficiency and Full Accuracy [https://dl.acm.org/doi/pdf/10.1145/3387514.3405877] Sigcomm 20 | ||
**(3) Swift: Delay is Simple and Effective for Congestion Control in the Datacenter[https://dl.acm.org/doi/pdf/10.1145/3387514.3406591] Sigcomm 20 | **(3) Swift: Delay is Simple and Effective for Congestion Control in the Datacenter[https://dl.acm.org/doi/pdf/10.1145/3387514.3406591] Sigcomm 20 | ||
* | *4. AI for Network | ||
**(1) SmartEntry: Mitigating Routing Update Overhead with Reinforcement Learning for Traffic Engineering [https://dl.acm.org/doi/pdf/10.1145/3405671.3405813] Sigcomm workshop 20 | **(1) SmartEntry: Mitigating Routing Update Overhead with Reinforcement Learning for Traffic Engineering [https://dl.acm.org/doi/pdf/10.1145/3405671.3405813] Sigcomm workshop 20 | ||
**(2) Event-Triggered Communication Network with Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning [https://arxiv.org/pdf/2010.04978.pdf] AAAI 21 | **(2) Event-Triggered Communication Network with Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning [https://arxiv.org/pdf/2010.04978.pdf] AAAI 21 |
edits