保护隐私的无服务器边缘学习与分散的小数据
在过去的十年中,数据驱动的算法在许多研究领域胜过了传统的基于优化的算法,如计算机视觉、自然语言处理等。然而,广泛的数据使用给深度学习算法带来了新的挑战,甚至是威胁,即隐私保护。分布式训练策略最近成为一种有前途的方法,以确保训练深度模型时的数据隐私。本文用无服务器边缘学习架构扩展了传统的无服务器平台,并从网络角度提供了一个高效的分布式训练框架。该框架在异构物理单元之间动态地协调可用资源,以有效地实现深度学习目标。该设计共同考虑了学习任务请求和底层基础设施的异质性,包括最后一英里的传输、移动设备的计算能力、边缘和云计算中心以及设备电池状态。此外,为了大大减少分布式训练的开销,通过与一般的、简单的数据分类器整合,提出了小规模的数据训练。这种低负荷的增强可以与各种分布式深度模型无缝对接,以提高训练阶段的通信和计算效率。最后,开放的挑战和未来的研究方向鼓励研究界开发高效的分布式深度学习技术。
原文题目:Privacy-Preserving Serverless Edge Learning with Decentralized Small Data
原文:In the last decade, data-driven algorithms outperformed traditional optimization-based algorithms in many research areas, such as computer vision, natural language processing, etc. However, extensive data usages bring a new challenge or even threat to deep learning algorithms, i.e., privacy-preserving. Distributed training strategies have recently become a promising approach to ensure data privacy when training deep models. This paper extends conventional serverless platforms with serverless edge learning architectures and provides an efficient distributed training framework from the networking perspective. This framework dynamically orchestrates available resources among heterogeneous physical units to efficiently fulfill deep learning objectives. The design jointly considers learning task requests and underlying infrastructure heterogeneity, including last-mile transmissions, computation abilities of mobile devices, edge and cloud computing centers, and devices battery status. Furthermore, to significantly reduce distributed training overheads, small-scale data training is proposed by integrating with a general, simple data classifier. This low-load enhancement can seamlessly work with various distributed deep models to improve communications and computation efficiencies during the training phase. Finally, open challenges and future research directions encourage the research community to develop efficient distributed deep learning techniques.
相关文章
- 金融服务领域的大数据:即时分析
- 影响大数据、机器学习和人工智能未来发展的8个因素
- 从0开始构建一个属于你自己的PHP框架
- 如何将Hadoop集成到工作流程中?这6个优秀实践必看
- SEO公司使用大数据优化其模型的5种方法
- 关于Web Workers你需要了解的七件事
- 深入理解HTTPS原理、过程与实践
- 增强分析:数据和分析的未来
- PHP协程实现过程详解
- AI专家:大数据知识图谱——实战经验总结
- 关于PHP的错误机制总结
- 利用数据分析量化协同过滤算法的两大常见难题
- 怎么做大数据工作流调度系统?大厂架构师一语点破!
- 2019大数据处理必备的十大工具,从Linux到架构师必修
- OpenCV中的KMeans算法介绍与应用
- 教大家如果搭建一套phpstorm+wamp+xdebug调试PHP的环境
- CentOS下三种PHP拓展安装方法
- Go语言HTTP Server源码分析
- Go语言HTTP Server源码分析
- 2017年4月编程语言排行榜:Hack首次进入前五十