當前位置：首頁 > 编程语言 > python >内容正文

python

asyncio并发数_Python Futures并发编程详解

發布時間：2025/4/5 python 38 豆豆

生活随笔收集整理的這篇文章主要介紹了 asyncio并发数_Python Futures并发编程详解小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

無論哪門編程語言，并發編程都是一項很常用很重要的技巧。例如，爬蟲就被廣泛應用在工業界的各個領域，我們每天在各個網站、各個 App 上獲取的新聞信息，很大一部分便是通過并發編程版的爬蟲獲得。正確合理地使用并發編程，無疑會給程序帶來極大的性能提升。因此，本節就帶領大家一起學習 Python 中的 Futures 并發編程。首先，先帶領大家從代碼的角度來理解并發編程中的 Futures，并進一步來比較其與單線程的性能區別。假設有這樣一個任務，要下載一些網站的內容并打印，如果用單線程的方式，它的代碼實現如下所示(為了突出主題，對代碼做了簡化，忽略了異常處理)：

import requestsimport timedef download_one(url): resp = requests.get(url) print('Read {} from {}'.format(len(resp.content), url)) def download_all(sites): for site in sites: download_one(site)def main(): sites = [ 'http://c.biancheng.net', 'http://c.biancheng.net/c', 'http://c.biancheng.net/python' ] start_time = time.perf_counter() download_all(sites) end_time = time.perf_counter() print('Download {} sites in {} seconds'.format(len(sites), end_time - start_time)) if __name__ == '__main__': main()

輸出結果為：

Read 52053 from http://c.biancheng.net
Read 30718 from http://c.biancheng.net/c
Read 34470 from http://c.biancheng.net/python
Download 3 sites in 0.3537296 seconds

注意，此程序中，requests 模塊需單獨安裝，可通過執行 pip install requests 命令進行安裝。

這種方式應該是最直接也最簡單的：

先是遍歷存儲網站的列表；
然后對當前網站執行下載操作；
等到當前操作完成后，再對下一個網站進行同樣的操作，一直到結束。

可以看到，總共耗時約 0.35s。單線程的優點是簡單明了，但是明顯效率低下，因為上述程序的絕大多數時間都浪費在了 I/O 等待上。程序每次對一個網站執行下載操作，都必須等到前一個網站下載完成后才能開始。如果放在實際生產環境中，我們需要下載的網站數量至少是以萬為單位的，不難想象，這種方案根本行不通。接著再來看多線程版本的代碼實現：

import concurrent.futuresimport requestsimport threadingimport timedef download_one(url): resp = requests.get(url) print('Read {} from {}'.format(len(resp.content), url))def download_all(sites): with concurrent.futures.ThreadPoolExecutor(max_workers=5) as executor: executor.map(download_one, sites)def main(): sites = [ 'http://c.biancheng.net', 'http://c.biancheng.net/c', 'http://c.biancheng.net/python' ] start_time = time.perf_counter() download_all(sites) end_time = time.perf_counter() print('Download {} sites in {} seconds'.format(len(sites), end_time - start_time))if __name__ == '__main__': main()

運行結果為：

Read 52053 from http://c.biancheng.net
Read 30718 from http://c.biancheng.net/c
Read 34470 from http://c.biancheng.net/python
Download 3 sites in 0.1606366 seconds

可以看到，總耗時是 0.2s 左右，效率一下子提升了很多。

注意，雖然線程的數量可以自己定義，但是線程數并不是越多越好，因為線程的創建、維護和刪除也會有一定的開銷，所以如果設置的很大，反而可能會導致速度變慢。我們往往需要根據實際的需求做一些測試，來尋找最優的線程數量。

上面兩段代碼中，多線程版本和單線程版的主要區別在于如下代碼：

with concurrent.futures.ThreadPoolExecutor(max_workers=5) as executor:

executor.map(download_one, sites)

這里創建了一個線程池，總共有 5 個線程可以分配使用。executer.map() 與前面所講的 Python 內置的 map() 函數類似，表示對 sites 中的每一個元素并發地調用函數 download_one()。在 download_one() 函數中使用的 requests.get() 方法是線程安全的，在多線程的環境下也可以安全使用，不會出現條件競爭(多個線程同時競爭使用同一資源)的情況。當然，也可以用并行的方式去提高程序運行效率，只需要在 download_all() 函數中做出下面的變化即可：

with futures.ThreadPoolExecutor(workers) as executor

#=>

with futures.ProcessPoolExecutor() as executor:

這部分代碼中，函數 ProcessPoolExecutor() 表示創建進程池，使用多個進程并行的執行程序。不過，這里通常省略參數 workers，因為系統會自動返回 CPU 的數量作為可以調用的進程數。但是，并行的方式一般用在 CPU heavy 的場景中，因為對于 I/O heavy 的操作，多數時間都會用于等待，相比于多線程，使用多進程并不會提升效率。反而很多時候，因為 CPU 數量的限制，會導致其執行效率不如多線程版本。

什么是Futures？

Python Futures 模塊，位于 concurrent.futures 和 asyncio 中，它們都表示帶有延遲的操作。Futures 會將處于等待狀態的操作包裹起來放到隊列中，這些操作的狀態隨時可以查詢，當然它們的結果(或是異常)也能夠在操作完成后被獲取。通常來說，用戶不用考慮如何去創建 Futures，這些 Futures 底層都會幫我們處理好，唯一要做的只是去設定這些 Futures 的執行。比如，Futures 中的 Executor 類，當執行 executor.submit(func) 時，它便會安排里面的 func() 函數執行，并返回創建好的 future 實例，以便之后查詢調用。這里再介紹一些常用的函數。比如 Futures 中的方法 done()，表示相對應的操作是否完成，返回 True 表示完成；返回 False 表示沒有完成。不過要注意的是，done() 是非阻塞的，會立即返回結果。相對應的 add_done_callback(fn)，則表示 Futures 完成后，相對應的參數函數 fn 會被通知并執行調用。Futures 中還有一個重要的函數 result()，它表示當 future 完成后，返回其對應的結果或異常。而 as_completed(fs)，則是針對給定的 future 迭代器 fs，在其完成后返回完成后的迭代器。所以，上述例子也可以寫成下面的形式：

import concurrent.futuresimport requestsimport timedef download_one(url): resp = requests.get(url) print('Read {} from {}'.format(len(resp.content), url))def download_all(sites): with concurrent.futures.ThreadPoolExecutor(max_workers=5) as executor: to_do = [] for site in sites: future = executor.submit(download_one, site) to_do.append(future) for future in concurrent.futures.as_completed(to_do): future.result()def main(): sites = [ 'http://c.biancheng.net', 'http://c.biancheng.net/c', 'http://c.biancheng.net/python' ] start_time = time.perf_counter() download_all(sites) end_time = time.perf_counter() print('Download {} sites in {} seconds'.format(len(sites), end_time - start_time))if __name__ == '__main__': main()

運行結果為：

Read 52053 from http://c.biancheng.net
Read 34470 from http://c.biancheng.net/python
Read 30718 from http://c.biancheng.net/c
Download 3 sites in 0.2275894 seconds

此程序中，首先調用 executor.submit()，將下載每一個網站的內容都放進 future 隊列 to_do 等待執行。然后是 as_completed() 函數在 future 完成后便輸出結果。不過，這里要注意，future 列表中每個 future 完成的順序和它在列表中的順序并不一定完全一致。到底哪個先完成、哪個后完成，取決于系統的調度和每個 future 的執行時間。

《新程序員》：云原生和全面數字化實踐50位技術專家共同創作，文字、視頻、音頻交互閱讀

總結

以上是生活随笔為你收集整理的asyncio并发数_Python Futures并发编程详解的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

上一篇：息烽小寨坝附近租房一室一厅500元一个月
下一篇： python使用字典实现switch_p

日韩av黄I国产麻豆传媒I国产91av视频在线观看I日韩一区二区三区在线看I美女国产在线I麻豆视频国产在线观看I成人黄色短片

python

asyncio并发数_Python Futures并发编程详解

什么是Futures？

總結