本文提供兩種爬取天貓商品數據的技術方案,并引入輕易云集成平臺以優化數據處理流程:
注冊開發者賬號:open.taobao.com
申請接口權限:taobao.item.get:商品詳情 taobao.items.search:商品搜索
安裝SDK:
pip install alibabacloud-python-sdk-core
簽名請求示例:
from alibabacloud_tea_openapi import models as open_api_models from alibabacloud_tbk_dg import models as tbk_dg_models from alibabacloud_tbk_dg.client import Client config = open_api_models.Config( app_key="your_app_key", app_secret="your_app_secret", region_id="cn-hangzhou") client = Client(config) request = tbk_dg_models.TbkItemInfoGetRequest( num_iid=654321 # 商品ID) response = client.tbk_item_info_get(request) print(response.body)
GET https://h5api.m.taobao.com/h5/mtop.taobao.detail.getdetail/6.0/
{ "itemNumId": "654321", "isForbidBuyItem": false, "isForbidPromotion": false, "sellerId": "123456"}
import requests import execjs # JS解密邏輯(需根據實際逆向代碼調整) with open('sign.js', 'r') as f: sign_js = f.read() ctx = execjs.compile(sign_js) sign = ctx.call('generate_sign', params) headers = { 'User-Agent': 'Mozilla/5.0 (iPhone; CPU iPhone OS 14_6 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/14.1.1 Mobile/15E148 Safari/604.1', 'Cookie': 't=your_cookie; cna=xxx;'} url = f'https://h5api.m.taobao.com/h5/mtop.taobao.detail.getdetail/6.0/?sign={sign}&data={data}' response = requests.get(url, headers=headers) print(response.json())
from requests_rotating_proxy import RotatingProxyManager proxy_manager = RotatingProxyManager([ 'http://user:pass@proxy1.com:8080', 'http://user:pass@proxy2.com:8080']) response = requests.get(url, proxies=proxy_manager.get_proxy())
import time import random # 隨機延遲(0.5-2秒) time.sleep(random.uniform(0.5, 2.0)) # 模擬真實用戶行為 # 可結合selenium/playwright添加鼠標移動、點擊等操作
data = response.json() item_info = { '商品ID': data['itemNumId'], '標題': data['title'], '價格': data['price']['viewPrice'], '銷量': data['soldQuantity'], '評論數': data['reviewCount']}
通過引入輕易云集成平臺,無論是官方API方案還是非官方接口實戰,都能在數據爬取、處理、存儲和分析的全流程中獲得更高效、更合規、更穩定的解決方案,為企業和開發者在天貓商品數據領域的需求提供有力支持。其他電商平臺對接也是如此,根據API規則和電商平臺規則進行接入即可。不過輕易云還是建議電商企業購買成熟的電商系統吉客云、旺店通、聚水潭、管易云等,因為成熟的電商系統都已經對接好了80%的電商平臺。無需一個一個電商平臺進行對接,也可以給企業減少不少的開發成本。
原文轉載自:https://mp.weixin.qq.com/s/5RnlizLfyNAZMi1OOnCE-g