1. Umarnin Neman Bayanai na Farko (Basic Fetching)
openclaw fetch
Yana neman shafin yanar gizo guda ɗaya. Wannan shine matakin farko don samun abun ciki daga shafi.
Yi amfani da shi don shafukan da ba sa buƙatar ƙarin saituna ko mu'amala.
openclaw fetch --depth
Yana neman shafin yanar gizo da duk hanyoyin haɗin da ke ciki har zuwa mataki na N. Da amfani don tattara shafuka masu yawa.
Kiyaye zurfin neman ku don guje wa nauyi mai yawa akan sabar.
openclaw fetch --proxy
Yana neman shafin yanar gizo ta amfani da wakili (proxy) da aka bayar. Yana da mahimmanci don ɓoye adireshin IP ko samun damar shafukan da aka iyakance.
Yi amfani da jerin wakilai masu kyau don guje wa katange.
openclaw fetch --headers '{"User-Agent": "Mozilla/5.0"}'
Yana neman shafin yanar gizo tare da ƙarin kanun takardu na HTTP. Yana taimakawa wajen kwaikwayi burauzar gaske.
Saita 'User-Agent' don kauce wa ganowa a matsayin bot.
openclaw fetch --cookies 'key=value'
Yana neman shafin yanar gizo tare da kukis da aka bayar. Da amfani don shafukan da ke buƙatar shiga ko kiyaye zaman.
Kuna iya fitar da kukis daga burauzar ku don amfani anan.
2. Umarnin Cire Bayanai (Data Extraction)
openclaw extract --selector "h1.title" --url
Yana cire abun ciki daga shafin yanar gizo ta amfani da mai zaɓin CSS. Mafi sauƙi kuma mafi sauri ga yawancin ayyuka.
Yi amfani da kayan aikin binciken burauza don samun zaɓin CSS daidai.
openclaw extract --xpath "//div[@class='item']/p" --url
Yana cire abun ciki daga shafin yanar gizo ta amfani da hanyar XPath. Mafi ƙarfi don tsarin bayanai masu rikitarwa.
XPath yana da sassauci fiye da CSS selector don zurfafan tsarin DOM.
openclaw extract --regex "/\d{4}-\d{2}-\d{2}/" --url
Yana cire abun ciki ta amfani da maganar yau da kullun (regular expression). Da amfani don tsarin rubutu na musamman.
Yi gwajin regex ɗin ku a wani wuri kafin amfani da shi a OpenClaw.
openclaw extract --table-auto --url
Yana gano kuma yana cire teburin bayanai daga shafin yanar gizo ta atomatik. Yana da amfani sosai ga shafukan da ke ɗauke da bayanai a cikin tebur.
Wannan umarnin yana aiki mafi kyau akan tebura masu tsari sosai.
openclaw extract --json-path "$.data.items[0].name" --url
Yana cire bayanai daga martanin JSON ta amfani da JSONPath. Da amfani don APIs na RESTful.
Fahimci tsarin JSON na martanin kafin rubuta JSONPath.
3. Umarnin Canza da Tsabtace Bayanai (Data Transformation & Cleaning)
openclaw transform --trim "field_name"
Yana cire farar gibba (whitespace) daga farko da ƙarshen filin da aka bayar. Yana taimakawa wajen tsabtace bayanai.
Koyaushe tsabtace filayen rubutu bayan cirewa don ingantaccen sakamako.
openclaw transform --replace "field_name" "old_text" "new_text"
Yana maye gurbin wani rubutu a cikin filin da aka bayar da wani rubutu daban. Da amfani don daidaita bayanai.
Yi amfani da regex a cikin 'old_text' don maye gurbin tsarin rikitarwa.
openclaw transform --format-date "field_name" "YYYY-MM-DD"
Yana canza tsarin kwanan wata a cikin filin da aka bayar zuwa tsari da aka so. Yana da mahimmanci don daidaita kwanan wata.
Tabbatar cewa filin kwanan wata yana da tsari mai fahimta kafin canzawa.
openclaw transform --deduplicate "field_name"
Yana cire kwafin rubutu daga filin da aka bayar. Yana tabbatar da cewa kowane shigarwa na musamman ne.
Yi amfani da wannan a kan filayen da ke da mahimmanci don ganowa na musamman, kamar IDs ko sunaye.
openclaw transform --convert-type "field_name" "integer"
Yana canza nau'in bayanai na filin da aka bayar, misali, daga rubutu zuwa lamba. Yana da amfani don lissafi.
Tabbatar cewa abun ciki na filin yana dacewa da nau'in da kuke son canzawa.
4. Umarnin Fitar da Bayanai (Exporting Data)
openclaw export --format csv --output data.csv
Yana fitar da bayanan da aka cire zuwa fayil na CSV. Mafi yawan tsari don raba bayanai.
Amfani da 'openclaw extract ... | openclaw export ...' don haɗa umarni.
openclaw export --format json --output data.json
Yana fitar da bayanan da aka cire zuwa fayil na JSON. Yana da kyau ga masu haɓaka software.
JSON yana adana tsarin bayanai fiye da CSV, yana da amfani ga bayanai masu rikitarwa.
openclaw export --format excel --output data.xlsx
Yana fitar da bayanan da aka cire zuwa fayil na Excel (.xlsx). Da amfani ga masu amfani da spreadsheet.
Wannan tsari yana da kyau don gabatar da bayanai ga waɗanda ba fasaha ba.
openclaw export --db postgres --table "my_scraped_data"
Yana fitar da bayanan da aka cire kai tsaye zuwa teburin bayanai na PostgreSQL. Yana da mahimmanci don haɗawa da aikace-aikace.
Saita bayanan shiga na bayanai a cikin fayil ɗin sanyi na OpenClaw.
openclaw export --api --auth-token
Yana fitar da bayanan da aka cire zuwa wani API na waje ta hanyar POST request. Da amfani don haɗawa da sabis na girgije.
Tabbatar cewa API endpoint yana tsammanin tsarin bayanai da OpenClaw ke fitarwa.
5. Umarnin Saiti da Sarrafa Kansa (Configuration & Automation)
openclaw config set --key "proxy.url" --value "http://myproxy:8080"
Yana saita ƙimar sanyi don OpenClaw. Waɗannan saitunan suna dawwama tsakanin zaman.
Kuna iya saita saitunan duniya kamar wakilai, iyakokin buƙatu, da bayanan shiga.
openclaw config show
Yana nuna duk saitunan da aka saita a halin yanzu don OpenClaw. Yana da amfani don duba saitunan ku.
Duba saitunan ku kafin fara aiki mai girma don tabbatar da komai ya daidaita.
openclaw schedule --daily "my_scrape_job.yml"
Yana tsara aikin cire bayanai don gudana yau da kullum. Yana buƙatar fayil ɗin sanyi na aiki.
Yi amfani da fayil ɗin YAML don bayyana cikakkun matakan aikin cire bayanai.
openclaw run --script "my_custom_logic.js"
Yana gudana da rubutun JavaScript na al'ada. Yana ba da damar ƙarin sassauci da dabaru masu rikitarwa.
Rubutun JavaScript na iya shiga cikin bayanan da aka cire kuma suyi ƙarin sarrafawa.
openclaw init --project "new_project"
Yana ƙirƙirar sabon aikin OpenClaw tare da tsarin fayiloli na asali. Yana da amfani don farawa da sauri.
Wannan yana haifar da fayilolin sanyi da samfuran rubutun don ku fara.
6. Umarnin Cire Bayanai na Ci Gaba (Advanced Scraping)
openclaw interact --browser chrome --url "https://example.com/login"
Yana buɗe burauza mai sarrafa kansa don mu'amala da shafukan yanar gizo masu amfani da JavaScript. Yana da mahimmanci don shafuka masu ƙarfi.
Kuna iya yin mu'amala da shafi ta atomatik, kamar danna maɓallai ko cika fom.
openclaw login --form "#login-form" --user "myuser" --pass "mypass"
Yana sarrafa tsarin shiga ta atomatik ta hanyar nemo fom kuma cika bayanan shiga. Yana da mahimmanci ga shafuka masu kariya.
Tabbatar cewa zaɓin fom daidai ne don guje wa kuskuren shiga.
openclaw paginate --next-selector "a.next-page" --limit 10
Yana sarrafa pagination ta atomatik ta danna maɓallin 'na gaba' har zuwa iyakacin shafuka. Yana da amfani don tattara bayanai masu yawa.
Gwada zaɓin 'na gaba' sosai don tabbatar da cewa yana aiki a duk shafukan.
openclaw monitor --url "https://example.com/products" --interval 3600
Yana lura da canje-canje a shafin yanar gizo a wani tazara da aka bayar. Yana sanar da ku idan an gano sabbin bayanai ko canje-canje.
Haɗa wannan tare da umarnin fitarwa don adana canje-canje kai tsaye.
openclaw captcha solve --image "captcha.png" --service "2captcha"
Yana haɗawa da sabis na warware captcha don magance captchas ta atomatik yayin aikin cire bayanai. Yana da mahimmanci ga shafuka masu kariya.
Saita API key na sabis ɗin captcha a cikin saitunan OpenClaw.