Tsanangudzo yeSpidering uye Web Crawlers

Spiders & Web Crawlers: Zvaunoda Kuziva Kudzivirira Website Data

Tsangadzi mapurogiramu (kana automated scripts) 'anokamba' kuburikidza neWebvu kutsvaga deta. Tsanga dzinofamba paIndaneti dzewebsite uye dzinogona kukwevera data kubva pamapeji ewebhu semakero e-email. Nyeredzi dzinoshandiswa kudyora ruzivo runowanika pawebsite kuitsvaga injini.

Spiders, iyo inonziwo 'webcwlers' inotsvaga paWebhu uye kwete vose vane ushamwari mune vavariro dzavo.

Spammers Spider Websites Kuunganidza Mashoko

Google, Yahoo!

uye mamwe injini dzekutsvaga haisi ivo chete vanofarira kunyorera mawebhusayithi - saka vane scammers uye spammers.

Nyeredzi nedzimwe shanduro dzinoshandiswa zvinoshandiswa nevatambi kuti vatsvake maail email (paIndaneti tsika iyi inowanzonzi 'kukohwa') pawebsite uye zvino shandisa iyo kugadzira zvinyorwa zve spam.

Tsanga dzinoshandiswa nemitsva yekutsvaga kuti uwane mamwe ruzivo pamusoro pewebsite yako asi asiya isina kuvharwa, webhusaiti isina mazano (kana, 'mvumo') pamusoro pekukwazva nzvimbo yako inogona kupa hurukuro yekuchengeteka kwemashoko makuru. Tsanga dzinofamba nekutevera mazano, uye dzinonyanya kuwanikwa pakutsvaga mazano kune databases, purogiramu yepurogiramu, uye mamwe mashoko mausingadi kuti vawane.

VaWebmasters vanogona kutarisa mabheji kuti vaone kuti vaspidzi nemamwe mabhoti vakashanyira nzvimbo dzavo. Mashoko aya anobatsira webmasters kuziva kuti ndiani anonyora nzvimbo yavo, uye kangani.

Iyi ruzivo inobatsira nokuti inobvumira webmasters kuti vaite zvakanaka SEO yavo uye vashandise mafaira eti robot.txt kudzivisa mamwe mabhobhoti kuti arege kutamba nzvimbo yavo munguva yemberi.

Mazano ekuchengetedza Nzvimbo Yenyu Yacho Kubva Vanokwereta Bhobho Dzisina Kudiwa

Pane nzira yakanyatsojeka yekuchengetedza vanhu vasingadi kuwanikwa kunze kwewebsite yenyu. Kunyange kana iwe usingafungi nezvemagumisiro ane utsinye anokambaira nzvimbo yako (kubvisa kero yako ye-email hakuzokudziviriri kubva kune vakawanda vanokambaira), iwe unofanirwa kunge uchida kupa injini yekutsvaga nemirairo inokosha.

Zvose mawebsite anofanira kuva nefaira iri mu root directory inonzi robots.txt file. Iyi faira inokubvumira kuraira vashandisi vewebhu paunoda kuti vatarise pamapeji ekunyora (kunze kwekunge zvakataurwa pane imwe meta yedeta yepaji yega yega kuti irege kunyorwa) kana iri injini yekutsvaga.

Sezvo iwe unogona kutaurira vadiplers vaida waunoda kuti vatarise, iwe unogonawo kuvaudza kuti vangasaenda sei uye kunyange kudzivisa vamwe vakwegura kubva pawebsite yako yose.

Zvakakosha kuchengeta mupfungwa kuti chitubu chakaiswa pamwe nemabhoti.txt faira ichave yakakosha zvikuru kune injini yekutsvaga uye inogona kunge iri chinhu chinokosha pakuvandudza iwe webhusaiti yako, asi mamwe ma robot crawlers achakanganwa mirairo yako. Nokuda kwechikonzero ichi, zvakakosha kuchengeta zvose software yako, plugins, uye mapurogiramu kusvika panguva dzose.

Zvakafanana Nyaya uye Mashoko

Pamusana pekupararira kwemashoko ekukohwa akashandiswa nefarious (spam) zvinangwa, mutemo wakapiwa muna 2003 kuita zvimwe zviito zvisiri pamutemo. Iyi mitemo yekudzivirira vatengi inowira pasi peCAN-SPAM Act ye 2003.

Zvakakosha kuti utore nguva yekuverenga pamusoro peCHA-SPAM Mutemo kana bhizinesi rako richiita chero mazita ekutumirwa kwemashoko kana kukohwa mashoko.

Iwe unogona kuwana zvakawanda pamusoro pemitemo yekuzvidza-spam uye kuti ungagadzirisa sei spammers, uye iwe iwe sebhizimisi bhizinesi chingaita, nekuverenga nyaya dzinotevera: