Improvement [1.1.293] Parsing e-mails with Net::Whois

Thread in section "1.1.323", started by Support, 21 May 2015.

  1. Support

    Support Administrator
    A-Parser Enterprise forum team

    Registered: 16 Mar 2012
    Posts: 4,545
    Likes: 2,163
    As an example, take the site http://paddypower.com/
    Parsing it with Net::Whois returns the following:
    Whois Server Version 2.0

    Domain names in the .com and .net domains can now be registered
    with many different competing registrars. Go to http://www.internic.net
    for detailed information.

    Domain Name: PADDYPOWER.COM
    Registrar: SAFENAMES LTD
    Sponsoring Registrar IANA ID: 447
    Whois Server: whois.safenames.net
    Referral URL: http://www.safenames.net
    Name Server: PDNS1.ULTRADNS.NET
    Name Server: PDNS2.ULTRADNS.NET
    Name Server: PDNS3.ULTRADNS.ORG
    Name Server: PDNS4.ULTRADNS.ORG
    Name Server: PDNS5.ULTRADNS.INFO
    Name Server: PDNS6.ULTRADNS.CO.UK
    Status: clientDeleteProhibited http://www.icann.org/epp#clientDeleteProhibited
    Status: clientTransferProhibited http://www.icann.org/epp#clientTransferProhibited
    Updated Date: 06-jun-2013
    Creation Date: 20-jul-1998
    Expiration Date: 19-jul-2015

    >>> Last update of whois database: Wed, 20 May 2015 20:30:15 GMT <<<

    NOTICE: The expiration date displayed in this record is the date the
    registrar's sponsorship of the domain name registration in the registry is
    currently set to expire. This date does not necessarily reflect the expiration
    date of the domain name registrant's agreement with the sponsoring
    registrar. Users may consult the sponsoring registrar's Whois database to
    view the registrar's reported date of expiration for this registration.

    TERMS OF USE: You are not authorized to access or query our Whois
    database through the use of electronic processes that are high-volume and
    automated except as reasonably necessary to register domain names or
    modify existing registrations; the Data in VeriSign Global Registry
    Services' ("VeriSign") Whois database is provided by VeriSign for
    information purposes only, and to assist persons in obtaining information
    about or related to a domain name registration record. VeriSign does not
    guarantee its accuracy. By submitting a Whois query, you agree to abide
    by the following terms of use: You agree that you may use this Data only
    for lawful purposes and that under no circumstances will you use this Data
    to: (1) allow, enable, or otherwise support the transmission of mass
    unsolicited, commercial advertising or solicitations via e-mail, telephone,
    or facsimile; or (2) enable high volume, automated, electronic processes
    that apply to VeriSign (or its computer systems). The compilation,
    repackaging, dissemination or other use of this Data is expressly
    prohibited without the prior written consent of VeriSign. You agree not to
    use electronic processes that are automated and high-volume to access or
    query the Whois database except as reasonably necessary to register
    domain names or modify existing registrations. VeriSign reserves the right
    to restrict your access to the Whois database in its sole discretion to ensure
    operational stability. VeriSign may restrict or terminate your access to the
    Whois database for failure to abide by these terms of use. VeriSign
    reserves the right to modify these terms at any time.

    The Registry database contains ONLY .COM, .NET, .EDU domains and
    Registrars.

    For more information on Whois status codes, please visit
    https://www.icann.org/resources/pages/epp-status-codes-2014-06-16-en.

    If the same site is checked on, for example, http://whois.icann.org/, the result is:
    Domain Name: PADDYPOWER.COM
    Registrar WHOIS Server: whois.safenames.net
    Registrar URL: http://www.safenames.net
    Updated Date: 2013-06-06T12:31:24Z
    Created Date: 1998-07-20T04:00:00Z
    Registrar Registration Expiration Date: 2015-07-19T04:00:00Z
    Registrar: Safenames Ltd
    Registrar IANA ID: 447
    Registrar Abuse Contact Email: [email protected]
    Registrar Abuse Contact Phone: +44.1908200022
    Registrant Name: Domain Manager
    Registrant Organisation: Power Leisure Bookmakers Limited
    Registrant Address Line 1: Belfield Office Park
    Registrant Address Line 2: Beechill Road
    Registrant City: Dublin
    Registrant State/Province:
    Registrant Postal Code: Dublin 4
    Registrant Country: IE
    Registrant Phone: +353.19050999
    Registrant Fax:
    Registrant Email: [email protected]
    Admin Name: International Domain Administrator
    Admin Organisation: Safenames Ltd
    Admin Address Line 1: Safenames House, Sunrise Parkway
    Admin Address Line 2:
    Admin City: Milton Keynes
    Admin State/Province: Bucks
    Admin Postal Code: MK14 6LS
    Admin Country: UK
    Admin Phone: +44.1908200022
    Admin Fax: +44.1908325192
    Admin Email: [email protected]
    Tech Name: International Domain Tech
    Tech Organisation: International Domain Tech
    Tech Address Line 1: Safenames House, Sunrise Parkway
    Tech Address Line 2:
    Tech City: Milton Keynes
    Tech State/Province: Bucks
    Tech Postal Code: MK14 6LS
    Tech Country: UK
    Tech Phone: +44.1908200022
    Tech Fax: +44.1908325192
    Tech Email: [email protected]
    Name Server: pdns1.ultradns.net
    Name Server: pdns2.ultradns.net
    Name Server: pdns4.ultradns.org
    URL of the ICANN WHOIS Data Problem Reporting System: http://wdprs.internic.net/

    Safenames - Experts in Global Domain Management and Online Brand Protection

    Domain Registration in over 760 different extensions
    Enterprise Domain Management since 1999
    Mark Protect™ Online Brand Monitoring and Enforcement
    Domain Consulting and Strategy
    Domain Name Acquisition
    Domain Disputes and Recovery

    Visit Safenames at www.safenames.net
    +1 703 574 5313 in the US/Canada
    +44 1908 200022 in Europe

    The Data in the Safenames Registrar WHOIS database is provided by Safenames for
    information purposes only, and to assist persons in obtaining information about
    or related to a domain name registration record. Safenames does not guarantee
    its accuracy. Additionally, the data may not reflect updates to billing
    contact information.

    By submitting a WHOIS query, you agree to use this Data only for lawful purposes
    and that under no circumstances will you use this Data to:

    (1) allow, enable, or otherwise support the transmission of mass unsolicited,
    commercial advertising or solicitations via e-mail, telephone, or facsimile; or
    (2) enable high volume, automated, electronic processes that apply to Safenames
    (or its computer systems). The compilation, repackaging, dissemination or
    other use of this Data is expressly prohibited without the prior written
    consent of Safenames. Safenames reserves the right to terminate your access to
    the Safenames Registrar WHOIS database in its sole discretion, including
    without limitation, for excessive querying of the WHOIS database or for failure
    to otherwise abide by this policy. Safenames reserves the right to modify
    these terms at any time. By submitting this query, you agree to abide by this
    policy.

    The point of this improvement is to retrieve additional data, such as e-mail addresses, and expose it through a separate variable, or at least have it appear in the raw data.
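
    To make the request concrete, here is a minimal sketch of pulling e-mail addresses out of a raw WHOIS response. The pattern and the `extract_emails` helper are illustrative assumptions, not A-Parser code, and the sample addresses are made up.

```python
import re

# Illustrative pattern (an assumption, not the one A-Parser uses):
# a deliberately loose match for typical addresses in WHOIS records.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")


def extract_emails(raw_whois: str) -> list:
    """Return unique e-mail addresses, lowercased, in order of first appearance."""
    seen = set()
    emails = []
    for match in EMAIL_RE.finditer(raw_whois):
        email = match.group(0).lower()
        if email not in seen:
            seen.add(email)
            emails.append(email)
    return emails


# Made-up sample in the shape of a registrar WHOIS record.
sample = """\
Registrar Abuse Contact Email: abuse@registrar.example
Registrant Email: Owner@Example.com
Admin Email: owner@example.com
"""
print(extract_emails(sample))
```

    Addresses are lowercased before de-duplication because the same contact often appears in several fields with different casing.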
     
    Master и Metroid нравится это.
  2. Forbidden

    Forbidden Administrator
    A-Parser Enterprise forum team

    Registered: 9 Mar 2013
    Posts: 3,337
    Likes: 1,794
    Added the Recursive query option, which makes it possible to retrieve the extended version of the WHOIS record.
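
    For readers wondering what a recursive WHOIS lookup involves, the sketch below follows the "Whois Server:" referral from the registry response to the registrar's server, which is where the contact e-mails live in the example above. This is only a guess at the general technique, not A-Parser's implementation; `find_referral`, `query_whois` and `recursive_whois` are hypothetical names, and registries may expect a different query syntax than the bare domain sent here.

```python
import re
import socket
from typing import Callable, List, Optional

# Matches referral lines such as "Whois Server: whois.safenames.net" or
# "Registrar WHOIS Server: whois.safenames.net".
REFERRAL_RE = re.compile(
    r"^\s*(?:Registrar\s+)?Whois Server:[ \t]*(\S+)", re.I | re.M
)


def find_referral(response: str) -> Optional[str]:
    """Return the referred WHOIS server named in a response, if any."""
    match = REFERRAL_RE.search(response)
    return match.group(1).lower() if match else None


def query_whois(server: str, domain: str, timeout: float = 10.0) -> str:
    """Send one WHOIS query: plain text over TCP port 43."""
    with socket.create_connection((server, 43), timeout=timeout) as conn:
        conn.sendall(domain.encode("ascii") + b"\r\n")
        chunks = []
        while True:
            data = conn.recv(4096)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


def recursive_whois(
    domain: str,
    first_server: str,
    query: Callable[[str, str], str] = query_whois,
    max_hops: int = 3,
) -> List[str]:
    """Collect responses while following referrals, skipping visited servers."""
    responses = []
    visited = set()
    server = first_server
    while server and server not in visited and len(responses) < max_hops:
        visited.add(server)
        text = query(server, domain)
        responses.append(text)
        server = find_referral(text)
    return responses
```

    The `visited` set matters because a registrar's response usually names its own server again, as the whois.safenames.net record above does. Passing a different `query` callable lets the referral logic be exercised without touching the network.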
     
  3. Support

    Preset for parsing e-mails
    Code:
    eyJwcmVzZXQiOiJSZWN1cnNpdmUgV2hvaXMgKyBFbWFpbHMiLCJ2YWx1ZSI6eyJw
    cmVzZXQiOiJSZWN1cnNpdmUgV2hvaXMgKyBFbWFpbHMiLCJwYXJzZXJzIjpbWyJO
    ZXQ6Oldob2lzIiwiZGVmYXVsdCIseyJ0eXBlIjoiY3VzdG9tUmVzdWx0IiwicmVz
    dWx0IjoiZGF0YSIsInJlZ2V4IjoiKCg/PlxcYlstYS16MC05Ll8lK10rKUBbYS16
    MC05Li1dK1xcLlthLXpdezIsNn0pXFxiIiwicmVnZXhUeXBlIjoiaWciLCJyZXN1
    bHRUeXBlIjoiYXJyYXkiLCJhcnJheU5hbWUiOiJlbWFpbHMiLCJyZXN1bHRzIjpb
    ImVtYWlsIl19LHsidHlwZSI6Im92ZXJyaWRlIiwiaWQiOiJmb3JtYXRyZXN1bHQi
    LCJ2YWx1ZSI6IiRlbWFpbHMuZm9ybWF0KCckcXVlcnk7JGVtYWlsXFxuJykifSx7
    InR5cGUiOiJvdmVycmlkZSIsImlkIjoicmVjdXJzZSIsInZhbHVlIjp0cnVlfSx7
    InR5cGUiOiJvdmVycmlkZSIsImlkIjoicmF3ZGF0YSIsInZhbHVlIjp0cnVlfSx7
    InR5cGUiOiJ1bmlxdWUiLCJyZXN1bHQiOlsiZW1haWxzIiwiZW1haWwiXSwidW5p
    cXVlVHlwZSI6InN0cmluZyIsInVuaXF1ZUdsb2JhbCI6dHJ1ZX1dXSwicmVzdWx0
    c0Zvcm1hdCI6IiRwMS5wcmVzZXQiLCJyZXN1bHRzU2F2ZVRvIjoiZmlsZSIsInJl
    c3VsdHNGaWxlTmFtZSI6IiRkYXRlZmlsZS5mb3JtYXQoKS50eHQiLCJhZGRpdGlv
    bmFsRm9ybWF0cyI6W10sInJlc3VsdHNVbmlxdWUiOiJubyIsInF1ZXJ5Rm9ybWF0
    IjpbIiRxdWVyeSJdLCJ1bmlxdWVRdWVyaWVzIjpmYWxzZSwic2F2ZUZhaWxlZFF1
    ZXJpZXMiOmZhbHNlLCJpdGVyYXRvck9wdGlvbnMiOnsib25BbGxMZXZlbHMiOmZh
    bHNlLCJxdWVyeUJ1aWxkZXJzQWZ0ZXJJdGVyYXRvciI6ZmFsc2UsInF1ZXJ5QnVp
    bGRlcnNPbkFsbExldmVscyI6ZmFsc2V9LCJyZXN1bHRzT3B0aW9ucyI6eyJvdmVy
    d3JpdGUiOmZhbHNlfSwiZG9Mb2ciOiJubyIsImtlZXBVbmlxdWUiOiJObyIsIm1v
    cmVPcHRpb25zIjpmYWxzZSwicmVzdWx0c1ByZXBlbmQiOiIiLCJyZXN1bHRzQXBw
    ZW5kIjoiIiwicXVlcnlCdWlsZGVycyI6W10sInJlc3VsdHNCdWlsZGVycyI6W10s
    ImNvbmZpZ092ZXJyaWRlcyI6W119fQ==
    Result:
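
    The preset code above looks like Base64-encoded JSON wrapped across several lines (the leading `eyJ` is typical of a Base64-encoded JSON object). If that assumption holds, a small helper like the one below can decode it for inspection before importing. `decode_preset` is a hypothetical name, and the demo value is a made-up stand-in rather than the real preset.

```python
import base64
import json


def decode_preset(code: str) -> dict:
    """Decode a preset export: join the wrapped lines, then Base64 -> JSON."""
    compact = "".join(code.split())  # drop line breaks and indentation
    return json.loads(base64.b64decode(compact).decode("utf-8"))


# Tiny stand-in made up for this example; paste the real code block instead.
demo = base64.b64encode(json.dumps({"preset": "demo"}).encode("utf-8")).decode("ascii")
wrapped = demo[:10] + "\n    " + demo[10:]  # imitate the forum's line wrapping
print(decode_preset(wrapped))
```

    Decoding the real block this way shows the parser settings as ordinary JSON, which is handy for checking what a shared preset does.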