Documente online.
Zona de administrare documente. Fisierele tale
Am uitat parola x Creaza cont nou
 HomeExploreaza
upload
Upload




Microsoft® Office SharePoint® Server 2007

software


Microsoft® Office SharePoint® Server 2007
Plan to crawl content worksheet

Fill in this worksheet using the following topics:



Plan to crawl content

After filling in this worksheet, use it with the following topics:

Deployment for Office SharePoint Server 2007

Prepared by:

Date:

Shared Services Provider (SSP)

Specify the SSP name to which this worksheet pertains.

Note   Most organizations use only one SSP. If you are planning to use multiple SSPs, use a separate worksheet for each SSP.

SSP name

Default content access account

Specify the content access account the crawler will use, by default, when crawling content.

Default content access account

Content sources

Use the following section of the worksheet to record your decisions about content sources. The section contains five tables - one for each content source type. If you need more than one content source for a particular type of content, copy the appropriate table, as needed.

SharePoint sites

Use the following table to specify a content source for crawling SharePoint sites.

Content source name

Content source type

SharePoint sites

Start addresses

Crawl settings
(how deep to crawl)

Select one:

Crawl everything under the host name for each start address.

Crawl only the SharePoint site of each start address.

Crawl schedule
(full crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Crawl schedule
(incremental crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

  • In the Content source name box, specify the name you want to assign to this content source. Each content source must have a unique name.
  • In the Start addresses box, specify each start address (URL) you want to crawl using this content source. Example: https://intranetsite.
  • In the Crawl settings (how deep to crawl) box, select how deep you want to crawl each start address in this content source.
  • In the Crawl schedule (full crawl) box, choose a daily, weekly, or monthly schedule, and then specify the choices for that type of schedule.
  • In the Crawl schedule (incremental crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.

Web sites

Use the following table to specify a content source for crawling Web sites.

Content source name

Content source type

Web sites

Start addresses

Crawl settings
(how deep to crawl)

Select one:

Crawl only within the server of each start address.

Crawl only the first page of each start address.

Custom - specify page depth and server hops:

Limit page depth to _______ pages (default is unlimited).

Limit server hops to _______ hops (default is unlimited).

Crawl schedule
(full crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (choose all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Crawl schedule
(incremental crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

  • In the Content source name box, specify the name you want to assign to this content source. Each content source must have a unique name.
  • In the Start addresses box, specify each start address (URL) you want to crawl using this content source. Example: https://example.contoso.com/my_page.htm, or https://example.contoso.com.
  • In the Crawl settings (how deep to crawl) box, select how deep you want to crawl each start address in this content source. Note that the custom crawl setting enables you to choose how many pages deep to crawl and how many server hops to allow.
  • In the Crawl schedule (full crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.
  • In the Crawl schedule (incremental crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.

File shares

Use the following table to specify a content source for crawling file shares.

Content source name

Content source type

File shares

Start addresses

Crawl settings
(how deep to crawl)

Select one:

The folder and all subfolders of each start address.

Only the folder of each start address.

Crawl schedule
(full crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month:

Select each month for which you want this schedule to apply.

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Crawl schedule
(incremental crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

  • In the Content source name box, specify the name you want to assign to this content source. Each content source must have a unique name.
  • In the Start addresses box, specify each start address (URL) you want to crawl using this content source. Example: \\server\directory, or file://server/directory.
  • In the Crawl settings (how deep to crawl) box, select how deep you want to crawl each start address in this content source.
  • In the Crawl schedule (full crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want  for that type of schedule.
  • In the Crawl schedule (incremental crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.

Exchange public folders

Use the following table to specify a content source for crawling Exchange public folders.

Content source name

Content source type

Exchange public folders

Start addresses

Crawl settings
(how deep to crawl)

Select one:

The folder and all subfolders of each start address.

Only the folder of each start address.

Crawl schedule
(full crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Crawl schedule
(incremental crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

  • In the Content source name box, specify the name you want to assign to this content source. Each content source must have a unique name.
  • In the Start addresses box, specify each start address (URL) you want to crawl using this content source. Example: https://exchangeserver/public/folder/subfolder.
  • In the Crawl settings (how deep to crawl) box, select how deep you want to crawl each start address in this content source.
  • In the Crawl schedule (full crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.
  • In the Crawl schedule (incremental crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.

Business data

Use the following table to specify a content source for crawling business data, sometimes called line-of-business data.

Content source name

Content source type

Business data

Start addresses

Crawl settings
(how deep to crawl)

Select one:

Crawl entire Business Data Catalog.

Crawl selected applications.

Crawl schedule
(full crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Crawl schedule
(incremental crawl)

Type of schedule: Daily Weekly Monthly

Daily

Run every ________ days.

Starting time: ________

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Weekly

Run every _______ weeks.

On (select all that apply):

Monday

Tuesday

Wednesday

Thursday

Friday

Saturday

Sunday

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

Monthly

On the ______ day of the month.

Select each month for which you want this schedule to apply:

January  May  September

February  June  October

March  July  November

April  August  December

Starting time: _______

Repeat within the day (optional)

Every _____ minutes.

For ______ minutes.

  • In the Content source name box, specify the name you want to assign to this content source. Each content source must have a unique name.
  • In the Start addresses box, specify each start address (URL) you want to crawl using this content source.
  • In the Crawl settings (how deep to crawl) box, select how deep you want to crawl each start address in this content source.
  • In the Crawl schedule (full crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.
  • In the Crawl schedule (incremental crawl) box, choose a daily, weekly, or monthly schedule, and then specify the details you want for that type of schedule.

Crawler impact rules

Use this section of the worksheet to record your decisions regarding crawler impact rules. Make a copy of this table for each crawler impact rule you want to define.

Site (URL)

Request frequency

Choose one of the following:

Request up to the specified number of documents at a time.

(Select one.)

- or -

Request one document at a time and wait _______ seconds
between requests.

  • In the Site (URL) box, specify the URL that will be associated with this crawler impact rule.
  • In the Request frequency section, choose one of the following:
    • Request up to the specified number of documents at a time and do not wait between requests. If you choose this option, select how many documents you want the crawler to request at a time when crawling this URL.
    • Request only one document at a time and wait the specified time between requests. If you choose this option, specify how many seconds to wait between requests.

Protocol handlers

Use the following table to record any third-party or custom protocol handlers you will need during deployment.

Tip   Review the start addresses listed in the Content sources section of this worksheet to determine what protocol handlers are required to access the data that you want to crawl, and then list in the following table the protocol handlers that are not provided, by default.

Protocol handlers

Crawl rules

Use this section of the worksheet to record your decisions regarding crawl rules. Make a copy of this table for each crawl rule you want to define.

Path

Crawl configuration

Choose one of the following options:

Exclude all items in this path.

- or -

Include all items in this path and optionally select any of the following:

Follow links on the URL without crawling the URL itself.

Crawl complex URLs.

Crawl content in SharePoint sites as HTTP pages.

Specify content access account

Choose one of the following options:

Use the default content access account when crawling this path.

- or -

Use this content access account _____ _______ ______ ___________
(domain\account)

Allow basic authentication? Yes  No

- or -

Use the client certificate _____ _______ ______ __________

  • In the Path box, specify the path as a single URL or, by using wildcards, as a set of URLs. Examples: https://servername/path/folder; https://servername/path/*.
  • In the Crawl configuration section, choose to either exclude or include all items in the path. If you choose to include all items in the path you can also choose any or all of the following options:
    • Follow links on the URL without crawling the URL itself.
    • Crawl complex URLs, such as URLs that contain a question mark.
    • Crawl content in SharePoint sites as HTTP pages.
    • In the Specify content access account section, choose to use the default content access account, use a different content access account that you specify, or use a client certificate when crawling this path. If you choose to specify a content access account (other than the default content access account), you can choose whether to allow basic authentication. Note that basic authentication is not allowed, by default.

File-type inclusions

Use the following table to record your decisions about the file types that you want to include in the file-type inclusions list.

File types to add

Require additional IFilter? (Yes/No)

  • In the File types to add column, list the file name extensions for the file types you want to crawl. Note that it is not necessary to list the file name extensions that are included on the file-type inclusion list, by default.
  • In the Require IFilter column, specify whether a default IFilter is provided that supports this file type.

Use the following table to list the file types that you do not want to crawl and that you want to remove from the file-type inclusion list.

File types to remove

  • In the File types to remove column, list the file name extensions for the file types that are listed on the file-type inclusion list, by default, that you do not want to crawl.

Word breakers and stemmers

Use the following table to record the languages for which you need to install word breakers and stemmers.

Languages of word breakers and stemmers

Farm-level search settings

Use the tables in this section to record the decisions you make about farm-level search settings.

Contact e-mail address

Record the e-mail address of the person in your organization whom external site administrators can contact if problems arise when their site is being crawled.

Contact e-mail address

Proxy server settings

Will you configure proxy server settings to use when crawling other servers?

Yes No

If yes, use the following table to record the proxy server settings to use.

Address (required)

Port (optional)

Bypass proxy server for local (intranet) addresses?

Yes  No

Do not use proxy server for addresses beginning with:

The Address can be the either the NetBIOS name or the IP address of the proxy server.

Time-out settings

Use the following table to record the amount of time that the search server will wait while connecting to other services.

Connection time
(in seconds)

Request acknowledgement time

(in seconds)

SSL certificate warning configuration

Do you want to ignore Secure Sockets Layer (SSL) certificate name warnings and trust that sites are legitimate even if their certificate names are not exact matches?

Yes No


Document Info


Accesari: 968
Apreciat: hand-up

Comenteaza documentul:

Nu esti inregistrat
Trebuie sa fii utilizator inregistrat pentru a putea comenta


Creaza cont nou

A fost util?

Daca documentul a fost util si crezi ca merita
sa adaugi un link catre el la tine in site


in pagina web a site-ului tau.




eCoduri.com - coduri postale, contabile, CAEN sau bancare

Politica de confidentialitate | Termenii si conditii de utilizare




Copyright © Contact (SCRIGROUP Int. 2024 )