

For example, some sites ask you to log in or register the first time you visit them. The URL you entered then redirects to a different URL, and the resulting page shows content that blocks the site instead of the page you want. In this case, you can enter a string taken from the URL of that different page into the “the URL of the result page contains” textbox, so that Octoparse reloads the website when it lands there.
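The idea behind this retry condition can be sketched in a few lines of Python. This is only an illustration of the logic, not Octoparse's own code: `needs_reload`, `open_with_retry`, the `fetch` callable and the example.com URLs are all hypothetical names made up for the sketch.

```python
from typing import Callable, Iterable, Tuple


def needs_reload(result_url: str, markers: Iterable[str]) -> bool:
    """Return True if the result URL contains any of the marker strings."""
    return any(marker in result_url for marker in markers)


def open_with_retry(
    fetch: Callable[[str], Tuple[str, str]],
    url: str,
    markers: Iterable[str],
    max_retries: int = 3,
) -> Tuple[str, str]:
    """Load url, reloading while the result URL still matches a marker.

    fetch is a hypothetical callable returning (result_url, page_html).
    """
    markers = list(markers)
    result_url, html = fetch(url)
    for _ in range(max_retries):
        if not needs_reload(result_url, markers):
            break  # we reached the page we wanted
        result_url, html = fetch(url)  # reload the original URL
    return result_url, html


# Simulated site: the first visit redirects to a login page, the second works.
responses = iter([
    ("https://example.com/login?next=/items", "<html>Please log in</html>"),
    ("https://example.com/items", "<html>Items</html>"),
])
final_url, page = open_with_retry(
    lambda u: next(responses), "https://example.com/items", ["/login"]
)
print(final_url)
```

Here the marker string "/login" plays the role of the text entered in the “the URL of the result page contains” textbox.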

Retry in following conditions: When you open a URL in Octoparse, you can choose to reload the website when one of the conditions set for this option occurs.

Clear Cache: Choose this option to clear the cache before opening the web page.

Customize Cookie: Choose this option to use a specified cookie.

Cookie: Click “Load cookie from current web page”, then click anywhere in the blank space of the “Cookie” textbox. The cookies of the current web page will be shown in the drop-down menu.

Scroll Down: Scroll down to the bottom of the page when it has finished loading. You can set the interval between scrolls and choose whether to scroll to the end of the page or down one screen at a time. (Usually used for websites with infinite scrolling.)

Use Loop URL: Use the current loop items as navigation URLs. When you choose this option, a pop-up window tells you that it is available only when the current step is the first sub-step of Loop item.
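To make the cookie option more concrete, here is a minimal Python sketch of what a cookie string looks like and how it splits into name/value pairs. It assumes the cookie is written in the usual `name=value; name=value` form; the cookie names and values are made up, and the helper functions are hypothetical and unrelated to Octoparse's internals.

```python
from http.cookies import SimpleCookie


def parse_cookie_string(raw: str) -> dict:
    """Split a 'name=value; name=value' cookie string into a dict."""
    cookie = SimpleCookie()
    cookie.load(raw)
    return {name: morsel.value for name, morsel in cookie.items()}


def to_cookie_header(cookies: dict) -> str:
    """Join name/value pairs back into a single Cookie header value."""
    return "; ".join(f"{name}={value}" for name, value in cookies.items())


raw = "sessionid=abc123; lang=en"  # made-up example values
cookies = parse_cookie_string(raw)
header = to_cookie_header(cookies)
print(cookies)
print(header)
```

A cookie copied from a logged-in browser session is what typically lets a later request skip the login page.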
To open the target website or web page, enter the URL directly in the address bar of the built-in browser and click “Go”; the “Open a webpage” action is then created automatically. Alternatively, drag an “Open a webpage” action into the Workflow Designer, enter the URL in the “Page URL” textbox, and click “Save”.

Timeout: Set the maximum time allowed for the page to load.

Block Pop-up: Block pop-up windows (possibly ads).

As Instagram loads its content with AJAX, we should set up AJAX Load for the "Click Item" action. Next, click "Loop click next page" on the "Action Tips". Instagram also uses AJAX on the ">" button, so we need to set up AJAX Load for the "Click to Paginate" action as well: click "Load the page with AJAX" on the "Customize Action". To learn more about dealing with AJAX in Octoparse, please refer to Deal with AJAX.


When you select an item with a URL, the selected tag would be "A". Normally there’s no need to modify it, as Octoparse automatically identifies the tags of selected items. But in this case, we need to revise the tag at the bottom of "Action Tips".
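The "A" tag is the HTML anchor element that carries a link's URL in its `href` attribute. As a rough illustration of why an item with a URL maps to that tag, the following Python sketch collects the `href` of every anchor in a small, made-up HTML fragment. The `LinkCollector` class and the sample markup are assumptions for this example, not something taken from Octoparse or Instagram.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href attribute of every <a> element that is fed in."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag names, so the "A" tag arrives as "a".
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


# Made-up fragment: two linked items and one element without a URL.
sample = (
    '<div><a href="/p/abc/">Post</a>'
    "<span>caption</span>"
    '<a href="/p/def/">Post</a></div>'
)
collector = LinkCollector()
collector.feed(sample)
print(collector.links)
```

Only the two anchor elements contribute URLs; the `span` is skipped because it has no link of its own.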
