Post

2 followers Follow
0
Avatar

Excluding Websites

Hello, I am wondering if anyone can help me in a way to exclude a certain website from being captured, to further elaborate I have a running a stream that looks into (All sources) and keeps on capturing pages from WikiPedia, I looked into the documentation but couldn't find what I need to know so I used:
(AND NOT interaction.link url_in "wikipedia.org/wiki") to exclude those pages but I am not sure it will work.

Any Ideas?

Mostafa Amro

Please sign in to leave a comment.

4 comments

0
Avatar

Thank you Jason, but it seems that (AND NOT links.domain contains "wikipedia") will exclude pages/content that contains a WikiPedia URL, as it says on the page
Examples:
1. Filter for Tweets that contain a link to any eBay.com page:
links.domain == "ebay.com"

  1. Filter for posts mentioning URLs that include any domains from a list: links.domain IN "thetimes.co.uk, blogs.thetimes.co.uk, subdomain.thetimes.co.uk"

In my case I want to exclude the (source Wikipedia) not exclude a forum post that contains a URL to Wikipedia.

Thank you

Mostafa Amro 0 votes
Comment actions Permalink
0
Avatar

Ok, I think I understand - you don't want to receive any interactions from the Wikipedia data source. You can use the interaction.type target to specify which data sources you do/do not want to receive interactions from. For example, the following will only return interactions from the twitter and Facebook data sources:

interaction.type in "twitter, facebook"

http://dev.datasift.com/docs/targets/common-interaction/interaction-type

Jason D. 0 votes
Comment actions Permalink
0
Avatar

Thank you Jason, I think I'll just go to Sources on the dashboard and deactivate Wikipedia.

Mostafa Amro 0 votes
Comment actions Permalink