Yahoo pipes+Autoblogged 全文输出
首先,你需要安装英文版本的WordPress,最新版本目前是2.91。
这里有AutoBlogged 2.5.74版本:
http://www.cnwebmasters.com/thread-37044-1-1.html
下载之后选择直接上传安装。
安装完之后,你可以看见你的控制台最下面,有AutoBlogged的链接,然后点开之后可以看见 Add New Feed.
其实关键是找Feed,AutoBlogged虽然可以自动采集,但是很多Feed都不是全文,都是只有前面的一段,所以采集之后都是概要。
这里,我们就需要借助一个东西,就是Yahoo 的 Pipes。
http://pipes.yahoo.com/pipes/
上面就是Yahoo的Pipes,注册之后,选择新建一个Pipes,然后点“Sources”-“Fetch Feed”,把你找到的采集点,当然只是博客的RSS加上去,就差不多完成了一个采集点。
多找几个采集点,用上面的方法重复几次。
然后将这个“Fetch Feed”拖到Pipe Output,就完成了多个RSS Feed的合烧工作。
然后叫合烧的地址,加到你的AutoBlogged的Add New Feed里,就差不多完工了。
最后运行采集,大功告成。
这里我要说明的几点,如果你做垃圾站,不能做完不管,那样根本获取不了SEO的 流量。把我之前推荐的WP插件的文章看看,装一些插件上去。定期的更新采集,同时做下SEO优 化。
最好能够用一些SEO工 具,多对你的垃圾博客加些外链,这样很容易提高权重。
最重要的一点,不要乱采集,最好保留人家的出处,不然被投诉了,你的主机也完蛋了。
相关英文原文:
Yahoo pipes tutorial for full-feed autoblog
I noticed there are many people who are clueless about Yahoo Pipes or can’t figure it out. First off, let me explain what Pipes is. “Pipes is a powerful composition tool to aggregate, manipulate, and mash-up content from around the web.” Some people will use Pipes to takes several RSS feeds and combine them into 1 feed for use with their website. Others might use Pipes to take a RSS feed that only displays a summary and convert the feed to display the entire content of the article or news story. Or you can use both techniques to aggregate or combine multiple summary feeds into a “content super feed” (I’m trademarking that LOL). For this tutorial, I will explain the second technique.
Step1
Sign up if you haven’t already: http://pipes.yahoo.comStep 2
Click the “Create a pipe” button on the main page.Step 3
On the left menu, drag the “Fetch Feed” module into the grid. At this point you need to add your RSS feed that you want to convert into the “URL” text-box of this module. Use the XML version of your feed. For this example, I will use a health feed from the Washington Post:Code:feed://feeds.washingtonpost.com/wp-dyn/rss/health/index_xml
The purpose of this module is to tell Pipes what feed we are going to be working with. Simple!Step 4
Drag the “Loop” module into the grid. The “Loop” module is located in the Operators submenu on the left menu. Click the little arrow to the left of “Operators” to display the submenu. After you have done this, you will notice there is another grid located inside of this “Loop” module. We will drag another module here in the next step.The purpose of this module is to loop through each RSS item from the feed we specified in the previous module. For example, our feed has 100 stories in it. This module is going to loop through each story, 1 at a time, and do what we tell it to do.
Step 5
Drag the “Fetch Page” module into the “Loop” grid from the previous step. You should see a red box outlining the grid of the “Loop” module when you are hovering correctly. Now select the first dropdown next to “URL” and select item.link or type that in exactly.The purpose of this module will basically look at the source page with in the RSS feed and strip out the full story. Since we threw this module into a “Loop” module, this will be looped for each RSS item(story) and grab the full story from the source page.
Step 6
Drag the “Regex” module onto the main grid. This module is located in the “Operators” submenu.The purpose of this module is to manipulate the story, link, or even title. This module is only optional. Some examples I’ve used are to strip all links from a story. Or to remove the season and episode number from the title on a Hulu feed. Or to change every instance of the word Blackhat and make it output the word Whitehat instead. There are many regex examples out there on the web.
Step 7
Connect these all up. Currently these modules are all separate and there is no data flow from each module to the next. Data comes in the top of the module, filters through the module, then exits the module on the bottom. So click and hold the little circle at the bottom of the first module(“Fetch Feed”) and drag it to the top circle of the “Loop” module and release the click. You should see a connection or Pipe between the two modules. Now connect the bottom of the “Loop” module to the top of the “Regex” module. Finally, connect the bottom of the “Regex” module to the top of the “Pipe Output”Now that the design is setup, you should test your connections. If you click on the “Pipe Output” module on the grid, it should turn orange. And at the bottom of the webpage, it should be generating the feed. After completion, you should see a list of your RSS stories in the bottom pane. If not, or there is an error, click the Refresh button in the bottom panes a few times. If you do not see any items then you either forgot to put the URL to your feed from Step 3 or you messed up your connections in Step 7. Try again or try another feed to test. Once you see your items in the bottom pane, you may continue.
Step 8
We need to find markers or characteristics on the source pages to pull out the full story. Open up your original RSS feed in a new browser or tab. Click on the first story so we are now on the source website reading the original story. Now on your browser, you need to view the source of the webpage. We need to find the story in this source. We also need to find a unique marker just before and just after the story. Here’s a snippet of an article from the feed:Code:Washington Post Staff Writer
Tuesday, January 5, 2010
Scientists may have created a vaccine against cocaine addiction: a series of shots that changes the body’s chemistry so that the drug can’t enter the brain and provide a high.
The vaccine, called TA-CD, shows promise but could also be dangerous; some of the addicts participating in a study of the vaccine started doing massive amounts of cocaine in hopes of overcoming its effects, according to Thomas R. Kosten, the lead researcher on the study, which was published in the Archives of General Psychiatry in October.
“After the vaccine, doing cocaine was a very disappointing experience for them,” said Kosten, a professor of psychiatry and neuroscience at Baylor College of Medicine in Houston.
Nobody overdosed, but some of them had 10 times more cocaine coursing through their systems than researchers had encountered before, according to Kosten. He said some of the addicts reported to researchers that they had gone broke buying cocaine from multiple drug dealers, hoping to find a variety that would get them high.
Of the 115 addicts in the study, 58 were given the vaccine, administered in a series of five shots over 12 weeks, while 57 received placebo injections. Six people dropped out before the end of the study. The researchers recruited the participants from a methadone-treatment program in West Haven, Conn., which made it possible to track them for the full 24 weeks of the study. The patients were addicted to cocaine and heroin; TA-CD is designed to work only on cocaine, including the crack form of the drug.
Like disease vaccines, TA-CD stimulates a person’s immune system to produce antibodies. Of those who received all five vaccine injections, 38 percent reached antibody levels that were high enough to dull the effects of the drug. The antibodies stayed active for eight to 10 weeks after the last shot.
In the high-antibodies group, 53 percent stayed off cocaine more than half the time once they had built up immunity. That compares with 23 percent of those who produced fewer antibodies. The researchers monitored cocaine use through regular urinalysis.
“In this study, immunization did not achieve complete abstinence from cocaine use,” Kosten said. “Previous research has shown, however, that a reduction in use is associated with a significant improvement in cocaine abusers’ social functioning and thus is therapeutically meaningful.”
About a quarter of those who received the vaccine did not make sufficient antibodies at all; Kosten isn’t sure why.
“That’s the million-dollar question,” said Margaret Haney, a professor of clinical neuroscience at Columbia University Medical Center, who is also researching the cocaine vaccine though she was not involved in Kosten’s study.
In October, the journal Biological Psychiatry published online an article by Haney that also tested the effects of TA-CD.
Through newspaper ads, Haney had recruited 15 cocaine-dependent men to participate in her study. (Only 10 stayed to the end.)
In the beginning of the story, you should see the text “”. This is unique to the page meaning there is only 1 instance of it on the page source AND it is on every news story on this feed. This will be our beginning marker. Hey look at the end, Washington Post is handing us their shit on a silver platter “”. This will be our end marker. Now back to Yahoo Pipes.
Step 9
Within the “Fetch Page” module, you will see an area that says “Cut content from:” and this first box will be the beginning marker () and the box to the right of that will be your end marker()Step 10
Within the “Fetch Page” module, ensure that “assign” is selected and NOT “emit”. Ensure the dropdown says “first” and NOT “all”. To the right of that, change the dropdown for “results to” to item.description. This is where the full content is swapped with the summary on your original RSS feed.Step 11
Almost there:) This is an optional step. If you are happy with your out put then skip this step. But you MAY want to strip links out of your story that may be inserted such as adds or reference links. You don’t want these on your blog. Do you? Within the “Regex” module, add a rule by clicking the plus sign in the module. Select item.description.content in the first box. This is the item that we are editing. Paste into the “replace” box the followingCode:<[/\]?[a]\s+[^>]*>
Don’t ask how to read regex because thats a whole tutorial on its own.Step 12
Save your Pipe by clicking the Save botton at the top right of your window. And name it.Step 13
Lets get the NEW and IMPROVED feed url. Click “Run Pipe…” at the top of the page. Now click the “Get as RSS” link and you should see your new RSS feed. Copy that url into your favorite autoblogging plugin and you will now be ripping full news story’s instead of excerpts![]()
Feel free to ask questions and GOOD LUCK !!! If you like this then don’t forget the Thanks :p
原创文章,转载请注明: 转载自互联网广 告博客
本文链接地址: 利用AutoBlogged外加Yahoo的 Pipes做英文垃圾站的方法
这是个中文图文教程,和DD转的这个流程一样:
很多网站尤其是一些新闻类的网站虽然提供RSS输出,但他们的RSS输出并不是全文输出,如果要看全文还要点击进去,而且有些文章还可能要翻墙才能 浏览,这样总感觉不是很爽……不过现在有了Yahoo Pipes就好办了!这是Yahoo提供的一款超级强大的RSS处理工具,今天俺就教你如何用它来输出网站的全文!
第一步
先在 Yahoo Pipes 里新建一个 pipe(如图)
第二步
拖入一个 Fetch Feed 模块,输入你想要全文输出的RSS地址(如图我添加的是路透社-时事要闻的RSS)
第三步
然后到Operators条目下拖入一个 Loop 模块,与 Fetch Feed 相连接
再到Sources条目拖一出个 Fetch Page 模块拖进入 Loop(注意是拖进Loop里面去,如图)
设置 URL 为 item.link
第四步
这是最关键的一步!!!
随便打开你要全文输出的RSS其中一篇文章,然后等网页加载完毕后,查看这篇网页的源代码
然后查找网页源代码中正文部分,把能囊括正文的<div=”********”>复制出来(这个div的值,是网站管理者设定的,一定 要,不然pipes不知道收录哪里,如图,路透社的<div id=”resizeableText”>)
然后填入到Fetch Page中的Cut content from中,如图
第五步
把assign项选为first,然后把results to填为item.description,将 Loop 连接到 Pipe Out,保存,大功告成!!!
最后
这是我做的路透社-时事新闻的截图
当然你要检查一下有时候item.descriptiom下可能输不出全文或乱码,那你要debug了,可能我以后会写文章另解,今天就写到这了, 如图,反正路透社这个Pipes是正常的
如果你要输出的RSS的地址美观一些,可以将 Pipes 弄好的 RSS 烧录到 Feedburner 或 feedsky
再来个例子:
http://pipes.yahoo.com/pipes/pipe.info?_id=62a4e6ddf3e2b67212d8d3ecdd4f304e
原创文章,转载请注明:转载自 Qzhu.cc










近期评论