Spring2009.CSCI470Homework3 History

Show minor edits - Show changes to markup

Changed line 1 from:

Assigned Date: Tuesday, Mar. 17, 2008\\

to:

Assigned Date: Tuesday, Mar. 17, 2009\\

Changed lines 2-4 from:

Due Date: Monday, Mar. 30
Due Time: 1:40pm

to:

Due Date: Friday, Mar. 27
Due Time: 11:55pm

Changed line 2 from:

Due Date: Thursday, Mar. 26\\

to:

Due Date: Monday, Mar. 30\\

Changed lines 46-47 from:

The agent should not revisit a page (i.e., skip already visited URLs). The agent should contain its search within the site (i.e., skip external links). Keywords are conjunctive (never "OR", always "AND").

to:

The agent should not revisit a page (i.e., should skip already visited URLs). The agent should contain its search within the site (i.e., should skip external links). Keywords are conjunctive (never "OR", always "AND").

Changed lines 63-64 from:
  1. What should a node contain? (E.g., URL of page, etc.)
to:
  1. What should a search node contain? (E.g., URL of page, etc.)
  2. What is the goal of the search ? (Think about this carefully.)
Changed lines 72-74 from:
  • Convert each histogram to a list of repeated words (feel free to adopt wordCloud.py).
  • Upload the list of repeated words to http://www.wordle.net. Use font "Duality", layout "Half and Half", and color "Blue meets Orange". Save a screenshot of the generated word cloud as a PNG. For example, here is the word cloud for this page:
to:
  • Convert each histogram to a list of repeated words (feel free to adapt wordCloud.py).
  • Upload the list of repeated words to http://www.wordle.net.
    • Use font "Duality", layout "Half and Half", and color "Blue meets Orange".
    • Save a screenshot of the generated word cloud as a PNG.
    • For example, here is the word cloud for this assignment:
Changed line 71 from:
  • Convert each histogram to a list of repeated words.
to:
  • Convert each histogram to a list of repeated words (feel free to adopt wordCloud.py).
Changed lines 20-21 from:

Write an intelligent agent that searches a website (e.g., the NY Times, or the Washington Post, etc.) to harvest information of interest. The agent will search for webpages (target pages) within this website which contain specific keywords for a given amount of time.

to:

Write an intelligent agent that searches a website (e.g., the NY Times, or the Washington Post, etc.) to harvest information of interest. The agent will be given an amount of time to search for webpages (target pages) within this website which contain specific keywords. When the time alloted is over (or when the complete website is searched, whichever comes first), the agent will output a histogram of words found on the target pages (if any).

Changed line 41 from:
  • the 50 most popular words (in reverse order, along with their cumulative frequency of occurrence across all target pages)
to:
  • the 50 most popular words (in reverse order, along with their cumulative frequency of occurrence across all target pages) - if any (it's possible that no target pages were found)
Changed lines 46-47 from:

You should avoid revisiting a page (i.e., skip already visited URLs). You should contain your search within the site (i.e., skip external links). Keywords are conjunctive (never "OR", always "AND").

to:

The agent should not revisit a page (i.e., skip already visited URLs). The agent should contain its search within the site (i.e., skip external links). Keywords are conjunctive (never "OR", always "AND").

Changed lines 54-55 from:

When constructing the histogram avoid common (stop) words.

to:

When constructing the histogram ignore common (stop) words.

Changed lines 58-59 from:

All programs that you complete in your career as a student and as a professional developer should be fully documented. Follow the Golden Rule of Style: "A program should be as easy for a human being to read and understand as it is for a computer to execute." [1] Obviously, you should comment any variable, obscure statement, block of code, method, and class you create. Your comments should express why something is being done, as opposed to how the how is shown by the code. Also, include opening comments as specified in previous assignment.

to:

All programs that you complete in your career as a student and as a professional developer should be fully documented. Obviously, you should comment any variable, obscure statement, block of code, method, and class you create. Your comments should express why something is being done, as opposed to how the how is shown by the code. Also, include opening comments as specified in previous assignment.

Changed lines 72-73 from:
  • Upload the list of repeated words to http://www.wordle.net. Use font "Duality", layout "Half and Half", and color "Blue meets Orange". Save a screenshot of the generated word cloud as a PNG. For example, there is word cloud for this page:
to:
  • Upload the list of repeated words to http://www.wordle.net. Use font "Duality", layout "Half and Half", and color "Blue meets Orange". Save a screenshot of the generated word cloud as a PNG. For example, here is the word cloud for this page:
Changed line 69 from:
  1. Four word clouds from http://www.wordle.net, nytimesObama.jpg, nytimesBush.jpg, washingtonPostObama.jpg, and washingtonPostBush.jpg. To generate them:
to:
  1. Four word clouds from http://www.wordle.net, nytimesObama.png, nytimesBush.png, washingtonPostObama.png, and washingtonPostBush.png. To generate them:
Changed lines 72-73 from:
  • Upload the list of repeated words to http://www.wordle.net. Use font "Duality", layout "Half and Half", and color "Blue meets Orange". Save a screenshot of the generated word cloud as a JPG.
to:
  • Upload the list of repeated words to http://www.wordle.net. Use font "Duality", layout "Half and Half", and color "Blue meets Orange". Save a screenshot of the generated word cloud as a PNG. For example, there is word cloud for this page:

http://www.cs.cofc.edu/~manaris/uploads/Spring2009/wordCloud.CSCI470-hmwk3.png

Changed lines 69-70 from:
  1. Search NY Times, and the Washington Post using (a) keywords "president", "Obama"; and (b) keywords "president", "Bush". Use the four histograms generated to create four word clouds at http://www.wordle.net (Hint: Use font "Duality", layout "Half and Half", and color "Blue meets Orange").
to:
  1. Four word clouds from http://www.wordle.net, nytimesObama.jpg, nytimesBush.jpg, washingtonPostObama.jpg, and washingtonPostBush.jpg. To generate them:
    • Search NY Times, and the Washington Post using (a) keywords "president", "Obama"; and (b) keywords "president", "Bush".
    • Convert each histogram to a list of repeated words.
    • Upload the list of repeated words to http://www.wordle.net. Use font "Duality", layout "Half and Half", and color "Blue meets Orange". Save a screenshot of the generated word cloud as a JPG.