LGF

-RetweetAn LGF Experiment at Carnegie Mellon?

Thu, May 15, 2008 at 10:44:31 pm PDT

Anyone know what this page at Carnegie Mellon means? It’s some kind of experiment that involves comments posted at LGF, and I have a feeling it’s not friendly.

Here’s a directory index.

UPDATE at 5/16/08 8:40:05 am:

They finally got around to closing their directory indexes; some administrator had quite a surprise this morning upon checking the server logs...

UPDATE at 5/16/08 2:49:52 pm:

A response from the researchers...

Advertisement

596 comments

  • Comments are open and unmoderated, and do not necessarily reflect the views of Little Green Footballs.
  • Obscene, abusive, silly, or annoying remarks may be deleted, but the fact that particular comments remain on the site in no way constitutes an endorsement of their views by Little Green Footballs.
  • Posts that contain phone numbers, street addresses, email addresses or other personal information will also be deleted, as will posts that consist only of a variation on the word, "First!"
  • Comments that advocate violence will be cause for immediate banning with no appeal.
  • Disagreement and debate are welcome, but insults and abuse are not, and may cause your account to be blocked.
  • REMEMBER: posting comments at LGF is a privilege, not a right. Abuse that privilege, and your account will be blocked.

Hide comments | Jump to bottom

1 pat  Thu, May 15, 2008 10:46:46pm

No, but I don't think I should. Pat Benatar. Hmmm.

2 song_and_dance_man[deleted]  Thu, May 15, 2008 10:47:18pm
3 DesertSage  Thu, May 15, 2008 10:47:42pm

Are they searching under the Denver airport yet?

4 RTLM  Thu, May 15, 2008 10:48:41pm

Gee, you guys are all famous !

5 zombie  Thu, May 15, 2008 10:48:46pm

Oh, that is weird.

Maybe it's post-modern poetry, academia-style? Using LGF comments as gibberish to transcend interpretation?

6 Render  Thu, May 15, 2008 10:49:03pm

[Link: www.cs.cmu.edu...]

A comparison of "post similarity" between LGF and Kos?

WEIRDNESS,
R

7 song_and_dance_man[deleted]  Thu, May 15, 2008 10:50:16pm
8 Dan G.  Thu, May 15, 2008 10:50:17pm

"/experiments/post_similarity/" (from the URL)

Suggests that it is an algoritm to detect people posting with multiple handle, maybe? To see if the posts' contents can be used to ID the poster w/o regard to the handle? Just guessing.

More info on this particular project...

9 zombie  Thu, May 15, 2008 10:51:56pm

Oooh, I'm in there too, several times. Here I am talking about "Anthem," the Ayn Rand novella (formatted Carnegie-Mellon stylee):

re _meta_number_ref_ zombie anthem is the only book of hers i ever recommend

you can tell you re over the target by the amount of flak you are getting _meta_end_dot_

re _meta_number_ref_ mandymanners amen

re _meta_number_ref_ zombie

re _meta_number_ref_ zombie jwm

re _meta_number_ref_ zombie anthem and it s more like a novella or long short story you can read it in a day or two _meta_end_dot_ highly recommended a completely devastating critique of collectivism and communistic society _meta_end_dot_ that s the only rand book that i have read i read it in _meta_number_ref_th grade along with animal farm

10 Dan G.  Thu, May 15, 2008 10:51:58pm

Ta Da!

Here is the paper.

11 DesertSage  Thu, May 15, 2008 10:52:00pm

That's some weird shit they got going on there.

12 morganm  Thu, May 15, 2008 10:52:00pm

You can download the python scripts one directory up and there's a Kos directory too. Probably just too what they consider to be partisan sites to test some kind of language algorithms or something, but it's kind of fun to see the actual code.

[Link: www.cs.cmu.edu...]

13 pat  Thu, May 15, 2008 10:52:38pm

Monitors
In
The
Classroom

P

14 Opilio  Thu, May 15, 2008 10:52:56pm

Hmmm. URL includes "post similarity". And a bunch of lizard names: zombie, babbazee, alouette, cattt, doriangray, savage, mandy...

Most of the snippets on the page you linked to seem to originate from lionheart/BNP period.

15 zombie  Thu, May 15, 2008 10:52:56pm

Maybe this page explains something?

16 Charles  Thu, May 15, 2008 10:53:46pm

re: #10 Dan G.

Ta Da!

Here is the paper.

But that's an old paper from 2004. The comments in the first link above are much more recent.

17 fluffy  Thu, May 15, 2008 10:53:48pm

They use python, which is more lizard oriented than perl.

A quick look made me wonder if they were grabbing comments from the lounge.

18 nextstopmars  Thu, May 15, 2008 10:53:55pm

from readme on the /experiments page

Keep only the staff need to be on the web site.
most of the real experiment results should be kept in:
malbec.ark.cs.cmu.edu/~polit/


Obviously it's a KGB plot.

19 Dan G.  Thu, May 15, 2008 10:54:38pm

The Political Blogosphere and the 2004 U.S. Election:
Divided They Blog
4 March 2005

Abstract
In this paper, we study the linking patterns and discussion topics of political bloggers. Our aim is to measure the degree of interaction between liberal and conservative blogs, and to uncover any differences in the structure of the two communities. Specifically, we analyze the posts of 40 “A-list” blogs over the period of two months preceding the U.S. Presidential Election of 2004, to study how often they referred to one another and to quantify the overlap in the topics they discussed, both within the liberal and conservative communities, and also across communities. We also study a single day snapshot of over 1,000 political blogs. This snapshot captures blogrolls (the list of links to other blogs frequently found in sidebars), and presents a more static picture of a broader blogosphere. Most significantly, we find differences in the
behavior of liberal and conservative blogs, with conservative blogs linking to each other more frequently and in a denser pattern.

20 stevieray  Thu, May 15, 2008 10:55:32pm

re: #6 Render

[Link: www.cs.cmu.edu...]

A comparison of "post similarity" between LGF and Kos?

WEIRDNESS,
R

Close. I think they are doing a text analysis looking for Charles' sock puppets -- like the way some trolls accuse Robert Spencer of posting as Hugh on his website... a way to broadcast controversial opinions yet keep your hands clean.

21 zombie  Thu, May 15, 2008 10:55:39pm

re: #10 Dan G.

Ta Da!

Here is the paper.

OK, here's the Abstract:

Abstract
In this paper, we study the linking patterns and discussion topics of political bloggers.
Our aim is to measure the degree of interaction between liberal and conservative blogs, and
to uncover any differences in the structure of the two communities. Specifically, we analyze
the posts of 40 “A-list” blogs over the period of two months preceding the U.S. Presidential
Election of 2004, to study how often they referred to one another and to quantify the overlap in
the topics they discussed, both within the liberal and conservative communities, and also across
communities. We also study a single day snapshot of over 1,000 political blogs. This snapshot
captures blogrolls (the list of links to other blogs frequently found in sidebars), and presents
a more static picture of a broader blogosphere. Most significantly, we find differences in the
behavior of liberal and conservative blogs, with conservative blogs linking to each other more
frequently and in a denser pattern.

Talk about pointless.

People get PhDs for this crap.

22 pat  Thu, May 15, 2008 10:56:03pm

Mandy will not be pleased. May give rise to foul language. Warning.

23 JustMyView  Thu, May 15, 2008 10:56:11pm

re: #16 Charles

But that's an old paper from 2004. The comments in the first link above are much more recent.

Looks like they are doing another election-year study looking at patterns of argument or other aspects of communication.

24 karmic_inquisitor  Thu, May 15, 2008 10:56:17pm

It is a correlation engine.

Here is the dictionary - frequencies posted after each word.

[Link: www.cs.cmu.edu...]

Based on the use of the term "sim" in the naming structures and the structure of the input files, I guess the engine is being run through multiple simulations.

For what purpose I don't know.

25 laZardo  Thu, May 15, 2008 10:56:31pm

I'm not in that experiment?

DISCRMIANTOIN!111

/lunch tiem, bbl

26 Charles  Thu, May 15, 2008 10:56:39pm

I don't think that paper is for the current experiment.

It's years older than the comments at the first link above.

27 Render  Thu, May 15, 2008 10:56:41pm

Following Zombie...

===

[Link: www.cs.cmu.edu...]

"Keep only the staff need to be on the web site.
most of the real experiment results should be kept in:"
malbec.ark.cs.cmu.edu/~polit/

===

EH?,
R

28 Dan G.  Thu, May 15, 2008 10:57:17pm

re: #16 Charles

But that's an old paper from 2004. The comments in the first link above are much more recent.

Perhaps they are running the experiment again for this election season, or for some other event...

29 justacanuck  Thu, May 15, 2008 10:57:17pm

Smells like they are trolling the comments looking for sockpuppets - either sockpuppets here, or socks that post here and elsewhere.

Perhaps it's a project to gather information on the posters and compare for similarities to other posting identities elsewhere, with maybe an end-goal to reveal actual the identities of the commentators here.

Nothing like using software to do the job many of us have done by hand when we suspect a sock in amongst a forum we frequent.

In light of the language in their file names and directory structure, it seems like a formal request for them to declare their intent would be reasonable, IMHO, of course.

30 The Other Les  Thu, May 15, 2008 10:57:43pm

re: #21 zombie

Piled Higher and Deeper?

31 song_and_dance_man[deleted]  Thu, May 15, 2008 10:58:36pm
32 Dan G.  Thu, May 15, 2008 10:59:01pm

re: #21 zombie

Talk about pointless.

People get PhDs for this crap.

It's mostly about determining communications/relationships amongst people with similar politics... I'm sure it would find application in the advertising industry (read spam industry)

33 Killian Bundy  Thu, May 15, 2008 10:59:13pm

re: #10 Dan G.

Ta Da!

Here is the paper.

The Political Blogosphere and the 2004 U.S. Election:
Divided They Blog

/only problem is that the comments on that page are recent

34 zombie  Thu, May 15, 2008 10:59:32pm

Ooooh, look at this page.

35 JustMyView  Thu, May 15, 2008 10:59:47pm

re: #29 justacanuck

Smells like they are trolling the comments looking for sockpuppets - either sockpuppets here, or socks that post here and elsewhere.

Perhaps it's a project to gather information on the posters and compare for similarities to other posting identities elsewhere, with maybe an end-goal to reveal actual the identities of the commentators here.

Nothing like using software to do the job many of us have done by hand when we suspect a sock in amongst a forum we frequent.

In light of the language in their file names and directory structure, it seems like a formal request for them to declare their intent would be reasonable, IMHO, of course.

I doubt they're interested in individual identities. Sounds like they're interested in things like the nature of online networks (e.g., which blogs link to which other blogs). I'm sure they would respond to an inquiry about the purpose of their study.

36 Opilio  Thu, May 15, 2008 10:59:47pm

They're using pretty recent stuff. One of the input files references about 60 threads with numbers mostly in the 28000-28600 range.

37 pat  Thu, May 15, 2008 11:00:44pm

Experiment will not work. Different blogs have different rules. Michelle deletes me with unusual vigor. Others do not care. the subject matter and initial commentary are different, as are the posters. E.G. Michelle's are boring, it is much more stimulating here.

38 zombie  Thu, May 15, 2008 11:01:16pm

re: #34 zombie

Ooooh, look at this page.

Interesting quote:

Did some experiment on comment prediction with link-plsa-lda. Withheld the comment information of 204 users from the training set. Then computed log-likelihoods of their comments on each blog post (985 post * 204 user = 201105 data points) according to the trained model. 5678 positive (i.e., there in fact is a comment by the user on the post) and 195427 negative examples. Below is the results.

Comment prediction?

/Get out of my head!

39 karmic_inquisitor  Thu, May 15, 2008 11:01:46pm

Here is the python script to run the correlation engine

[Link: www.cs.cmu.edu...]

Time to go looking at the other blogs they are running...

40 Dan G.  Thu, May 15, 2008 11:02:15pm

Yes, they are using NEW comments, but likely doing the SAME thing since the paper etc.. are all in the same project directory "TP" ("Talking Points"). It could be the collection of additional data for the data set.

41 Opilio  Thu, May 15, 2008 11:02:34pm
42 The Other Les  Thu, May 15, 2008 11:02:59pm

re: #38 zombie

Is this an attempt to pass the Turing Test?

43 Charles  Thu, May 15, 2008 11:03:02pm

Here's the paper:

[Link: www.cs.cmu.edu...]

44 Yankee Division Son  Thu, May 15, 2008 11:03:19pm

I'm no expert, but.. port sniffing?

45 pat  Thu, May 15, 2008 11:03:20pm

re: #40 Dan G.

That is the way I see it.

46 stevieray  Thu, May 15, 2008 11:03:39pm
47 gymnast  Thu, May 15, 2008 11:03:40pm

Who ever thought that garbage labeled as "research" could be so sophisticated and at the same time be so worthless. Shit like this gives PhDs a bad name for good reason. Polishing farts to get Piled higher and Deeper has become, for the most part, an ephemeral exercise in self deception.

48 karmic_inquisitor  Thu, May 15, 2008 11:03:42pm

re: #38 zombie

This isn't that advanced of stuff - we are working on a simillar concept for internal corporate communications. This has been around a while.

49 pat  Thu, May 15, 2008 11:05:35pm

What a bunch of BS. ...Fail

50 slokat  Thu, May 15, 2008 11:05:50pm

They are testing for similarity in posts? As in do we group think?

51 karmic_inquisitor  Thu, May 15, 2008 11:05:53pm

re: #43 Charles

Hey Charles - you have f'd up their model by introducing comment rating and spinoff links.

52 justacanuck  Thu, May 15, 2008 11:06:42pm

re: #21 zombie

Talk about pointless.

People get PhDs for this crap.

Perhaps its a repeat of his 2004 masterwork, (cough) now refined for the 2008 election season.

53 Sharmuta  Thu, May 15, 2008 11:06:55pm

Creepy!

54 pat  Thu, May 15, 2008 11:07:02pm

re: #51 karmic_inquisitor

Likely true. I spend most of my time on links researching now. So do others.

55 The Other Les  Thu, May 15, 2008 11:07:12pm

re: #49 pat

What a bunch of BS. ...Fail

Are they trying to figure out how we think?

56 Dan G.  Thu, May 15, 2008 11:07:25pm

re: #47 gymnast

Who ever thought that garbage labeled as "research" could be so sophisticated and at the same time be so worthless. Shit like this gives PhDs a bad name for good reason. Polishing farts to get Piled higher and Deeper has become, for the most part, an ephemeral exercise in self deception.

I don't think that this stuff is useless, it may be dated as #48 suggests, but automated compiliation of hot topics is rougly what google news is...

57 slokat  Thu, May 15, 2008 11:07:26pm

(reuter run the same story, LGF respond twice in the like mnnaer.)

58 karmic_inquisitor  Thu, May 15, 2008 11:07:41pm

re: #50 slokat

They are testing for similarity in posts? As in do we group think?

Every group does group think. The patterns are a bit disconcerting at first glance but then make sense when you think about them.

Most businesses use email to destroy original thought. I can prove that.

59 song_and_dance_man[deleted]  Thu, May 15, 2008 11:07:59pm
60 Killian Bundy  Thu, May 15, 2008 11:08:00pm

EXPERIMENTATION WITHOUT REPRESENTATION!

/I demand compensation for being used as an unwilling test subject!

61 Charles  Thu, May 15, 2008 11:08:08pm

I wonder if linking to their experiment will introduce fatal feedback loops, leading to a disturbance in the Krell mind-field and possible space-time discontinuity?

62 The Other Les  Thu, May 15, 2008 11:08:28pm

re: #58 karmic_inquisitor

Every group does group think. The patterns are a bit disconcerting at first glance but then make sense when you think about them.

Most businesses use email to destroy original thought. I can prove that.

Sounds like a future Dilbert strip.

63 pat  Thu, May 15, 2008 11:08:30pm

re: #55 The Other Les

Yes. Exactly. And to identify cross posters. All they had to do was ask.

64 JustMyView  Thu, May 15, 2008 11:09:03pm

re: #44 Yankee Division Son

I'm no expert, but.. port sniffing?

What is port sniffing?

65 zombie  Thu, May 15, 2008 11:09:17pm

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

66 laZardo  Thu, May 15, 2008 11:09:25pm

re: #61 Charles

Maybe it'll start up the Large Hadron Collider and cause division by zero.

/OH SHI-

//actually goes to lunch nao

67 Charles  Thu, May 15, 2008 11:09:42pm

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

I know. It's wide open. Amazing.

68 Dan G.  Thu, May 15, 2008 11:09:45pm

re: #51 karmic_inquisitor

Hey Charles - you have f'd up their model by introducing comment rating and spinoff links.

I wonder if this thread screws up his project, if the subjects (i.e. us) know they're being experimented on it kind of screws up many experiements...

69 karmic_inquisitor  Thu, May 15, 2008 11:09:49pm

re: #64 JustMyView

What is port sniffing?

A technique to see what traffic is going over a network port. Used by hackers and network administrators alike.

70 pat  Thu, May 15, 2008 11:10:04pm

This program could identify sock puppets, but won't. Because it is a dead end. As Karmic says.

71 Killian Bundy  Thu, May 15, 2008 11:10:26pm

re: #64 JustMyView

What is port sniffing?

/turn your head to the left and breathe through your nose

72 Dan G.  Thu, May 15, 2008 11:11:06pm

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?


Computer Science

73 The Other Les  Thu, May 15, 2008 11:11:14pm

re: #61 Charles

I wonder if linking to their experiment will introduce fatal feedback loops, leading to a disturbance in the Krell mind-field and possible space-time discontinuity?

Um... didn't they blow up a planet that way?

74 gymnast  Thu, May 15, 2008 11:11:24pm

re: #56 Dan G.

I don't think that this stuff is useless, it may be dated as #48 suggests, but automated compiliation of hot topics is rougly what google news is...

And Google news has what qualities that you are willing to allow it to do your thinking for you? Are you contextually illiterate?

75 haakondahl  Thu, May 15, 2008 11:11:25pm

Keep only the staff need to be on the web site.
most of the real experiment results should be kept in:
malbec.ark.cs.cmu.edu/~polit/

76 zombie  Thu, May 15, 2008 11:11:50pm

re: #43 Charles

Here's the paper:

[Link: www.cs.cmu.edu...]

Abstract
In this work, we address the twin problems of unsupervised
topic discovery and estimation of topic specific influence of
blogs. We propose a new model that can be used to provide a
user with highly influential blog postings on the topic of the
user’s interest.

Idiots. Using computer algorithms to determine how to made a good blog post?

How about this for a clue-byte, you bozos: Have political acumen!

77 pat  Thu, May 15, 2008 11:12:09pm

Did pat say any thing snappy?

78 JustMyView  Thu, May 15, 2008 11:12:16pm

re: #56 Dan G.

I don't think that this stuff is useless, it may be dated as #48 suggests, but automated compiliation of hot topics is rougly what google news is...

Right. There are all sorts of reasons to be developing systems and tools to analyze large text databases. I have no idea what they're searching for, but it doesn't appear to be anything having to do w/ the content of the posts here.

79 victor_yugo  Thu, May 15, 2008 11:12:19pm

[Link: www.cs.cmu.edu...]

(This link WILL become invalid soon!)

It looks to my un-Pythonesque eye like a programmer is using the cosine function to create some kind of mapping on how comments refer to one another based on comment location in the thread; and looking at different types of postings from Charles (open thread, breaking news, news outlet criticisms) to see if there are any trends in the discussions that vary according to the posting type.

That has to be one of the worst run-on sentences I've ever written.

80 Opilio  Thu, May 15, 2008 11:12:25pm

From one of the files:

topic 003
3253 mccain
2526 huckabe
4497 romnei
4527 rudi
2229 gop
2194 giuliani
3368 mitt
2525 huck
5310 think
3323 mike

topic 006
2212 god
4350 religion
864 christian
445 believ
4352 religi
875 church
3412 mormon
1882 faith
4384 republican
302 atheist

81 The Other Les  Thu, May 15, 2008 11:12:27pm

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

At the level of dumb I'd have to say it was Political Science.

82 bosforus  Thu, May 15, 2008 11:12:56pm

who is lionheart?

83 Dan G.  Thu, May 15, 2008 11:13:05pm

re: #74 gymnast

And Google news has what qualities that you are willing to allow it to do your thinking for you? Are you contextually illiterate

Fetching news for me isn't thinking for me... "contextually illiterate"... define that please.

84 justacanuck  Thu, May 15, 2008 11:13:18pm

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

A number of the links to pdf reports and results papers are broken.

And on a completely unrelated topic, WinHTTrack is a wonderful utility for copying entire html sites. ;-)

85 redc1c4  Thu, May 15, 2008 11:13:19pm

re: #61 Charles

I wonder if linking to their experiment will introduce fatal feedback loops, leading to a disturbance in the Krell mind-field and possible space-time discontinuity?

one can only hope...

or maybe it will result in "Change!" %-)

86 The Other Les  Thu, May 15, 2008 11:13:25pm

re: #68 Dan G.

I wonder if this thread screws up his project, if the subjects (i.e. us) know they're being experimented on it kind of screws up many experiements...

The observer is part of the event being observed.

87 haakondahl  Thu, May 15, 2008 11:13:48pm

Don't remember how to ask a Unix about a particular user.

88 Ed Driscoll  Thu, May 15, 2008 11:13:55pm

Am I the only person who looked at the page that Charles linked to, and had flashbacks to this?

89 slokat  Thu, May 15, 2008 11:14:01pm
90 karmic_inquisitor  Thu, May 15, 2008 11:14:07pm

Check out this breakdown of how they categorize the nature of blog posts ...

[Link: www.cs.cmu.edu...]

91 JustMyView  Thu, May 15, 2008 11:14:11pm

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

Can you operate on their files or can you just look at them? Maybe they don't care about keeping them private.

92 redc1c4  Thu, May 15, 2008 11:14:32pm

re: #64 JustMyView

What is port sniffing?

after you pour some in the glass, but before you take a sip, swirl it around a bit and inhale whist the dark chocolate is melting on your tongue...

93 pat  Thu, May 15, 2008 11:14:51pm

re: #80 Opilio

The first is likely one of Charles polls, but the latter are word amalgamations. Will not tell anything without context.
/obama

94 haakondahl  Thu, May 15, 2008 11:14:54pm

re: #81 The Other Les

At the level of dumb I'd have to say it was Political Science.

It is in fact.

95 JustMyView  Thu, May 15, 2008 11:14:59pm

re: #69 karmic_inquisitor

A technique to see what traffic is going over a network port. Used by hackers and network administrators alike.

Thanks.

96 Charles  Thu, May 15, 2008 11:14:59pm

Somebody's in for a surprise when they check the server logs.

97 victor_yugo  Thu, May 15, 2008 11:15:08pm

re: #83 Dan G.

Fetching news for me isn't thinking for me...

When Google News decides what news is worth fetching, and ignores important truth because it fails to align with their intentions...

It is thinking for you.

98 slokat  Thu, May 15, 2008 11:15:35pm

A blog related to the author

99 Sharmuta  Thu, May 15, 2008 11:15:44pm

re: #82 bosforus

Put it in the LGF search- that's all I care to say about him.

100 gymnast  Thu, May 15, 2008 11:15:45pm

re: #83 Dan G.

Fetching news for me isn't thinking for me... "contextually illiterate"... define that please.

You just gave a pretty good example.

101 Dan G.  Thu, May 15, 2008 11:15:54pm

re: #87 haakondahl

Don't remember how to ask a Unix about a particular user.

"finger" is one way.

102 JustMyView  Thu, May 15, 2008 11:15:56pm

re: #92 redc1c4

after you pour some in the glass, but before you take a sip, swirl it around a bit and inhale whist the dark chocolate is melting on your tongue...

Great! I like this definition better than the one I got from karmic inquisitor. Wish you had sent some port along with it though.

103 zombie  Thu, May 15, 2008 11:16:12pm

re: #72 Dan G.

Computer Science

So it IS the the Computer Science Department.

...


FAIL.

104 haakondahl  Thu, May 15, 2008 11:16:32pm
105 victor_yugo  Thu, May 15, 2008 11:16:36pm

re: #96 Charles

Somebody's in for a surprise when they check the server logs.

BWAAAHAHAHAHAHAHAHAHA

106 pat  Thu, May 15, 2008 11:16:36pm

Whoa. Slokat, good work. You sure?

107 justacanuck  Thu, May 15, 2008 11:16:47pm

re: #96 Charles

Somebody's in for a surprise when they check the server logs.

Yep. Those damn Canucks...

108 Dan G.  Thu, May 15, 2008 11:17:01pm

re: #100 gymnast

You just gave a pretty good example.

Put the definition into a sentence.

109 JHW  Thu, May 15, 2008 11:17:40pm

Schrodingers cat? Not that I know very much about what I'm referencing.

110 guzziguy  Thu, May 15, 2008 11:18:03pm

re: #79 victor_yugo

[Link: www.cs.cmu.edu...]

(This link WILL become invalid soon!)

It looks to my un-Pythonesque eye like a programmer is using the cosine function to create some kind of mapping on how comments refer to one another based on comment location in the thread; and looking at different types of postings from Charles (open thread, breaking news, news outlet criticisms) to see if there are any trends in the discussions that vary according to the posting type.

That has to be one of the worst run-on sentences I've ever written.

Not too different from the "tag storm" provided here?

111 slokat  Thu, May 15, 2008 11:18:06pm

Must be a programmer

Python
&
Knitting?

112 redc1c4  Thu, May 15, 2008 11:18:06pm

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

copy everything, if you've got the disk space... could be useful, could be educational for them...

/just a thought %-)

113 bosforus  Thu, May 15, 2008 11:18:27pm

re: #99 Sharmuta

Put it in the LGF search- that's all I care to say about him.

I figured it had to be a username but I've never seen so I wasn't positive. Thanks.

114 karmic_inquisitor  Thu, May 15, 2008 11:18:54pm

Checkout the dictionary for dkos.

[Link: www.cs.cmu.edu...]

They sure use the term "cheney" a lot.

I wonder why? /S

115 zombie  Thu, May 15, 2008 11:19:20pm

Here's the author's personal (non-academic) web site:

Fashion Projects

Has this tidbit:

Tae Yano is a software engineer. She is completing her PhD in computer Science at Carnegie Mellon.

It's a she!

116 victor_yugo  Thu, May 15, 2008 11:19:22pm

re: #104 haakondahl

I love the page title:

Litttle math kills no one

117 Dan G.  Thu, May 15, 2008 11:19:33pm

We could coordinate with the other conservative blogs that its watching and spoof a result that would trigger the program...

118 haakondahl  Thu, May 15, 2008 11:19:38pm

re: #101 Dan G.

"finger" is one way.

user1@user1-desktop:~$ finger taey@cs.cmu.edu
[cs.cmu.edu]
One entry found for exact uid match
Login: taey Name: Tae Yano
Directory: /afs/cs.cmu.edu/user/taey
Mail is forwarded to taey+@imap.srv.cs.cmu.edu
No Plan

119 bosforus  Thu, May 15, 2008 11:19:39pm

Charles, are you saving a copy of their directory?

120 Sharmuta  Thu, May 15, 2008 11:19:47pm

re: #113 bosforus

I figured it had to be a username but I've never seen so I wasn't positive. Thanks.

You could use the Tag Storm too.

121 Dan G.  Thu, May 15, 2008 11:20:03pm

re: #115 zombie

Here's the author's personal (non-academic) web site:

Fashion Projects

Has this tidbit:

It's a she!

DOH!

122 Charles  Thu, May 15, 2008 11:20:26pm

Notice the title of this page:

[Link: www.cs.cmu.edu...]

"Shaking a bush. Poking a shrub."

123 gymnast  Thu, May 15, 2008 11:20:31pm

re: #108 Dan G.

Put the definition into a sentence.

Look it up. It's in plain English.

124 zombie  Thu, May 15, 2008 11:20:38pm
125 zombie  Thu, May 15, 2008 11:21:14pm

re: #91 JustMyView

Can you operate on their files or can you just look at them? Maybe they don't care about keeping them private.

I'd need the admin password for that.

126 pat  Thu, May 15, 2008 11:21:22pm

You can have her

127 bystander  Thu, May 15, 2008 11:21:34pm

Charles,
My guess is somebody's learning about Bayesian Classification indexes in an Algorithms CompSci class and decided to datamine your site. To what end, I don't know.

The least they could have done is told you, before the fact, that they'd be crawling your site for data. Or respected your robots file.

128 haakondahl  Thu, May 15, 2008 11:21:35pm

re: #117 Dan G.

We could coordinate with the other conservative blogs that its watching and spoof a result that would trigger the program...

The experiment seems to be an attempt to predict which users will post on a given topic, perhaps given the topic itself, perhaps given who has already posted, perhaps both. This is being run against Daily Kos (I presume that's what DK is for) and LGF.

129 Nightwatch  Thu, May 15, 2008 11:21:38pm

AAARRRGGGHHH!
What the HELL, did I just look at Hillary in the buff? DAMN my eyes are burning, CHARLES!...WTF? ATROPINE! ATROPINE! AUTO-INJECT NOW! SECURITY OUT!

130 Charles  Thu, May 15, 2008 11:21:40pm

Lots of blogs listed here:

[Link: www.cs.cmu.edu...]

131 zombie  Thu, May 15, 2008 11:21:47pm

re: #96 Charles

Somebody's in for a surprise when they check the server logs.

I don't think she knows how to check the server logs.

132 Dan G.  Thu, May 15, 2008 11:22:10pm

re: #118 haakondahl

user1@user1-desktop:~$ finger taey@cs.cmu.edu
[cs.cmu.edu]
One entry found for exact uid match
Login: taey Name: Tae Yano
Directory: /afs/cs.cmu.edu/user/taey
Mail is forwarded to taey+@imap.srv.cs.cmu.edu
No Plan


Since security has been a second thought for Tae try

echo "We see you too. LGF" > /afs/cs.cmu.edu/user/taey/.plan

It will display whenever the screen saver kicks on (if her desktop is in fact a *nix based system).

133 hermeneutics  Thu, May 15, 2008 11:22:15pm

This is not malicious, but standard academic "stuff." Some poor doctoral student, probably in computer programming/software, is desperate to complete dissertation research.

134 slokat  Thu, May 15, 2008 11:22:34pm

re: #106 pat

Just going up the tree

135 justacanuck  Thu, May 15, 2008 11:22:34pm

re: #115 zombie

Here's the author's personal (non-academic) web site:

Fashion Projects

Has this tidbit:


It's a she!

Well she is gonna have a conniption when she reads her server logs.
Damn she's got a gazillion files on there ...

136 zombie  Thu, May 15, 2008 11:23:06pm

More info on her:

Tae Yano
Ph.D. student, LTI
NLP in the political domain
Japanese, English, Spanish, French English
C, C++, Java, Perl, Python

NLP = Neuro-linguistic programming?

137 Render  Thu, May 15, 2008 11:23:19pm

re: #43 Charles

I think that might be paper #2 of three, dated '06. Zombie found paper #1, dated '04. The initial posted link has posts from Jan-Feb of '08.

Yet to be mapped/written?

ONGOING,
R

138 Charles  Thu, May 15, 2008 11:23:19pm

I tried to log in with anonymous FTP, but no go.

139 Mel Lono  Thu, May 15, 2008 11:23:27pm
2 Methodology
In order to get a representative view of the liberal and conservative blog communities, we cast our
nets wide and gathered a single day’s snapshot of over a thousand political blogs. Since we also
wanted to do a careful study of the heart of the political blogosphere, we then analyzed the posts
for the two months preceding the election with a smaller set of 40 influential blogs.

Fatal flaw

140 Charles  Thu, May 15, 2008 11:23:48pm

re: #136 zombie

More info on her:


NLP = Neuro-linguistic programming?

Yep, that's what it means.

141 redc1c4  Thu, May 15, 2008 11:24:19pm

re: #102 JustMyView

Great! I like this definition better than the one I got from karmic inquisitor. Wish you had sent some port along with it though.

i did, but it obviously went into the wrong port on your end...

may i suggest Robert Hall port? not sure where you live, but it's very nice stuff, and they ship. %-)

142 song_and_dance_man[deleted]  Thu, May 15, 2008 11:24:26pm
143 zombie  Thu, May 15, 2008 11:24:33pm

re: #122 Charles

Notice the title of this page:

[Link: www.cs.cmu.edu...]

"Shaking a bush. Poking a shrub."

Good catch!

144 victor_yugo  Thu, May 15, 2008 11:24:55pm

re: #122 Charles

That page has (had) links to compare/contrasts between LGF and Kos. Alas, all links to results are dead.

145 Dan G.  Thu, May 15, 2008 11:24:56pm

re: #136 zombie

More info on her:

NLP = Neuro-linguistic programming?

Probably NLP

146 JustMyView  Thu, May 15, 2008 11:25:55pm

re: #136 zombie

More info on her:


NLP = Neuro-linguistic programming?

Yes, I'm sure that's right. CMU has a big CS department, and they do a lot of work in that area.

147 zombie  Thu, May 15, 2008 11:26:05pm

re: #138 Charles

I tried to log in with anonymous FTP, but no go.

I was gonna try that.

I'll put on my neuro-linguistic programming hat and PREDICT what she'd use for her password.

148 MrArchieBunker  Thu, May 15, 2008 11:26:15pm

I looked at all the links and couldn't make heads or tails out of it, but then again I'm not a techie. It would seem to me the custodians working the night shift at Carnagie-Mellon are engaged in more productive and usefeul work.

149 Yankee Division Son  Thu, May 15, 2008 11:26:33pm

re: #96 Charles

Somebody's in for a surprise when they check the server logs.

You're assuming they actually check the logs.. with directory browsing enabled and permissions open I tend to doubt it..

150 justacanuck  Thu, May 15, 2008 11:26:42pm

re: #138 Charles

I tried to log in with anonymous FTP, but no go.

I can ftp to you what I'm collecting...

151 bystander  Thu, May 15, 2008 11:26:57pm

Yep. looks like she's crawling lots of political sites on both the left and right side of the scale:

[Link: www.cs.cmu.edu...]

AMERICAblog_html/ 23-Dec-2007 19:39 -
BAGnewsNotes_html/ 26-Dec-2007 11:16 -
BelmontClub_html/ 23-Dec-2007 17:21 -
BlogForBush_html/ 24-Dec-2007 18:09 -
CaptainsQuarters_html/ 20-Dec-2007 21:25 -
CarpetBagger_html/ 20-Dec-2007 21:11 -
CrooksAndLiers_html/ 20-Dec-2007 20:49 -
DailyDish_html/ 26-Dec-2007 12:33 -
DailyKos_html/ 15-Dec-2007 02:16 -
Eschaton_html/ 27-Dec-2007 10:23 -
HughHewitt_html/ 23-Dec-2007 22:05 -
Instapundit_html/ 17-Dec-2007 13:16 -
JuanCole_html/ 24-Dec-2007 19:16 -
LeftCoaster_html/ 20-Dec-2007 21:01 -
MV_html/ 22-Dec-2007 09:14 -
MarcAmbinder_html/ 24-Dec-2007 08:40 -
MatthewYglesias_html/ 23-Dec-2007 20:49 -
MichelleMalkin_html/ 21-Dec-2007 10:48 -
MotherJones_html/ 17-Dec-2007 12:25 -
MyDD_html/ 23-Dec-2007 21:08 -
NRO_html/ 21-Dec-2007 11:47 -
Newsbuster_html/ 17-Dec-2007 10:26 -
NoLeftTurn_html/ 24-Dec-2007 10:28 -
OliverWillis_html/ 23-Dec-2007 21:24 -
PoliPundit_html/ 24-Dec-2007 10:38 -
PowerLine_html/ 17-Dec-2007 12:39 -
RCP_html/ 23-Dec-2007 21:38 -
Redstate_html/ 17-Dec-2007 11:43 -
RightWingNews_html/ 25-Dec-2007 00:37 -
RightWingNut_html/ 24-Dec-2007 10:55 -
RogerLSimon_html/ 25-Dec-2007 01:16 -
ScrappleFace_html/ 25-Dec-2007 01:29 -
TPM_html/ 17-Dec-2007 10:50 -
TPMmuckraker_html/ 23-Dec-2007 21:52 -
TalkLeft_html/ 20-Dec-2007 20:29 -
ThinkProgress_html/ 24-Dec-2007 18:10 -
master.txt


BTW, here's her "about me" page:
[Link: people.cs.cmu.edu...]

152 slokat  Thu, May 15, 2008 11:27:02pm

Found a paper on teaching third and fourth graders how to program using three letter commands... cool stuff.

She did spend a lot of time doing maps about liberal sites.

153 karmic_inquisitor  Thu, May 15, 2008 11:27:03pm

BTW - they have selected a sample of posts from LGF and DKos respectively, pairing the posts based on general topic.

When they run the simulation, they do it off of pre-distilled pages where they have reduced the page by throwing out html markup, ads and other content, and then further reduce the comment lines to eliminate words that are so common that they are meaningless.

They then build a dictionary for each site (one for kos, one for LGF) and score the words on frequency and relative importance (their formulaic version of relative importance).

The rest of the processing I assume to be a comparison of the dictionaries for commonality, and then submission of a new post for "scoring" to determine if the new post is a candidate for a "hot post" or one that should be skipped over. One could also predict which audience is likely to want the camdidate - if it has the word "Cheney" send it to the Kos pile, if it has "Quadaffi" send it to LGF.

154 Dan G.  Thu, May 15, 2008 11:27:13pm

re: #147 zombie

I was gonna try that.

I'll put on my neuro-linguistic programming hat and PREDICT what she'd use for her password.

My NLP hat predicts that the password is... ""

155 Charles  Thu, May 15, 2008 11:27:27pm

I can see several avenues of attack that would probably let someone get full access to their server. Not that I would do something like that, of course.

156 haakondahl  Thu, May 15, 2008 11:27:31pm

re: #138 Charles

I tried to log in with anonymous FTP, but no go.

Ditto. CR

157 JustMyView  Thu, May 15, 2008 11:27:52pm

re: #141 redc1c4

i did, but it obviously went into the wrong port on your end...

may i suggest Robert Hall port? not sure where you live, but it's very nice stuff, and they ship. %-)

I'll keep that in mind next time I shop. Thanks so much!

158 victor_yugo  Thu, May 15, 2008 11:28:00pm

Interesting data plot here. (PDF)

159 Dan G.  Thu, May 15, 2008 11:28:04pm

oops... should have read "spacebar"

160 stevieray  Thu, May 15, 2008 11:28:27pm

I wonder if [carnagie-mellon] we could throw off [carnagie-mellon] their delicate algorithms [carnagie-mellon] by tossing in random [carnagie-mellon] words?

161 Opilio  Thu, May 15, 2008 11:28:30pm

This file seems to contain a list of words parsed from a set of LGF comments along with a couple of frequency counters, probably # of instances, and # of different threads.

Dictionary of LGF vocabulary

For example:

smurf 28 9
smurfed 1 1
smurfette 12 8
smurfettes 2 2
smurfish 1 1
smurfs 9 7
smurftastic 1 1

162 zombie  Thu, May 15, 2008 11:28:36pm

re: #145 Dan G.

Probably NLP

Quote:

neuro-linguistic programming, a field stated by its founders to provide a way to examine and decompose human subjective experience and communication into component parts, generate models from these, and work with them.

Ahh, so basically it's a merger between post-Marxist deconstructionism and THX-1138-esque Orwellian personality dehumanization.

Nice.

163 redc1c4  Thu, May 15, 2008 11:28:47pm

re: #108 Dan G.

Put the definition into a sentence.

"One should always include the definition of a new word when dealing with students just entering into the program."

there ya go...

LGF is the original tough crowd. if you can't fact check yo' own ass, ya really shouldn't expect other lizards to do it for ya.

/google

164 bystander  Thu, May 15, 2008 11:29:48pm

re: #153 karmic_inquisitor

They then build a dictionary for each site (one for kos, one for LGF) and score the words on frequency and relative importance (their formulaic version of relative importance).
The rest of the processing I assume to be a comparison of the dictionaries for commonality, and then submission of a new post for "scoring" to determine if the new post...


Excellent deduction! That is the very definition of a Bayesian Classification index.

165 The Other Les  Thu, May 15, 2008 11:29:49pm

re: #155 Charles

I can see several avenues of attack that would probably let someone get full access to their server. Not that I would do something like that, of course.

"We could get the money..." (Looks up and pulls down the microphone.) "But that would be wrong."

166 Dan G.  Thu, May 15, 2008 11:29:57pm

re: #162 zombie

Ahh, so basically it's a merger between post-Marxist deconstructionism and THX-1138-esque Orwellian personality dehumanization.

Nice.

Yes... the "linguistic" part should have given it away... Neural networks, I believe, are what the non-marxists in the field study.

167 Killian Bundy  Thu, May 15, 2008 11:30:06pm

HOME. of Tae Yano

/what the hell is Google on about today?

168 The Other Les  Thu, May 15, 2008 11:30:33pm

re: #160 stevieray

I wonder if [carnagie-mellon] we could throw off [carnagie-mellon] their delicate algorithms [carnagie-mellon] by tossing in random [carnagie-mellon] words?

I'd go with "fnord." on that one.

169 redc1c4[deleted]  Thu, May 15, 2008 11:30:35pm
170 victor_yugo  Thu, May 15, 2008 11:31:14pm

re: #163 redc1c4

LGF is the original tough crowd. if you can't fact check yo' own ass, ya really shouldn't expect don't be surprised when the other lizards to do it for ya.

Fixed.

171 Charles  Thu, May 15, 2008 11:31:28pm

Come on. Let's not post disparaging comments about people's looks. Chances are very good they're going to read this thread.

172 haakondahl  Thu, May 15, 2008 11:32:04pm

re: #164 bystander

Excellent deduction! That is the very definition of a Bayesian Classification index.

I liked the method of withdrawing all posts by 204 commenters, then asking the system to predict what those posts were (or *if* the post existed, I think) based on all the rest.

173 victor_yugo  Thu, May 15, 2008 11:32:07pm

re: #168 The Other Les

Go with what?

You have some empty quotes in there.

174 Dan G.  Thu, May 15, 2008 11:32:12pm

re: #163 redc1c4

"One should always include the definition of a new word when dealing with students just entering into the program."

there ya go...

LGF is the original tough crowd. if you can't fact check yo' own ass, ya really shouldn't expect other lizards to do it for ya.

/google

I can fact check, but I'm not going to chase down a phrase that uses the word "context" without any context... especially when it is slung at me as an ad hominem.

175 haakondahl  Thu, May 15, 2008 11:33:07pm

re: #171 Charles

Come on. Let's not post disparaging comments about people's looks. Chances are very good they're going to read this thread.

Chances are very good that they're going to turn that comment in with their Dissertation.

176 JustMyView  Thu, May 15, 2008 11:33:49pm

Spelling tip: Carnegie Mellon University, not Carnegie-Mellon University. No hyphen. Not that it particularly matters.

177 The Other Les  Thu, May 15, 2008 11:33:53pm

re: #173 victor_yugo

Go with what?

You have some empty quotes in there.

Really? [fnord[ I didn't [fnord] notice that. [fnord]

178 pat  Thu, May 15, 2008 11:34:10pm

Besides, if she is a fashion guru, she is likely a stunner when dressed for the dance.

179 zombie  Thu, May 15, 2008 11:34:45pm

re: #171 Charles

Chances are very good they're going to read this thread.

Think so?

OK, if that's the case:

---

Tae, you really need to learn how to password-protect your directories. Otherwise people on the Web will be able to see all your dirty laundry and hidden stuff.

Outside the ivory tower of academia, it's a rough-and-tumble world out here.

---

180 bosforus  Thu, May 15, 2008 11:35:00pm

I'm enjoying the graphs. They're pretty.

181 itellu3times  Thu, May 15, 2008 11:35:01pm

re: #136 zombie

NLP = Neuro-linguistic programming?

Carnegie-Mellon, I expect it's simply natural language processing.

182 karmic_inquisitor  Thu, May 15, 2008 11:35:23pm

For all we know this could actually be an experiement out of the sociology department where a fake CS experiment is set up and left "wide open" to see which blog would find it first.

/I know - but it is late.

183 Dan G.  Thu, May 15, 2008 11:35:45pm

re: #179 zombie

Think so?

OK, if that's the case:

---

Tae, you really need to learn how to password-protect your directories. Otherwise people on the Web will be able to see all your dirty laundry and hidden stuff.

Outside the ivory tower of academia, it's a rough-and-tumble world out here.

---

What's more Tae, your competitors can easily scoop you if you are not careful.

184 itellu3times  Thu, May 15, 2008 11:35:53pm

pardon my hyphen

185 The Other Les  Thu, May 15, 2008 11:36:41pm

re: #178 pat

Besides, if she is a fashion guru, she is likely a stunner when dressed for the dance.

I'm going through the archive of her fashion blog. She could be a stunner but I wouldn't bet on that.

186 karmic_inquisitor  Thu, May 15, 2008 11:36:50pm

re: #179 zombie

I can see why an academic doing this sort of research would leave things open and then observe the reaction.

i hope she has everything backed up.

I am curious what conclusions she has drawn.

187 kuchuklambat  Thu, May 15, 2008 11:36:52pm

ok here's how I see it. There is a whole group at Carnegie Mellon CS department doing research in Artificial Intelligence by running statistics on blogs. A grad student Tae Yano stashed her research files on a server and left them wide open to WWW. The files pertain to running basic stats on DK and LGF, up to cosine similarity.
[Link: www.cs.cmu.edu...]
is a summary of points (haven't scoured it for bias).

My favorite picture is [Link: www.cs.cmu.edu...]

The AI goal would be to have a blog-commenting software indistinguishable from a blog-commenting human. For all I know, this, or any other comment, is already written by a python script.

188 song_and_dance_man[deleted]  Thu, May 15, 2008 11:37:00pm
189 slokat  Thu, May 15, 2008 11:37:07pm

http://www.cs.cmu.edu/~taey/TP/links0.html

If you want to know what threads were used in the comparison

190 itellu3times  Thu, May 15, 2008 11:37:08pm

bush bush bush bush rove hillary dillary dock obama osama

191 zombie  Thu, May 15, 2008 11:37:27pm

re: #181 itellu3times

Carnegie-Mellon, I expect it's simply natural language processing.

You're right.

Here's her advisor's page. He's supervising this thesis.

From his page:

Research interests:
Natural language processing (morphological, syntactic, semantic, and pragmatic analysis, algorithms, multilingual methods)
Machine learning (statistical languages, unsupervised grammar induction, structured prediction)
NLP applications (translation, information aggregation, the social sciences)
192 MrArchieBunker  Thu, May 15, 2008 11:37:28pm

What in the world is her purpose here? I just dont get it. Could a techie break it down?

193 Killian Bundy  Thu, May 15, 2008 11:38:05pm

Wjat's the deal with Google?

/seriously

194 theheat[deleted]  Thu, May 15, 2008 11:38:29pm
195 victor_yugo  Thu, May 15, 2008 11:38:58pm

re: #180 bosforus

I'm enjoying the graphs. They're pretty.

I made this one about a year and a half ago. It charts comment volume by thread, over time.

Opening that spreadsheet still takes almost 5 minutes, even on a 2-CPU system.

196 slokat  Thu, May 15, 2008 11:39:17pm

re: #180 bosforus

I was trying to figure out what that one graphed...

197 karmic_inquisitor  Thu, May 15, 2008 11:39:18pm

BTW - CM does a shit load of research in cooperation with companies that pay for it. I know Seimens has a bunch of projects there. There is a chance that someone is paying for some of this.

Who would want to distill political blogs?

198 Render  Thu, May 15, 2008 11:39:24pm

To what end?

Building a comment bot?

=

Asian females with large brains are sexy.

DISAVOW
ANY
STATEMENTS,
R

199 The Other Les  Thu, May 15, 2008 11:39:30pm

re: #178 pat

She uses the phrase "sustainable fashion."

That is not a good sign.

200 stevieray  Thu, May 15, 2008 11:39:46pm

re: #176 JustMyView

Spelling tip: Carnegie Mellon University, not Carnegie-Mellon University. No hyphen. Not that it particularly matters.

You're correct, but this experiment seems to look for single words, so I wanted it to show up as one word, not a phrase. I probably should have used the "_" instead... or better yet, I should just get some sleep. Dawn comes soon enough for the working class.

Catch y'all tomorrow.

201 bosforus  Thu, May 15, 2008 11:39:55pm

re: #192 MrArchieBunker

What in the world is her purpose here? I just dont get it. Could a techie break it down?

I'm pretty sure all of the data eventually gets summarized by an MSM talking head who says, Have you heard about blogs? Apparently they're better at our job than we are.

202 karmic_inquisitor  Thu, May 15, 2008 11:39:56pm

re: #193 Killian Bundy

read my 197.

203 zombie  Thu, May 15, 2008 11:39:59pm

re: #187 kuchuklambat

The AI goal would be to have a blog-commenting software indistinguishable from a blog-commenting human. For all I know, this, or any other comment, is already written by a python script.

I really like taking pictures of moonbatsbatsbatsbatsss...

...

zombie kernel panic.

Reboot zombie mode.

/

End.

204 itellu3times  Thu, May 15, 2008 11:40:02pm

re: #192 MrArchieBunker

What in the world is her purpose here? I just dont get it. Could a techie break it down?

well, I glanced at the paper, and still have to guess, but it's some kind of attempt to be able to let some robot read a blog and determine what people are talking about by using keywords rather than really trying to understand what's being said.

CIA/NSA love this kind of stuff.

205 pat  Thu, May 15, 2008 11:40:20pm

re: #199 The Other Les

She uses the phrase "sustainable fashion."

That is not a good sign.

LOL. Good Point. Actually laughing out loud here.

206 Dan G.  Thu, May 15, 2008 11:40:39pm

Nite all,

Nice meeting you Tae, good luck with your research... ;)

207 bosforus  Thu, May 15, 2008 11:40:56pm

re: #196 slokat

I was trying to figure out what that one graphed...

At first I thought it was a ranking from Alexa but Kos' not so. I think maybe the y-axis is a percentage, maybe of comments, but I have no idea how to interpret all the numbers above the graph.

208 song_and_dance_man[deleted]  Thu, May 15, 2008 11:41:01pm
209 Sharmuta  Thu, May 15, 2008 11:41:10pm

re: #161 Opilio

This file seems to contain a list of words parsed from a set of LGF comments along with a couple of frequency counters, probably # of instances, and # of different threads.

Dictionary of LGF vocabulary

For example:

smurf 28 9
smurfed 1 1
smurfette 12 8
smurfettes 2 2
smurfish 1 1
smurfs 9 7
smurftastic 1 1

LMAO! That is smurftastic!

210 Killian Bundy  Thu, May 15, 2008 11:41:16pm

re: #202 karmic_inquisitor

read my 197.

I'm wondering about today's logo.

/wtf?

211 bosforus  Thu, May 15, 2008 11:41:20pm

re: #207 bosforus

At first I thought it was a ranking from Alexa but Kos' not so. I think maybe the y-axis is a percentage, maybe of comments, but I have no idea how to interpret all the numbers above the graph.

PIMF

212 MrArchieBunker  Thu, May 15, 2008 11:41:36pm

re: #204 itellu3times

Thanks, THAT makes sense.

213 itellu3times  Thu, May 15, 2008 11:42:00pm

bush plutonium opium Afghanistan

214 karmic_inquisitor  Thu, May 15, 2008 11:42:36pm

re: #210 Killian Bundy

I'm wondering about today's logo.

/wtf?

lasers. Invention thereof.

215 hermeneutics  Thu, May 15, 2008 11:42:56pm

Why do you think she's funded by anything other than the department? This doens't look expensive. My hunch is that she's using university computers and time for her diss.

216 victor_yugo  Thu, May 15, 2008 11:43:20pm

re: #203 zombie

I really like taking pictures of moonbatsbatsbatsbatsss...

...

zombie kernel panic.

Reboot zombie mode.

/

End.

"Assuming that every computer program has at least one bug and one extraneous operation, it necessarily follows that all programs can be reduced to a single instruction that doesn't work."

217 Killian Bundy  Thu, May 15, 2008 11:43:37pm

re: #214 karmic_inquisitor

lasers. Invention thereof.

Thanks.

/duh, never thought to just click the logo

218 The Other Les  Thu, May 15, 2008 11:44:08pm

re: #213 itellu3times

bush plutonium opium Afghanistan

spam spam lovely spam wonderful spam

219 zombie  Thu, May 15, 2008 11:44:27pm

Here's another tiny picture of her. Self portrait.

Was not easy to find.

220 bosforus  Thu, May 15, 2008 11:44:30pm

re: #196 slokat

I was trying to figure out what that one graphed...

I get it now. In[26] is the x-axis. Out[26] is the y-axis. After the arrays are sorted in ascending order they're graphed. But what are the numbers?

221 haakondahl  Thu, May 15, 2008 11:45:04pm

re: #182 karmic_inquisitor

For all we know this could actually be an experiement out of the sociology department where a fake CS experiment is set up and left "wide open" to see which blog would find it first.

/I know - but it is late.

I thought of that before I tried to ftp. Could be a honeypot.

A classic honeypot is creating a file called something like passwords.txt or all_salaries.xls, and leaving it where people can get at it. Then you watch to see who opens or copies the file. Since you created the file, you know that nobody besides you has any legitimate business bothering that file.

222 The Other Les  Thu, May 15, 2008 11:45:19pm

re: #219 zombie

Here's another tiny picture of her. Self portrait.

Was not easy to find.

Pretentious.

223 itellu3times  Thu, May 15, 2008 11:45:21pm

anyway don't we already have carnivore reading our mail? surely it can read blogs, too.

224 bosforus  Thu, May 15, 2008 11:45:29pm

re: #220 bosforus

Actually no, but getting closer.

225 haakondahl  Thu, May 15, 2008 11:45:30pm

re: #208 song_and_dance_man

Where is the boob thread graph?

Turned out goose eggs.

226 song_and_dance_man[deleted]  Thu, May 15, 2008 11:45:50pm
227 Killian Bundy  Thu, May 15, 2008 11:45:52pm

re: #215 hermeneutics

Why do you think she's funded by anything other than the department? This doens't look expensive. My hunch is that she's using university computers and time for her diss.

If someone is funding it they're wasting their money.

/seeing as how all the research is available for free

228 victor_yugo  Thu, May 15, 2008 11:45:52pm

re: #204 itellu3times

well, I glanced at the paper, and still have to guess, but it's some kind of attempt to be able to let some robot read a blog and determine what people are talking about by using keywords rather than really trying to understand what's being said.

CIA/NSA love this kind of stuff.

Actually, that might not be far from the truth. The program doesn't have to understand the conversation. It only needs to observe the conversation until certain statistical values are met, then notify a spook that the thread might be of interest.

229 Render  Thu, May 15, 2008 11:46:42pm

re: #204 itellu3times

Project Echelon

[Link: www.fas.org...]

Project Carnivore

[Link: www.msnbc.msn.com...]

===

Don't think that's where this is going though...

PARANOIA
WILL
DESTROYA,
R

230 victor_yugo  Thu, May 15, 2008 11:47:19pm

re: #219 zombie

Here's another tiny picture of her. Self portrait.

Was not easy to find.

I like that one better.

231 redc1c4  Thu, May 15, 2008 11:47:32pm

re: #208 song_and_dance_man

Where is the boob thread graph?

i was wondering where the FNDT's fit into the scheme of things...

/as opposed to the ONDT's $-)

232 The Other Les  Thu, May 15, 2008 11:47:33pm

re: #226 song_and_dance_man

Islam Rosie hate misogyny link Soros goat Nazi Whore taqiyya Evolution

xenu tom cruise thetan enturbulate rondroid suppressive person

233 zombie  Thu, May 15, 2008 11:47:35pm

Ooooh, I found her entire curriculum vitae.

She's actually a returnee to college from industry. Worked 4 years as a System Engineer
for Ricoh.

234 victor_yugo  Thu, May 15, 2008 11:47:53pm

re: #229 Render

PARANOIA
WILL
DESTROYA,
R

Just because you're paranoid doesn't mean the pizza isn't bugged.

235 The Other Les  Thu, May 15, 2008 11:47:57pm

re: #229 Render

Hillary or BO?

236 zombie  Thu, May 15, 2008 11:48:01pm

re: #226 song_and_dance_man

Islam Rosie hate misogyny link Soros goat Nazi Whore taqiyya Evolution

LOL!

237 Joe Triscari  Thu, May 15, 2008 11:48:01pm

It looks like the python code compares two documents by a cosine measure based on the number of words in each document. It counts the words in each document and does a dot product between the counts normalized by root sum-of-squares. Two documents are most similar when the "cosine" is 1 and different when the "cosine" is 0.

They're probably getting ready to do a study where they compare the similarity between threads and this is some initial set of data to debug the code.

238 The Other Les  Thu, May 15, 2008 11:48:21pm

re: #234 victor_yugo

Just because you're paranoid doesn't mean the pizza isn't bugged.

Speak into potato comrade.

239 Opilio  Thu, May 15, 2008 11:48:28pm

re: #209 Sharmuta

LMAO! That is smurftastic!

I thought you might appreciate that portion of the list :-)

240 zombie  Thu, May 15, 2008 11:48:30pm

LOL STFU STFD :) :-)

241 Fenway_Nation  Thu, May 15, 2008 11:48:37pm

re: #214 karmic_inquisitor

lasers. Invention thereof.

I kinda liked the Mother's Day one with the duck and ducklings

/did I just say that?

242 gymnast  Thu, May 15, 2008 11:48:38pm

re: #174 Dan G.

I can fact check, but I'm not going to chase down a phrase that uses the word "context" without any context... especially when it is slung at me as an ad hominem.

Now you are hoisting yourself on your own pitard. If you are not contextually literate, you will quickly become eligible for the dry pool diving team. Maybe even be eligible to try out for the Varsity Ornithology team if you have the right stuff.

With the exception of the first sentence in the above paragraph, you will not understand the next two statements without specific knowledge of contextual literacy. Google news is highly dependent on the contextual illiteracy of it's casual readers for it's "perceived validity".

243 victor_yugo  Thu, May 15, 2008 11:49:24pm

re: #233 zombie

Dang, zombie, I'm almost scared of what you could turn up on me!

244 slokat  Thu, May 15, 2008 11:49:43pm

Good night Lizards!

&

If you track us back... Thanks Tae, for giving us a fun exercise!
Maybe you could contact Charles & give us an inside look at what this all means?

245 haakondahl  Thu, May 15, 2008 11:50:20pm

re: #238 The Other Les

Speak into potato comrade.

HAH!

246 hermeneutics  Thu, May 15, 2008 11:50:23pm

re: #233 zombie

Her first degree is in International Relations/Affairs. Later, she veered off into computers where she's getting her terminal degree. It makes perfect sense that she'd combine her interest in politics with her knowledge of computers.

247 zombie  Thu, May 15, 2008 11:50:46pm

re: #243 victor_yugo

Dang, zombie, I'm almost scared of what you could turn up on me!

While my comment function is rebooting, I'm falling back on my Google-fu function.

248 bosforus  Thu, May 15, 2008 11:50:58pm

OK, I've successfully recreated lgf1.pdf but have no idea what it means because I don't know what the numbers mean. If anyone has any clue it'd save me a lot of time at work tomorrow. :)

249 Sharmuta  Thu, May 15, 2008 11:51:28pm

re: #239 Opilio

Well- it made me look around more too-

sharmie 18 8

That is almost all Render's doing.

250 haakondahl  Thu, May 15, 2008 11:51:41pm

Well, I'm going to mail her and tell her the jig is up. Fix her permissions.

251 song_and_dance_man[deleted]  Thu, May 15, 2008 11:52:40pm
252 Opilio  Thu, May 15, 2008 11:52:44pm

re: #242 gymnast

Now you are hoisting yourself on your own pitard.

petard

/spelling police, out.

253 zombie  Thu, May 15, 2008 11:52:56pm

Judging that she started college in 1994, that means she was born in 1976, which would make her 32 years old now -- which jibes with her already having a real job for four years and then returning to get a PhD.

254 redc1c4  Thu, May 15, 2008 11:53:19pm

re: #232 The Other Les

xenu tom cruise thetan enturbulate rondroid suppressive person

Tom Cruise used to lease our house: we still get Scientology mail here.

/what a great scam... %-)

255 Render  Thu, May 15, 2008 11:53:42pm

re: #249 Sharmuta

lol

oops...

sry,
R

256 hermeneutics  Thu, May 15, 2008 11:53:51pm

re: #251 song_and_dance_man

IF you look carefully under the cup and on the paper you'll see religious messages. I'm serious. Verses from the Bible are hidden on all In and Out packaging.

257 LeePro  Thu, May 15, 2008 11:53:52pm

re: #171 Charles

Come on. Let's not post disparaging comments about people's looks. Chances are very good they're going to read this thread.

Thank you!

258 Killian Bundy  Thu, May 15, 2008 11:53:53pm

Language and Politics
William Cohen, Noah Smith, and Tae Yano

Most approaches to automatic text analysis and processing treat text as a stream of words or sentences. A typical underlying assumption is that the use of language in the data is literal and that the data represent facts. Many genres, however, do not have these features.

We are exploring automatic methods for analyzing text in the political domain, specifically blog posts on topics pertinent to the 2008 United States Presidential Elections. Political text is often indirect, sarcastic, repetitive, hyperbolic, emotional, biased, manipulative, and riddled with unstated assumptions. Our aim is to automatically separate useful, thoughtful information from redundant "spin," using statistical natural language processing techniques and a data-driven methodology that makes use of the insights of political scientists.

The broader impact of this work will consist of a renewed emphasis exploiting domain knowledge together with text data for more powerful natural language understanding technology, as well as software tools that will promote more informed decision-making among American voters.

/scroll down

259 karmic_inquisitor  Thu, May 15, 2008 11:53:55pm

Good night all.

But I have a question for the great one: Are you keeping later hours than you used to (early morning open threads at 3 AM)? And no bike threads in a while?

Midlife lifestyle change, perhaps?

260 bosforus  Thu, May 15, 2008 11:54:05pm
0.665689731523}, {16., 0.700371724223}, {17., 0.708542529981}, {18.,
0.72730225358}, {19., 0.816460766851}, {20., 0.855324151661}]}, {},
AspectRatio->NCache[GoldenRatio^(-1), 0.6180339887498948],
Axes->True,
AxesOrigin->{0, 0.4},

Hard to go wrong with the golden ratio.

261 victor_yugo  Thu, May 15, 2008 11:54:28pm

re: #248 bosforus

OK, I've successfully recreated lgf1.pdf but have no idea what it means because I don't know what the numbers mean. If anyone has any clue it'd save me a lot of time at work tomorrow. :)

The numbers were fed in raw for each axis, then sorted and plotted sequentially. The meaning is known only to her at this time. Therefore, it's probably a test run, not to be considered valid.

262 redc1c4  Thu, May 15, 2008 11:55:13pm

re: #251 song_and_dance_man

BTW when I come to CA soon I'm taking you up on that In-N-Out burger offer.

no problema... email is active.

Tommys too? %-)

263 Sharmuta  Thu, May 15, 2008 11:56:30pm

re: #255 Render

Don't be sorry, my friend. And, actually, "Sharmie" is the name I use at Pandora.

264 bosforus  Thu, May 15, 2008 11:56:42pm

re: #261 victor_yugo

The numbers were fed in raw for each axis, then sorted and plotted sequentially. The meaning is known only to her at this time. Therefore, it's probably a test run, not to be considered valid.

Is that what's in the lgf1.nb file? It looks like she's making a table in that file.

265 victor_yugo  Thu, May 15, 2008 11:56:53pm

re: #254 redc1c4

Be careful saying that. I'm sure they'd be interested to know that.

If I were you, I'd just turn it all over to a LEO somewhere.

266 The Other Les  Thu, May 15, 2008 11:57:42pm

re: #254 redc1c4

Tom Cruise used to lease our house: we still get Scientology mail here.

/what a great scam... %-)

Horridly fascinating subject. Even wrote some Scientologists into some role playing game fan fiction.

267 bosforus  Thu, May 15, 2008 11:57:54pm

re: #261 victor_yugo

But then that leads to the question of why have 4 test runs. Meh, who knows? Only she.

268 zombie  Thu, May 15, 2008 11:58:02pm

re: #258 Killian Bundy

Language and Politics
William Cohen, Noah Smith, and Tae Yano

/scroll down

Good find!

Cogitating on their thesis, I have determined that it is useless, futile and insulting.

269 haakondahl  Thu, May 15, 2008 11:58:26pm

Sent:

We here at LGF have been rollicking through your files. Interesting stuff! Would you care to let us in on the purpose of your experiment involving LGF and other blogs? If not, how about just the methodology, You have sparked quite a conversation.

*All* of your files seem to be wide open to the public. You need some CHMOD, and quick (of course, I can only speak about the files in directories I can see--but nothing seems to be missing, heh!).

Good Luck,

270 Sharmuta  Thu, May 15, 2008 11:58:47pm
antidisestablishmentarianism 1 1

I'm going to start using that word more.

271 gymnast  Thu, May 15, 2008 11:59:09pm

re: #252 Opilio

petard

/spelling police, out.

Oh shit! Just don't throw me in the brier patch OK? Anything but the brier patch. There are 26 letters in the alphabet and e is horribly overused in my opinion///.

272 song_and_dance_man[deleted]  Thu, May 15, 2008 11:59:36pm
273 haakondahl  Thu, May 15, 2008 11:59:54pm

re: #268 zombie

Good find!

Cogitating on their thesis, I have determined that it is useless, futile and insulting.

Well how do you like the Zio-Buntu-Hippie logo in the upper right?

274 zombie  Fri, May 16, 2008 12:00:01am

re: #258 Killian Bundy

Language and Politics
William Cohen, Noah Smith, and Tae Yano

/scroll down

I may steal that vampiric Hillary photo for the Hillary Gallery.

275 least  Fri, May 16, 2008 12:00:04am

(reuter run the same story, LGF respond twice in the like mnnaer.)

gddo nigh, LGF!

276 Winslow  Fri, May 16, 2008 12:00:30am

re: #271 gymnast

briar

/spelling police, out

277 itellu3times  Fri, May 16, 2008 12:00:38am
Political text is often indirect, sarcastic, repetitive, hyperbolic, emotional, biased, manipulative, and riddled with unstated assumptions.

Byte me.

278 bosforus  Fri, May 16, 2008 12:00:51am
if this script is running as a stand alone script:
take two file, both cleaned up for srilm lm format,
create the vector of words, and caliculate the cosine similarity
between the two

Cosine similarity is defnied as:
cosim(x,y) = sum(x*y) / (sqrt(sum(x^2)) * sqrt(sum(y^2)))

Dealing with cosines. No wonder none of the numbers are above 1.0.

279 zombie  Fri, May 16, 2008 12:00:57am

re: #273 haakondahl

Well how do you like the Zio-Buntu-Hippie logo in the upper right?

Hidden within it, however, is a Star of David!

The Zionist tentacles are everywhere!

280 karmic_inquisitor  Fri, May 16, 2008 12:01:08am

re: #248 bosforus

OK, I've successfully recreated lgf1.pdf
but have no idea what it means because I don't know what the numbers
mean. If anyone has any clue it'd save me a lot of time at work
tomorrow. :)

It is a guess based on her "cosine correlation" code, but it looks to me to tell you the degree to which a set of posts from a given blogger (in this case Charles) are simillar. Since she is scoring the comments, you are looking at the degree to which we behave like a herd. Perfect correlation is R=1 - we are certainly closer to that than R=0.

The "straighter" the line, the "straighter" the jacket.

281 haakondahl  Fri, May 16, 2008 12:01:09am
Yahoo! Inc is assisting research at the LTI by providing access to a 4,000-processor supercomputer running open-source distributed computing software such as Hadoop and the Pig parallel programming language.

"Hadoop and the Pig"?

282 redc1c4  Fri, May 16, 2008 12:01:15am

re: #256 hermeneutics

IF you look carefully under the cup and on the paper you'll see religious messages. I'm serious. Verses from the Bible are hidden on all In and Out packaging.

they're not hidden, just understated. my ex was an acquaintance of the lady who founded the company with her husband. she was a very religious person who believed that her day to day actions should be her testament. that's why In n Out pays better than minimum wage and gives benefits to all of it's employees, and promotes from within. (or at least they used to: she died, and now the younger generation is in charge & talking about expanding.)

283 Salem  Fri, May 16, 2008 12:02:30am

There's this thing on the Travel Channel about raising Kobe beef and they're showing this procession of steers getting their asses massaged by guys wearing those funny hats. That's one of those "only in Japan" kinda things...

Moo! Ah, I see it's time for my daily ass massage! Eat me, world!

284 Sharmuta  Fri, May 16, 2008 12:02:35am
fuck 496 224

I wonder what the kos dictionary total on this word is.

285 The Other Les  Fri, May 16, 2008 12:02:53am

re: #273 haakondahl

Well how do you like the Zio-Buntu-Hippie logo in the upper right?

Too Jewish. The Mohammedans are going to have a fit about that.

286 bosforus  Fri, May 16, 2008 12:03:13am

re: #280 karmic_inquisitor

It is a guess based on her "cosine correlation" code, but it looks to me to tell you the degree to which a set of posts from a given blogger (in this case Charles) are simillar. Since she is scoring the comments, you are looking at the degree to which we behave like a herd. Perfect correlation is R=1 - we are certainly closer to that than R=0.

The "straighter" the line, the "straighter" the jacket.

Let's just copy and paste everything Charles says for a month. See how she likes that.

287 slokat  Fri, May 16, 2008 12:04:23am

Saw this as I was leaving - using more than hamsters...

Yahoo! Inc is assisting research at the LTI by providing access to a 4,000-processor supercomputer running open-source distributed computing software such as Hadoop and the Pig parallel programming language.

288 Alberta Oil Peon  Fri, May 16, 2008 12:04:33am

re: #199 The Other Les

She uses the phrase "sustainable fashion."

That is not a good sign.

So polar bear hide slippers are out, eh? Pity, that.

289 Killian Bundy  Fri, May 16, 2008 12:04:50am

re: #268 zombie

Cogitating on their thesis, I have determined that it is useless, futile and insulting.

/just like LGF Watch

290 zombie  Fri, May 16, 2008 12:05:25am

Here's her 1997 class photo.

Ah, the good ol' days.

291 song_and_dance_man[deleted]  Fri, May 16, 2008 12:05:46am
292 redc1c4  Fri, May 16, 2008 12:06:51am

re: #265 victor_yugo

Be careful saying that. I'm sure they'd be interested to know that.

If I were you, I'd just turn it all over to a LEO somewhere.

we follow Harmony Church Rules here... neither Thetans nor Clears scare me: neither one wears a DS hat. getting yo' ass "fact checked" is the *least* of your worries if you show up uninvited and unwelcome. %-)

mail is just advertising for the various classes... thousands of $ for even the simplest things. (and every time i mention it in public, the volume drops off more. %-)

293 karmic_inquisitor  Fri, May 16, 2008 12:07:25am

Alright - here is the core of it ...

[Link: www.cs.cmu.edu...]

It is pretty simple stuff - she has two functions in the file - one to create the dictionary and the other to score her cosine correlation.

From that the stats flow.

294 cargocultist  Fri, May 16, 2008 12:07:28am

I suggest that Charles email her and ask to talk to her on the phone. Then interview her and post the interview on LGF. I would find it very interesting. I bet this lady has tons of interesting ideas and observations about all kinds of things including blogs like LGF.

295 redc1c4  Fri, May 16, 2008 12:07:32am

re: #284 Sharmuta

I wonder what the kos dictionary total on this word is.

dunno, but ours are likely all Mandy's... %-)

/white smoke!

296 JustMyView  Fri, May 16, 2008 12:07:49am

re: #269 haakondahl

Sent:

Nicely done! Will be curious about the response.

297 zombie  Fri, May 16, 2008 12:07:50am

Well, I know where Tae was on June 17, 2006!

I hope Wendy and Paul are doing well.

298 itellu3times  Fri, May 16, 2008 12:09:38am

What is the cosine for nakba?

299 victor_yugo  Fri, May 16, 2008 12:09:53am

re: #297 zombie

Dang, zomb, you just broke some hearts.

300 victor_yugo  Fri, May 16, 2008 12:10:23am

re: #298 itellu3times

What is the cosine for nakba?

None. Nakba's credit is so bad, nobody will cosine for them.

301 gop_patriot  Fri, May 16, 2008 12:11:08am

The comment with my nic on it wasn't even my comment, it was a response to me. 'Cause I didn't say this:

re _meta_number_ref_ gop_patriot my closest friend looks like ilsa the sword wielding teutonic amazon and is she is the one with the shoshone blood direct line to charboneau and sacajawea the rest is scottish german and french

But I remember the discussion, though. Interesting.

302 victor_yugo  Fri, May 16, 2008 12:11:27am

re: #297 zombie

Well, I know where Tae was on June 17, 2006!

I hope Wendy and Paul are doing well.

Woops, for both you and me.

Tae was there, but Paul married Wendy.

303 zombie  Fri, May 16, 2008 12:11:37am

OK, I'll stop.

Tae, let that be a lesson: Blogs are not inert things that can be studied dispassionately! Sometimes they can bite back -- jump right up at you through the screen.

And the meaning of sentences cannot be dissected by computer analysis.

304 victor_yugo  Fri, May 16, 2008 12:11:40am

That's the part I missed.

305 Opilio  Fri, May 16, 2008 12:11:42am

re: #284 Sharmuta

I wonder what the kos dictionary total on this word is.

dKos Dictionary

f*ck 837 405
f*cked 202 165
f*cking 1166 452

And on a not necessarily related note:

bush 10405 875

306 Killian Bundy  Fri, May 16, 2008 12:11:46am

re: #297 zombie

Well, I know where Tae was on June 17, 2006!

I hope Wendy and Paul are doing well.

Yeah, I was going to post that one.

/but she wasn't the one getting married, which would have given a new name to search

307 haakondahl  Fri, May 16, 2008 12:11:50am

This is disturbing:


rant_debate (mean-comment-length) classifier.
10-hold cross validation


It seems that she is either assuming a correlation or trying to measure a correlation between comment length and quality.

Sounds like bunkum to me.

/I wonder just how far that sort of "value judgment" will go at CMU?

308 LeePro  Fri, May 16, 2008 12:11:59am

re: #252 Opilio

petard

/spelling police, out.

Good job, deputy!

; )

309 gymnast  Fri, May 16, 2008 12:12:43am

re: #276 Winslow

briar

/spelling police, out

You must have a $20 dollar spell checker and I have a .20 cent spell checker. What is it about briars or briers that you don't understand? My "briars" has a red line under it, and my "briers" doesn't . Take it up with Uncle Remus or Walt Disney if it really bothers you. ///

310 gop_patriot  Fri, May 16, 2008 12:13:30am

re: #308 LeePro

Good evening, Lee, how are ya? :) Been down to Barbecue Fest?

/my best friend called me from the river. I am seriously jealous. lol

311 Sharmuta  Fri, May 16, 2008 12:13:51am

re: #305 Opilio

Well- there we have it!

kos- twice as profane with half the intellect.

312 Psaturn  Fri, May 16, 2008 12:14:22am

re: #256 hermeneutics

IF you look carefully under the cup and on the paper you'll see religious messages. I'm serious. Verses from the Bible are hidden on all In and Out packaging.

Well...actually there are Biblical references such as John 3:16

I am not sure about other packaging though...

In and Out had been owned by a family who is very Christian and strong follower of Chuck Smith of Calvary Chapel in Costa Mesa...

In and Out has excellent burgers and fries...I had that last night...

I think one of the busiest In and Out is probably in 1000 Palms California...

313 justacanuck  Fri, May 16, 2008 12:14:24am

Some interesting file names are scrolling by ...

kos_vs_carpetbagger

WTF?

lol

314 PloniAlmoni  Fri, May 16, 2008 12:14:33am

So some students are doing research on blogs. No big deal. It was interesting to see what it was about, but that's about it. I don't think they feel the need to protect their files. Why should they? Who would want to mess with that? It's just a school project, I don't see point of "exposing" the students and publicizing their information. At least that's my take.

315 Winslow  Fri, May 16, 2008 12:14:44am

re: #309 gymnast

Google briar patch, and you'll see. LGF's spell checker is not infallible.

316 dgax65  Fri, May 16, 2008 12:15:41am

re: #215 hermeneutics

Why do you think she's funded by anything other than the department? This doens't look expensive. My hunch is that she's using university computers and time for her diss.

I noticed that the research group to which she belongs is partially funded by a DARPA grant.

DARPA Computer Science Study Panel program (grant number HR00110110013)

317 bosforus  Fri, May 16, 2008 12:16:14am

re: #298 itellu3times

What is the cosine for nakba?

well, adding the letters to their numeric counterpart nakba = -0.74806
Now, the cosine similarity as defined by Tae, no idea.

318 pat  Fri, May 16, 2008 12:16:23am

re: #316 dgax65

Oh. Now that is sad.

319 gymnast  Fri, May 16, 2008 12:17:19am

re: #290 zombie

Here's her 1997 class photo.

Ah, the good ol' days.

Zombie, didn't Billy Clinton's favorite intern go to Lewis and Clark College? The Famous La Monica Lewinsky

320 pat  Fri, May 16, 2008 12:18:49am

re: #319 gymnast

I had a very hot girl friend that went there.

321 The Other Les  Fri, May 16, 2008 12:18:49am

Good night.

322 HelloDare  Fri, May 16, 2008 12:18:56am

Maybe Tae is just nuts. Have you seen the film, "A Beautiful Mind"?

323 gop_patriot  Fri, May 16, 2008 12:19:46am

re: #319 gymnast

Zombie, didn't Billy Clinton's favorite intern go to Lewis and Clark College? The Famous La Monica Lewinsky

re: #320 pat

I had a very hot girl friend that went there.

Hmmm

roflolol jk ;)

324 RTLM  Fri, May 16, 2008 12:19:52am

/Hello, I'm agent Rtlm with the Association for the Advancement of Artificial Intelligence. I will now view your I.D.

(flashes AAAI badge)

325 LeePro  Fri, May 16, 2008 12:19:54am

re: #270 Sharmuta

  antidisestablishmentarianism
I'm going to start using that word more.

OMG!
I haven't heard that word since my daddy taught us kids about it a gazillion years ago!

326 pat  Fri, May 16, 2008 12:19:55am

The vagaries of academic endeavor.

327 zombie  Fri, May 16, 2008 12:20:13am

re: #319 gymnast

Zombie, didn't Billy Clinton's favorite intern go to Lewis and Clark College? The Famous La Monica Lewinsky

Yup.

She was two years ahead of Tae.

328 bosforus  Fri, May 16, 2008 12:20:19am

re: #322 HelloDare

Maybe Tae is just nuts. Have you seen the film, "A Beautiful Mind"?

Nah, she just likes numbers. I've done similar "number games" like this and it was just for fun!

329 hermeneutics  Fri, May 16, 2008 12:20:46am

re: #314 PloniAlmoni

I'm with you, PA. This poor girl is just trying to please her advisor and finish her darn dissertation ... and dress well. Sounds sadly typical to me.

330 Ghost707  Fri, May 16, 2008 12:21:00am

If they have any servers running AIX/Unix I might be able to do something.
They won't like the results after I run diags though.

/smirk.

331 gymnast  Fri, May 16, 2008 12:22:20am

re: #315 Winslow

Google briar patch, and you'll see. LGF's spell checker is not infallible.

I am sure you are right. I spelled it "briar" the first time and acceded to the pressure of conforming to the red line. There being 26 letters in the alphabet, and as little as they cost, I tend to use them both randomly and freely.

332 zombie  Fri, May 16, 2008 12:23:13am

re: #329 hermeneutics

I'm with you, PA. This poor girl is just trying to please her advisor and finish her darn dissertation ... and dress well. Sounds sadly typical to me.

Well, she's a full-grown woman now.

More like, to my mind: She knows that with a PhD she could draw a bigger salary once she gets hired back out at a big computer company. If she had stayed at Ricoh with just a BS or MS, she'd be on a lower salary track.

333 LeePro  Fri, May 16, 2008 12:23:13am

re: #274 zombie

I may steal that vampiric Hillary photo for the Hillary Gallery.

Photoshop those eye-teeth a little.

hee hee

334 victor_yugo  Fri, May 16, 2008 12:23:32am

re: #315 Winslow

Google briar patch, and you'll see. LGF's spell checker is not infallible.

The "red line" referrs to the spel-cheker in teh browser (probaly Firefox or Mozilla).

335 Rookie  Fri, May 16, 2008 12:23:42am

OK... so they run useless checks on blogs postings for some obscure objective to justify some funds spending.

At least, after 4 years, did they realized that the moonbat blogs like KOS are populated by some very disturbed people? (Oops, I made a reference to the other side...)

Here is one word for your machine to crunch, oh wise "scientists":

Don’twastetimewithuselessresearchwhenyouhave betterthingstodoliketeachingproperenglishtoyouruni versitystudentsandkeeppoliticalpropagandaandpoliti calcorrectnessoutoftheclassroom

Here - 175 letter word for you, that's under 256 characters so maybe it will make history in the database.

336 hermeneutics  Fri, May 16, 2008 12:24:47am

re: #332 zombie

Oh, you're right. I'm so used to history PhDs that make 50k when then finally finish ... nine years later and in debt. The thought that someone may actually MAKE money afterward is mindboggling. Obviously, you've done it -- I haven't.

337 haakondahl  Fri, May 16, 2008 12:25:00am

re: #314 PloniAlmoni

So some students are doing research on blogs. No big deal. It was interesting to see what it was about, but that's about it. I don't think they feel the need to protect their files. Why should they? Who would want to mess with that? It's just a school project, I don't see point of "exposing" the students and publicizing their information. At least that's my take.

I agree. Now that we know (more or less) what this is about, why continue to build up some weird fan club? Or worse? It's creepy.

338 Racer X  Fri, May 16, 2008 12:25:34am

Pre-cursor to the Darma Initiative?

339 Winslow  Fri, May 16, 2008 12:26:29am

re: #334 victor_yugo

That makes at least two fallible spell-checkers.

340 LeePro  Fri, May 16, 2008 12:26:40am

re: #286 bosforus

Let's just copy and paste everything Charles says for a month. See how she likes that.

ROFLOL

341 haakondahl  Fri, May 16, 2008 12:27:04am

re: #330 Ghost707

If they have any servers running AIX/Unix I might be able to do something.
They won't like the results after I run diags though.

/smirk.

Gee, you're going to run a DOS against a University? And you're advertising it here? Or just--you know--claiming that you could? You know--if you wanted to?

342 Killian Bundy  Fri, May 16, 2008 12:28:12am

Noah's ARK

Noah's ARK[1] is Noah Smith's informal research group at the Language Technologies Institute, School of Computer Science, Carnegie Mellon University. (The research is formal; the group is informal.) As you may have guessed, our research focuses on problems of ambiguity and uncertainty in natural language processing, including morphology, syntax, semantics, and translation.

/her graduate adviser, who has two cats named Dr. Sacada and Rudolph Valentino

343 bosforus  Fri, May 16, 2008 12:28:23am

Well, in other news, I got a birdie on a par 3 today so I was pretty happy about that. Still ended up about 17 over on a 9-hole par 3 course but oh well.
G'night, Lizards!

344 Sharmuta  Fri, May 16, 2008 12:28:58am
haz 30 21

lolcat 44 6

We kan haz fundz frum antidisestablishmentarians?

345 haakondahl  Fri, May 16, 2008 12:29:53am

re: #342 Killian Bundy

Ah. "Natural Language Processing".

Not Neuro-Linguistic Programming.

346 zombie  Fri, May 16, 2008 12:30:16am

re: #336 hermeneutics

Oh, you're right. I'm so used to history PhDs that make 50k when then finally finish ... nine years later and in debt. The thought that someone may actually MAKE money afterward is mindboggling. Obviously, you've done it -- I haven't.

Just because I know about it, doesn't mean I've done it.

I should be moving into a bigger cardboard box next week. This one's getting a little damp.

347 hermeneutics  Fri, May 16, 2008 12:31:38am

re: #346 zombie

Ha -- I've got you beat, Zombie: I'm in a pop-up!

348 haakondahl  Fri, May 16, 2008 12:32:04am

Ah. I am an anticorruptworlddeliberativebodynondisestablishmen tarianist. Down with the UN!

349 redc1c4  Fri, May 16, 2008 12:33:09am

re: #324 RTLM

/Hello, I'm agent Rtlm with the Association for the Advancement of Artificial Intelligence. I will now view your I.D.

(flashes AAAI badge)

/me flashes Rtlm...

> cue sound of mechanical destruction/failure

350 LeePro  Fri, May 16, 2008 12:33:58am

re: #310 gop_patriot

Good evening, Lee, how are ya? :) Been down to Barbecue Fest?

/my best friend called me from the river. I am seriously jealous. lol

Nope.
Damn!
Can't eat!

Commented back to you at my #640 coupla' threads back. Spent about 15 minutes talking to myself there earlier, before I discovered nobody was home!

351 zombie  Fri, May 16, 2008 12:34:09am

re: #337 haakondahl

I agree. Now that we know (more or less) what this is about, why continue to build up some weird fan club? Or worse? It's creepy.

I think the feeling is: We didn't like being the subject of some kind of weird language-analysis experiment, without our consent. So we're curious as to who's experimenting on us. A natural curiosity, I should think.

Turns out we couldn't really find anything too sinister about the whole thing, but i don't think it was out-of-line to nose around her academic life to see if there was some kind of underhanded agenda.

352 LeePro  Fri, May 16, 2008 12:35:58am

re: #323 gop_patriot

Hmmm

roflolol jk ;)

ROFLOL

I had the exact same thought. You beat me to it!

353 Racer X  Fri, May 16, 2008 12:36:34am

re: #351 zombie

Turns out we couldn't really find anything too sinister about the whole thing, but i don't think it was out-of-line to nose around her academic life to see if there was some kind of underhanded agenda.

Plus the results are now skewed - because an entire thread discussion was about her.

354 redc1c4  Fri, May 16, 2008 12:36:44am

re: #346 zombie

Just because I know about it, doesn't mean I've done it.

I should be moving into a bigger cardboard box next week. This one's getting a little damp.

ah yes... summer in Frisco.

/winter everywhere else. %-)

355 victor_yugo  Fri, May 16, 2008 12:37:44am

re: #351 zombie

I think the feeling is: We didn't like being the subject of some kind of weird language-analysis experiment, without our consent.

I have no problem with it. I'm posting on a publicly-readable website, so anything I say may be subject to scrutiny by any party on the Internet. If I don't like it that way, I should STFU and STFD.

356 Sharmuta  Fri, May 16, 2008 12:38:14am

re: #348 haakondahl

LMAO!

I'm an anticaliphateestablismentarianist.

357 Render  Fri, May 16, 2008 12:39:06am

I saw at least one reference to work with or for children.

We also saw the reference to DARPA, which implies a military and/or intelligence function.

It's been an interesting exercise, to say the least. Now we know more about the researcher than she does about us, the subjects.

LET THAT
BE A LESSON,
R

358 gymnast  Fri, May 16, 2008 12:39:26am

If nothing else, the young ladies research should provide you with some idea of what is going on with satellite telecommunications keyword programs used to intercept international calls.

359 redc1c4  Fri, May 16, 2008 12:39:34am

re: #355 victor_yugo

I have no problem with it. I'm posting on a publicly-readable website, so anything I say may be subject to scrutiny by any party on the Internet. If I don't like it that way, I should STFU and STFD.

she is allowed to be curious...

we're allowed to be curious about her curiosity...

/yellow

360 LeePro  Fri, May 16, 2008 12:40:01am

re: #344 Sharmuta

 haz 30 21

 lolcat 44 6

We kan haz fundz frum antidisestablishmentarians?

LOLOL

361 victor_yugo  Fri, May 16, 2008 12:40:39am

re: #357 Render

We also saw the reference to DARPA, which implies a military and/or intelligence function.

And the same techniques/analyses will probably work on Arabic-language websites, too.

Good catch, Render.

362 gop_patriot  Fri, May 16, 2008 12:43:22am

re: #350 LeePro

Going to look. :) Sorry about the no eating thing, darnit, I forgot. Still, we haven't hit The Commissary yet, so there's that to look forward to!

re: #352 LeePro

;) I win!

363 Clemente  Fri, May 16, 2008 12:45:37am

I'm naming my next sock "_meta_end_dot_"!

364 PloniAlmoni  Fri, May 16, 2008 12:46:05am

re: #351 zombie

I think the feeling is: We didn't like being the subject of some kind of weird language-analysis experiment, without our consent. So we're curious as to who's experimenting on us. A natural curiosity, I should think.

Turns out we couldn't really find anything too sinister about the whole thing, but i don't think it was out-of-line to nose around her academic life to see if there was some kind of underhanded agenda.

Well everything on LGF, like most websites, is public information. I wouldn't get consent if I was them. I wouldn't feel like I need to. But yeah, there's nothing wrong about lizards being curious about this project. I know most people here are good people, unlike the rasicts and anti-semites at the DKOS, but you never know who's here on the net. I wouldn't want her to receive threatening emails from some nutcase who's browsing here, that's all. I know we're better than talking trash or saying anything stupid, so I don't think it's a big deal at all. I'm just chipping in with my opinion that there's no real reason to make it personal or to bring the student's identity into this.

365 ghost707  Fri, May 16, 2008 12:47:03am

re: #341 haakondahl

Gee, you're going to run a DOS against a University? And you're advertising it here? Or just--you know--claiming that you could? You know--if you wanted to?

Uh no.
The DOS was your idea. Not usually what I think of while in diags.
Sense of humor required.

366 reggie  Fri, May 16, 2008 12:50:10am

Reads like a transcript. Maybe someone has suggested this, but if the terms aren't retrieved by searching LGF, then perhaps they feed a robot that auto-posts here. Some right-thinking things in there, a few strokes for Charles, too. Then the programmer throws in a subversive comment here and there.

367 LeePro  Fri, May 16, 2008 12:54:02am

Egads! ! !

2:50am is way-y-y-y-y-y too late for me!

G'nite to { { {all my favorite Lizards!} } }

'Nite, red.

368 JimmyTheClaw  Fri, May 16, 2008 12:56:47am

re: #364 PloniAlmoni

Well everything on LGF, like most websites, is public information. I wouldn't get consent if I was them. I wouldn't feel like I need to. But yeah, there's nothing wrong about lizards being curious about this project. I know most people here are good people, unlike the rasicts and anti-semites at the DKOS, but you never know who's here on the net. I wouldn't want her to receive threatening emails from some nutcase who's browsing here, that's all. I know we're better than talking trash or saying anything stupid, so I don't think it's a big deal at all. I'm just chipping in with my opinion that there's no real reason to make it personal or to bring the student's identity into this.

bingo this might be part of the experiment to see difference in reaction. basicly we all have class

369 gop_patriot  Fri, May 16, 2008 12:57:09am

re: #350 LeePro

And I agree with your assessment (on the other thread) of the Graceland area. :/ It's scary as hell down there. (Safe when the cops are there protecting the tourists, otherwise, yikes)

I went down there last back in the 90s, for the big memorial thingie in August, with some friends. Some lady said a prayer... and about halfway through, she stopped talking to God and started talking to Elvis. And not one of the fanatics batted an eye. After we recovered from the shock, my friends and I almost wet our pants laughing. All the "serious" fans gave us the hairy eyeball over that. LOL

/full disclaimer; my dad went to high school with Elvis, and knew him a little. Said he was a very nice guy. So no hatin' from the fans. :)

370 gop_patriot  Fri, May 16, 2008 12:58:16am

re: #367 LeePro

Dang, I missed her. Goodnight! :)

371 Slumbering Behemoth  Fri, May 16, 2008 12:58:28am

re: #4 RTLM

Gee, you guys are all famous!

And yet there is nary a mention of the Behemoth what Slumbers. Gonna hafta work on that.
/

G'nite Lizards.

372 redc1c4  Fri, May 16, 2008 1:00:13am

re: #366 reggie

Reads like a transcript. Maybe someone has suggested this, but if the terms aren't retrieved by searching LGF, then perhaps they feed a robot that auto-posts here. Some right-thinking things in there, a few strokes for Charles, too. Then the programmer throws in a subversive comment here and there.

they could call it "AutoMoby"!

/wouldn't last long %-)

373 LeePro  Fri, May 16, 2008 1:00:43am

re: #370 gop_patriot

Dang, I missed her. Goodnight! :)

Short retro-lurk...

G'nite {gop}!

374 gop_patriot  Fri, May 16, 2008 1:10:54am

Completely OT:
My son is watching Fear and Loathing in Las Vegas. I've never seen the movie all the way through, and a little while ago I watched a bit I'd not seen before. Benicio's character was waving a big knife around, Johnny Depp's character said he couldn't go to sleep with him wandering around wanting to cut him up with a knife... Benicio said: "Who said I wanted to cut you up? I just wanted to cut a little 'Z' in your forehead"...

Holy crap that's a funny movie. Going to have to rent it and see the whole thing. And read the book... :)

375 redc1c4  Fri, May 16, 2008 1:14:40am

re: #374 gop_patriot

Completely OT:
My son is watching Fear and Loathing in Las Vegas. I've never seen the movie all the way through, and a little while ago I watched a bit I'd not seen before. Benicio's character was waving a big knife around, Johnny Depp's character said he couldn't go to sleep with him wandering around wanting to cut him up with a knife... Benicio said: "Who said I wanted to cut you up? I just wanted to cut a little 'Z' in your forehead"...

Holy crap that's a funny movie. Going to have to rent it and see the whole thing. And read the book... :)

for some real fun, try reenacting it... %-)

376 gop_patriot  Fri, May 16, 2008 1:18:28am

re: #375 redc1c4

for some real fun, try reenacting it... %-)

I'm trying, dammit. But I only have beer. Ah well, there's always tomorrow night. I'll just live vicariously through movies and videos. ;)

377 redc1c4  Fri, May 16, 2008 1:22:32am

is nothing sacred?

Shania Twain & her hubby are breaking up... %-)

378 redc1c4  Fri, May 16, 2008 1:23:26am

re: #376 gop_patriot

I'm trying, dammit. But I only have beer. Ah well, there's always tomorrow night. I'll just live vicariously through movies and videos. ;)

then drink more beer... it's always w*rked for me!

/one way or another. %-)

379 Clovis69  Fri, May 16, 2008 1:23:52am

Could this be a way to come up with some sort of blog Turing test? Look for structure from popular blogs and come up with a way to machine create comments that will pass for human?

380 gop_patriot  Fri, May 16, 2008 1:28:36am

re: #378 redc1c4

Working on it. Having a TNDT all by myself, apparently. Unless you've been into the Scotch tonight? ;)

381 redc1c4  Fri, May 16, 2008 1:31:03am

re: #379 Clovis69

Could this be a way to come up with some sort of blog Turing test? Look for structure from popular blogs and come up with a way to machine create comments that will pass for human?

never attribute to malice what can be explained by stupidity...

/side bar: what would you suggest as supportive reading to go with "Heights of Courage"? 19D

382 redc1c4  Fri, May 16, 2008 1:31:40am

re: #380 gop_patriot

Working on it. Having a TNDT all by myself, apparently. Unless you've been into the Scotch tonight? ;)

you have to ask? %-)

CHANGE!

383 gop_patriot  Fri, May 16, 2008 1:31:45am

re: #379 Clovis69

Interesting thought. Could a machine recognize certain words, and make comments that "fit in" without seeming to random... Hmm. And if someone responded to that machine's comment, could it reply and make any sense at all? Hmm again.

384 hopefulone  Fri, May 16, 2008 1:33:14am

1) Application of this kind of research can create its own reality. "The figures don't lie but the liars can figure" is one cynical way of looking at it. Another perspective on the bootstrapped reality is that people looking for analysis may buy this stuff without understanding it--especially if it's done slickly/pretty and the algorithms are proprietary.

2) What fun to hack their input stream! ...for ideas like Dan G.'s #132. Digging into their scripts I bet there is plenty of opportunity to cause mischief ^Z, or massage the message by looking at how they parse words, ...or even have it mail stuff to someone ('sendmail -s "Mail from CMU" president@whitehouse.gov

385 Killian Bundy  Fri, May 16, 2008 1:34:44am

DeGeneres, de Rossi to marry

US talkshow host and comedian Ellen DeGeneres is to wed her long-time Australian partner Portia de Rossi following a court ruling which dumps the state's ban on gay marriages.

/well, that didn't take long

386 gop_patriot  Fri, May 16, 2008 1:35:35am

re: #382 redc1c4

you have to ask? %-)

CHANGE!

LOL

/drink

387 hopefulone  Fri, May 16, 2008 1:36:11am

'Sorry if the end of my last post caused you grief, Mr. Johnson. I notice my

388 gop_patriot  Fri, May 16, 2008 1:36:48am

re: #384 hopefulone

Hello, first post, eh? :) Are you in the States?

/ps- don't do it!

389 hopefulone  Fri, May 16, 2008 1:37:40am

..."embedded commands" got eaten.

I am causing problems here so it's likely stuff would work there.

390 Hengineer  Fri, May 16, 2008 1:39:43am

re: #384 hopefulone

1) Application of this kind of research can create its own reality. "The figures don't lie but the liars can figure" is one cynical way of looking at it. Another perspective on the bootstrapped reality is that people looking for analysis may buy this stuff without understanding it--especially if it's done slickly/pretty and the algorithms are proprietary.

2) What fun to hack their input stream! ...for ideas like Dan G.'s #132. Digging into their scripts I bet there is plenty of opportunity to cause mischief ^Z, or massage the message by looking at how they parse words, ...or even have it mail stuff to someone ('sendmail -s "Mail from CMU" president@whitehouse.gov

On point number 2, there is no real need to do any of that, because once the subject knows they are being studied, that by itself skews the results.
You see it everywhere, and that principle has its roots in physics. The mere act of measuring a system introduces uknowns that make it impossible to get an accurate measurement of what the system would be: e.g. the Heisenberg uncertainty principle, you can know the position of a particle, or the velocity of the particle, but never both at the same time. By measuring for one, you disturb the particle making it impossible to measure the other.

391 Hengineer  Fri, May 16, 2008 1:41:08am

re: #382 redc1c4

you have to ask? %-)

CHANGE!

/drink

392 gop_patriot  Fri, May 16, 2008 1:42:50am

Hi, Hengineer, how are you? :)

393 redc1c4  Fri, May 16, 2008 1:43:44am

re: #385 Killian Bundy

DeGeneres, de Rossi to marry

/well, that didn't take long

and, in a few months, de Rossi sues for alimony and goes on her way...
typical Hollywood.

/cynic

394 redc1c4  Fri, May 16, 2008 1:44:45am

re: #391 Hengineer

/drink

too bad the only distillery on that scow is for water, eh? %-)

/don't ask, don't tell.

395 Hengineer  Fri, May 16, 2008 1:45:21am

re: #392 gop_patriot

quite alright, and you?

396 Hengineer  Fri, May 16, 2008 1:45:43am

re: #394 redc1c4

I have no idea what you are referring to
/whistles.

397 redc1c4  Fri, May 16, 2008 1:45:58am

re: #389 hopefulone

..."embedded commands" got eaten.

I am causing problems here so it's likely stuff would work there.

that should be a "hint & a half fo yo ass", as my drills used to say... %-)

398 Hengineer  Fri, May 16, 2008 1:47:23am

We just left Fujairah, U.A.E. Fujairah is on the Arabian sea/Indian ocean side of the Straight of Hormuz. It's about a 2-3 hour drive from Dubai. We weren't allowed to leave the port complex, but we were only in for an overnighter. 3 duty free stores (one was closed because one of the owners passed away, and the funeral was being held), and 2 bars. Made for good, fun times that pretty much the entire ship enjoyed.

Had a blog post about it, too (one of the few times I actually update my blog).

399 gop_patriot  Fri, May 16, 2008 1:47:55am

re: #395 Hengineer

Peachy! And a teensy bit tipsy. :p

What part of the world are you in, if you can say?

400 Hengineer  Fri, May 16, 2008 1:48:43am

re: #399 gop_patriot

Oh somewhere in the 5th fleet.

401 gop_patriot  Fri, May 16, 2008 1:48:46am

Nevermind, you read my mind and answered before I asked. Scary. ;)

402 Hengineer  Fri, May 16, 2008 1:49:41am

re: #401 gop_patriot

Nevermind, you read my mind and answered before I asked. Scary. ;)

well I decided to have a shameless plug about my latest blog entry. One night of enjoyment and drinking after at least 2 weeks at sea.

/worth the hangover

403 PloniAlmoni  Fri, May 16, 2008 1:49:46am

re: #390 Hengineer
Yeah, the double-slit experiment is fascinating to study.

404 redc1c4  Fri, May 16, 2008 1:50:12am

re: #396 Hengineer

I have no idea what you are referring to
/whistles.

that means it's boiling water, and is running too hot...

neck it down a skosh. %-)

/my FIL was on the Showboat during the late unpleasantness with our allies the Japanese , and there are some stories there you could relate to...

405 Hengineer  Fri, May 16, 2008 1:52:23am

re: #404 redc1c4

that means it's boiling water, and is running too hot...

neck it down a skosh. %-)

/my FIL was on the Showboat during the late unpleasantness with our allies the Japanese , and there are some stories there you could relate to...

I'm sure...excepting the torpedo one

406 Alberta Oil Peon  Fri, May 16, 2008 1:53:12am

re: #383 gop_patriot

Interesting thought. Could a machine recognize certain words, and make comments that "fit in" without seeming to random... Hmm. And if someone responded to that machine's comment, could it reply and make any sense at all? Hmm again.

I seem to remember that many years ago, as in before the advent of personal computers, someone wrote a program that "did" psychoanalysis. The program asked questions of a "patient", read the responses (all via a terminal, IIRC), and then followed up by asking new questions derived from key words in the response. The result was eerily similar to a live psychiatrist. I guess one might call that an expert system.

I'm sure that one could write a program to converse on a political blog, but I think that the smarter people would eventually see through it.

407 gop_patriot  Fri, May 16, 2008 1:54:43am

re: #402 Hengineer

Thanks for the reminder, I just went and looked. Good pictures, did you take them all?

408 Hengineer  Fri, May 16, 2008 1:54:48am

re: #383 gop_patriot

Interesting thought. Could a machine recognize certain words, and make comments that "fit in" without seeming to random... Hmm. And if someone responded to that machine's comment, could it reply and make any sense at all? Hmm again.

Well, by the same thought, could one program two computers to talk to each other? Then come back the next day to see if the conversation turns to gibberish?

409 Hengineer  Fri, May 16, 2008 1:55:36am

re: #407 gop_patriot

Thanks for the reminder, I just went and looked. Good pictures, did you take them all?

Yep

/gotta go, time to go back to work!

410 redc1c4  Fri, May 16, 2008 1:56:10am

re: #405 Hengineer

I'm sure...excepting the torpedo one

yeah, that was ugly, but i was thinking more of the ones i have been told that have to do with industrial size boxes of raisins, inventive Sailors, and good old American know how...

that and steaming for days on end out in the big ocean.

/haze grey & underway

411 gop_patriot  Fri, May 16, 2008 1:56:44am

re: #406 Alberta Oil Peon

I'm sure that one could write a program to converse on a political blog, but I think that the smarter people would eventually see through it.

And find a way to block it, and send the machine lots of SPAM and pr0n links to muck up the works. LOL

412 gop_patriot  Fri, May 16, 2008 1:57:16am

re: #409 Hengineer

Yep

/gotta go, time to go back to work!

STAY SAFE, ttyl

(((Hengineer)))

413 littleoldlady  Fri, May 16, 2008 2:00:03am

Good morning, afternoon, evening *everyone*!™

Fruitcup is on the buffet --->
Help yourselves!

414 Fenway_Nation  Fri, May 16, 2008 2:03:41am

re: #413 littleoldlady

You'll be pleased to know I've abandoned my fruitcup pruno experimentation...

415 littleoldlady  Fri, May 16, 2008 2:04:37am

re: #414 Fenway_Nation

Fenway! :-)

It's gonna be a good day! ;-)

416 gop_patriot  Fri, May 16, 2008 2:06:57am

re: #413 littleoldlady

Good morning! :) Right on time as usual. Thanks!

You know, we should really look into asking Charles for a "fruitcup automater", in case you don't actually want to be awake at 5AM Eastern Time. LOL

417 Fenway_Nation  Fri, May 16, 2008 2:07:39am

re: #415 littleoldlady


How so?

418 Fenway_Nation  Fri, May 16, 2008 2:09:35am

re: #416 gop_patriot

Good morning! :) Right on time as usual. Thanks!

You know, we should really look into asking Charles for a "fruitcup automater", in case you don't actually want to be awake at 5AM Eastern Time. LOL

You mean the Fruitcup-itron 3000?

419 redc1c4  Fri, May 16, 2008 2:09:44am

re: #414 Fenway_Nation

You'll be pleased to know I've abandoned my fruitcup pruno experimentation...

care to sample my results?

/good stuff!

420 littleoldlady  Fri, May 16, 2008 2:09:44am

gop! :-)

The truth is, I DON'T want to be awake at 5 am.

/seems I have no choice in the matter tho. :-(

I LOVE the AutoFruitcupomater™ idea!

421 JimmyTheClaw  Fri, May 16, 2008 2:10:15am

re: #406 Alberta Oil Peon

I seem to remember that many years ago, as in before the advent of personal computers, someone wrote a program that "did" psychoanalysis. The program asked questions of a "patient", read the responses (all via a terminal, IIRC), and then followed up by asking new questions derived from key words in the response. The result was eerily similar to a live psychiatrist. I guess one might call that an expert system.

I'm sure that one could write a program to converse on a political blog, but I think that the smarter people would eventually see through it.

doctor sabato was one i got it with my first sound card i had fun crashing it because it would scream as it crashed

422 littleoldlady  Fri, May 16, 2008 2:10:43am

re: #418 Fenway_Nation

Ah. Fruitcupitron®; is better!

423 littleoldlady  Fri, May 16, 2008 2:12:43am

Fenway finally cleaned up his act. Now if only we could get red to keep his grubby little hands out of the cookie jar fruitcup.

Hiya Jimmy! :-)

424 gop_patriot  Fri, May 16, 2008 2:12:48am

re: #418 Fenway_Nation

You mean the Fruitcup-itron 3000?

LOL! Love the name!

re: #420 littleoldlady

gop! :-)

The truth is, I DON'T want to be awake at 5 am.

/seems I have no choice in the matter tho. :-(

I LOVE the AutoFruitcupomater™ idea!

Argh, sorry to hear that. :/ LOL at the Autofruitcupomater. We'll have to pitch the idea to Charles!

425 redc1c4  Fri, May 16, 2008 2:17:13am

re: #420 littleoldlady

gop! :-)

The truth is, I DON'T want to be awake at 5 am.

/seems I have no choice in the matter tho. :-(

I LOVE the AutoFruitcupomater™ idea!

is that anything like a Sledge-0-Matic?

426 redc1c4  Fri, May 16, 2008 2:19:43am

re: #423 littleoldlady

Fenway finally cleaned up his act. Now if only we could get red to keep his grubby little hands out of the cookie jar fruitcup.

Hiya Jimmy! :-)

being awake @ 0500 is no big deal...

as long as yer on the way to bed. %-)

/little?

427 littleoldlady  Fri, May 16, 2008 2:22:11am

re: #425 redc1c4

Only when YOU serve the fruitcup.

Morning, red. :-)

The opposite of "on the way to bed" is "dammit, I'm awake."

428 aboo-Hoo-Hoo  Fri, May 16, 2008 2:26:29am

Are terrorists ‘phone banking’ for Barack?

[...]
The Obama campaign claimed to be “flattered” by the Hamas endorsement, and Obama himself told the Atlantic magazine that he understands why Hamas would support him.

“It's conceivable that there are those in the Arab world who say to themselves, 'This is a guy who spent some time in the Muslim world, has a middle name of Hussein and appears more worldly and has called for talks with people, and so he's not going to be engaging in the same sort of cowboy diplomacy as George Bush.’"
[...]
The support that Obama is receiving from avowed terrorist enemies of America should bother him. The fact that it does not bother him should bother us even more than the fact that terrorists see something in him that they really like.

Obama for President = blood-in-the-water anywhere within miles of a shark.

They'll be worked into an unparalleled killing frenzy come November, no matter which way the election turns-out.

Mornin folks.

429 gop_patriot  Fri, May 16, 2008 2:34:12am

re: #428 aboo-Hoo-Hoo

Good morning. That is scary, and another scary thing is what might happen if Obama doesn't get the nomination- or if he does, doesn't with the election. Many of his followers are fanatics.

430 littleoldlady  Fri, May 16, 2008 2:34:14am

aboo! :-)

You know, I sent that to a bunch of relatives the other day and I was just about to go respond to my young cousin's return email:

Don't you think it might be a step in the positive direction to have a President that at least some people in the middle east can tolerate? I mean, I know we're all infidels, but if they think that Obama can help, it means they might be open to some sort of change, or at least listening to what he has to say.

Okay, so he just graduated college a few days ago. Maybe he has an excuse for such naivete...

/or not
//I am about to do some disabusing of notions

431 gop_patriot  Fri, May 16, 2008 2:36:42am

re: #430 littleoldlady

"change", ack. LOL

Go get 'em!

432 Killian Bundy  Fri, May 16, 2008 2:36:53am
433 redc1c4  Fri, May 16, 2008 2:39:06am

re: #430 littleoldlady

aboo! :-)

You know, I sent that to a bunch of relatives the other day and I was just about to go respond to my young cousin's return email:

Okay, so he just graduated college a few days ago. Maybe he has an excuse for such naivete...

/or not
//I am about to do some disabusing of notions

try a "Clue-by-Four"... %-)

434 Killian Bundy  Fri, May 16, 2008 2:41:48am

Warning! Obesity can lead to really heavy weather

Obesity contributes to global warming, too. Obese and overweight people require more fuel to transport them and the food they eat, and the problem will worsen as the population literally swells in size, a team at the London School of Hygiene & Tropical Medicine says.

/unbelievable

435 redc1c4  Fri, May 16, 2008 2:41:53am

re: #430 littleoldlady

aboo! :-)

You know, I sent that to a bunch of relatives the other day and I was just about to go respond to my young cousin's return email:

Okay, so he just graduated college a few days ago. Maybe he has an excuse for such naivete...

/or not
//I am about to do some disabusing of notions

there is always the dreaded Troll Hammer.

436 littleoldlady  Fri, May 16, 2008 2:42:32am

re: #429 gop_patriot

Good morning. That is scary, and another scary thing is what might happen if Obama doesn't get the nomination- or if he does, doesn't with the election. Many of his followers are fanatics.

My prediction*:

1. Obama will get the nomination, in fact he already has it. The Dems can't afford NOT to give it to him for fear of alienating their black voter base.

2. Obama will lose.** And those of us who didn't vote for him will be labeled "racists". And the Grievance Theatre goes on and on...

*Disclaimer: If I was any good at predictions I'd have won the Powerball already and would be typing this from my beachfront mansion.

**Unless he is forced to choose Hillary as a running mate, in which case we're in deep doo-doo.

437 aboo-Hoo-Hoo  Fri, May 16, 2008 2:43:21am

re: #430 littleoldlady

If Obama and Co. had the slightest shred of honesty and didn't have any intention of playing into terrorist demands they would have come-out with extremely sharp retort, ie., they'd be harder on Hamas - whomever - than Bush.

But...oooh no, they provide the SoB's with script.

438 goddessoftheclassroom  Fri, May 16, 2008 2:43:41am

Good morning, Lizards!

439 redc1c4  Fri, May 16, 2008 2:44:16am

re: #434 Killian Bundy

Warning! Obesity can lead to really heavy weather

/unbelievable

naw... *this* is "Unbelieveable".

440 littleoldlady  Fri, May 16, 2008 2:45:54am

re: #435 redc1c4

If I have to send my Jewish family who spent part or all of last summer in Israel the Troll Hammer...then I may as well just give up now. :-(

goddess! :-)

441 gop_patriot  Fri, May 16, 2008 2:46:18am

re: #434 Killian Bundy

Warning! Obesity can lead to really heavy weather

/unbelievable

OH MY GOSH. This is like Bizarro-world, some ridiculous b-grade sci-fi movie alternate universe. What the heck kind of world is this when a person with this idea is printed up in the papers, and not laughed out of the room? GAH!
From the article:

Because thinner people eat less and are more likely to walk than rely on cars, a slimmer population would lower demand for fuel for transportation and for agriculture, Edwards said. This is also important because 20% of greenhouse gas emissions stem from agriculture, he added. - Reuters

And what are they going to do about those pudgier folks? Going to send them to government mandated fat-farms? OH this is just getting to be a really, really unfunny joke.

442 redc1c4  Fri, May 16, 2008 2:47:27am

re: #437 aboo-Hoo-Hoo

If Obama and Co. had the slightest shred of honesty and didn't have any intention of playing into terrorist demands they would have come-out with extremely sharp retort, ie., they'd be harder on Hamas - whomever - than Bush.

But...oooh no, they provide the SoB's with script.

if they had the slightest shred of honesty, they wouldn't be politicians...

JMHO. %-)

443 aboo-Hoo-Hoo  Fri, May 16, 2008 2:48:10am

re: #432 Killian Bundy

Oh great...

444 littleoldlady  Fri, May 16, 2008 2:49:27am

re: #442 redc1c4

DING! :-)

445 redc1c4  Fri, May 16, 2008 2:50:26am

re: #440 littleoldlady

If I have to send my Jewish family who spent part or all of last summer in Israel the Troll Hammer...then I may as well just give up now. :-(

goddess! :-)

none are so blind as those who will not see.

/never forget, never quit.

i'd say "never forgive" too, but i'm a goyim/redneck, and you *know* what we're like. %-)

446 gop_patriot  Fri, May 16, 2008 2:50:56am

re: #436 littleoldlady

I agree, it's very unlikely that Hillary will get the nomination.

And, I sure hope you're right on the "Obama will lose" part... :/

UGH The idea of Shrillary and Barak together just makes me cringe. But they've each got such egos that if made to work together, they might just implode into a black hole and take half the world with them.

/if you win powerball, I expect an invite to the beachfront mansion. Just saying.

447 bystander  Fri, May 16, 2008 2:52:07am

re: #406 Alberta Oil Peon
You are correct. The name of that "psychoanalysis" program is ELIZA.
[Link: en.wikipedia.org...]
There's even a version on the web: [Link: www.manifestation.com...]

448 littleoldlady  Fri, May 16, 2008 2:55:37am

re: #446 gop_patriot

Oh yeah! BIG LGF party!

/probably ongoing... ;-)

449 Fenway_Nation  Fri, May 16, 2008 2:56:22am

re: #437 aboo-Hoo-Hoo

If Obama and Co. had the slightest shred of honesty and didn't have any intention of playing into terrorist demands they would have come-out with extremely sharp retort, ie., they'd be harder on Hamas - whomever - than Bush.

But...oooh no, they provide the SoB's with script.

I remember when Pee-ew-go Chavez was calling Bush the devil at the UN. Hillary and Schumer at least pretended to be outraged by Chavez's over-the top speech...

450 MandyManners  Fri, May 16, 2008 3:00:12am

I feel like a white mouse.

451 Aylios  Fri, May 16, 2008 3:00:18am

Sniff, I couldnt find any comment in there by me. I obviously don't post enough.

If they do the same thing with DailyKos as part of their experiment, then perhaps it's neutral, but being academia, they most likely regard a Kossack as the epitomy of neutrality anyway, hence no need to 'experiment' on the Kossacks.

452 aboo-Hoo-Hoo  Fri, May 16, 2008 3:00:41am

re: #442 redc1c4

Hillary's a well documented chronic liar but Obama & Co. will make her look innocuous in comparison...like Little Red Riding Hood.

453 littleoldlady  Fri, May 16, 2008 3:02:29am

re: #452 aboo-Hoo-Hoo

Hillary's a well documented chronic liar but Obama & Co. will make her look innocuous in comparison...like Little Red Riding Hood.

And that was BabbaZee's theory of how Hillary would get the nomination. But I think the Obamanation snowplowed over her.

454 littleoldlady  Fri, May 16, 2008 3:03:39am

re: #450 MandyManners

But such a cute one!

Mandy! :-)

455 MandyManners  Fri, May 16, 2008 3:07:32am

re: #454 littleoldlady

Aw, shucks!

I reckon I'm just a TWM.

456 littleoldlady  Fri, May 16, 2008 3:10:27am

re: #455 MandyManners

Aw, shucks!

I reckon I'm just a TWM.

Join the club! ;-)

457 goddessoftheclassroom  Fri, May 16, 2008 3:11:29am

re: #450 MandyManners

Explain?

458 aboo-Hoo-Hoo  Fri, May 16, 2008 3:14:48am

re: #453 littleoldlady

This whole election stinks. I still can't figure-out how or who's behind Obama that he could even get in the running, we're talking very, very serious smoke-and-mirrors. Hell, from my perspective the US looks like a fish on the cutting-board about to be scaled, gutted and filleted.

ugh...

459 MandyManners  Fri, May 16, 2008 3:15:00am

re: #456 littleoldlady

Join the club! ;-)

Darlings!

460 MandyManners  Fri, May 16, 2008 3:15:22am

re: #457 goddessoftheclassroom

The experiment!

461 littleoldlady  Fri, May 16, 2008 3:17:12am

re: #458 aboo-Hoo-Hoo

This whole election stinks. I still can't figure-out how or who's behind Obama that he could even get in the running, we're talking very, very serious smoke-and-mirrors. Hell, from my perspective the US looks like a fish on the cutting-board about to be scaled, gutted and filleted.

ugh...

SOROS.

/and WHOA, aboo!

462 opnion  Fri, May 16, 2008 3:23:40am

Good Morning Lizards. I hope that no one in any way said anything that could possibly be costrued as a criticism of Obama overnight.As Michelle would say"This isn't good for my kids."
I wonder if Dems calling Bush a War Criminal was good for the twins.

463 littleoldlady  Fri, May 16, 2008 3:24:22am

re: #461 littleoldlady

Addendum:

Our elections have been about Who Has The Most Money for quite some time. We should have done something about that a looong time ago.

THIS nominating process was determined by Iowa, New Hampshire and the media. Add to that the Democrats "super delegate" system, which could turn into a situation of the elites picking the candidate...obviously we have given up the right to vote for our own President.

:-(

464 aboo-Hoo-Hoo  Fri, May 16, 2008 3:27:17am

re: #461 littleoldlady

Was/is Soro's the story behind Obama? I honestly don't know but it wouldn't surprise me.

465 LeftJustAintRight  Fri, May 16, 2008 3:29:58am

I have always been told be careful what you say
It will be used against you in one way or another
Because you posted on a thread and replyed to a troll you are now liable
Here come the blog lawsuits

466 littleoldlady  Fri, May 16, 2008 3:33:05am

Soros and Obama.

Considering the amount of money Obama is PROBABLY receiving from overseas (via the web - it's hard to track) I wouldn't be surprised if the Saudis have their hands in it as well.

467 aboo-Hoo-Hoo  Fri, May 16, 2008 3:41:05am

re: #466 littleoldlady

It makes sense. Soros is persona-non-grata in so many states I'd love to plop him down in one with a note: He's yours along with all the money you can pilfer - we'll even help where we can - just don't even think about cutting him loose.

468 littleoldlady  Fri, May 16, 2008 3:47:02am

re: #467 aboo-Hoo-Hoo

HA! :-)

Sooner or later people will catch on.

/I just hope it's not too late... :-(

469 aboo-Hoo-Hoo  Fri, May 16, 2008 3:53:48am

re: #468 littleoldlady

You and me both. :(

470 Golem14  Fri, May 16, 2008 4:14:58am

The word-count list in the directory above it ('LGF_ctf_dtf.txt') is a hoot; it has a pretty big list of variations of "bwa ha ha":

bwaaahahahahahahahahahah 1 1
bwaaahahahahhahahaha 1 1
bwaaahahhahahaha 1 1
bwaaahahahahahah 1 1
bwaaahahahhahahahahaha 1 1
bwaaahahaha 1 1
bwaaahahahahahahhahahaah 1 1
bwaaahaaa 2 2
bwaaahaaa 1 1
bwaaahaaahaaahaha 2 1
bwaaahaha 1 1
bwaaahaahahahah 1 1
bwaaahahaha 1 1
bwaaahahahahahahahaha 1 1
bwaaahhahahahahaha 1 1
bwaaah 1 1
bwaaahaaahaaa 1 1
bwaaahaaahaaahaaahaaahaaa 1

--and that's just part of it.

/bwa ha ha

471 Syrah  Fri, May 16, 2008 4:37:48am

re: #450 MandyManners

I feel like a white mouse.

For some fun, read The Hitchhiker's Guide to the Galaxy.

(The white mice are the ones running the experiment.)

The book is very good. The movie sucked bigtime.

472 celtic templar  Fri, May 16, 2008 4:49:00am

Probably a dead thread, but I found a few things digging around:

Shaking a bush, poking a shrub is the title of a page in the directory for a project called CarpetBagger and research reported at the bottom of the page:
[Link: www.cs.cmu.edu...]

Tae Yano seems to be the researcher (just traversed to ~taey)
[Link: www.cs.cmu.edu...]

This looks like the list of posts that are analyzed
[Link: www.cs.cmu.edu...]

473 celtic templar  Fri, May 16, 2008 4:50:46am

Here's the researcher's page (user name taey):
Tae Yono

474 stevieray  Fri, May 16, 2008 4:53:15am

Imagine a natural language program that could respond to comments with charm and style, sort of a robo-blogger.

Now imagine an army of them, all set to monitor a different political blog, run by a campaign manager for a politician. Add to its writing ability an encyclopedic memory, with instant access to famous quotes, historical facts, trivia, statistics, and every word ever uttered by the opposition.

You now have an army of ultimate bloggers, all completely under the control of one campaign manager... no more "going off message" by some underpaid/volunteer lackey, just high quality counter-opinion, ready to be inserted into the blogs of anyone who disagrees with your candidate.

This research will eventually lead to robo-blogging to kill emerging scandals and alternative opinions on issues... no more Rathergates as they will be smothered in the cradle by the most charming bloggers around -- the poli-bots.

475 stevieray  Fri, May 16, 2008 4:56:09am

grrr... gotta go to work now.

476 Syrah  Fri, May 16, 2008 4:57:23am

re: #474 stevieray

Imagine a natural language program that could respond to comments with charm and style, sort of a robo-blogger.

Now imagine an army of them, all set to monitor a different political blog, run by a campaign manager for a politician. Add to its writing ability an encyclopedic memory, with instant access to famous quotes, historical facts, trivia, statistics, and every word ever uttered by the opposition.

You now have an army of ultimate bloggers, all completely under the control of one campaign manager... no more "going off message" by some underpaid/volunteer lackey, just high quality counter-opinion, ready to be inserted into the blogs of anyone who disagrees with your candidate.

This research will eventually lead to robo-blogging to kill emerging scandals and alternative opinions on issues... no more Rathergates as they will be smothered in the cradle by the most charming bloggers around -- the poli-bots.

The ultimate "all your base are belong to us."

477 hopefulone  Fri, May 16, 2008 5:02:27am

re: #388 gop_patriot

Barely. Two strikes against a simple "Yes." The Washington, DC area is heavily anti-America and we are being overrun with illegal invaders. Other than that I'd say I am a proud citizen of the greatest country on earth: USA.

478 Thanos  Fri, May 16, 2008 5:03:21am

Hrmm. This is interesting to a degree, now if only the researcher had read Vinge she might actually get to something productive and useful.

In his last novel (which sucked to a certain extent, however Vernor always introduces new concepts...) he posited that TNBT might be YMAWM. (I might have that last acronym wrong.)

The protagonist(s) race to find out who has discovered that "The Next Big Thing" is a method of positioning things in cyberspace in a manner that everyone must agree with the meme. Hence "You must agree with me" YMAWM or something like that... read it last year and I can't remember.

479 Sunlight  Fri, May 16, 2008 5:08:30am

Instead of this statistical analysis, Carnegie Mellon would do well to do a value added content analysis of the public service that Charles did on exposing the white power infiltration of the anti-jihadi activities. But the academics have their template and really don't want to know anything that doesn't fit the template. I've seen it. Info bounces off of them and give them a blank look around the eyes.

480 quickjustice  Fri, May 16, 2008 5:19:38am

This could be one of LGF's best and funniest threads ever! I'm no techie, but using Charles's original link, I came up with her identity in about ten seconds:

[Link: www.cs.cmu.edu...]

The detective work here is right on target. She explains her purpose in [Link: www.lti.cs.cmu.edu...] as follows:

"We are exploring automatic methods for analyzing text in the political domain, specifically blog posts on topics pertinent to the 2008 United States Presidential Elections. Political text is often indirect, sarcastic, repetitive, hyperbolic, emotional, biased, manipulative, and riddled with unstated assumptions. Our aim is to automatically separate useful, thoughtful information from redundant "spin," using statistical natural language processing techniques and a data-driven methodology that makes use of the insights of political scientists.
The broader impact of this work will consist of a renewed emphasis exploiting domain knowledge together with text data for more powerful natural language understanding technology, as well as software tools that will promote more informed decision-making among American voters."

481 celtic templar  Fri, May 16, 2008 5:23:36am

Still poking the tree or whatever - from taey's directory:

The Political Blogosphere and the 2004 U.S. Election:
Divided They Blog

482 quickjustice  Fri, May 16, 2008 5:32:32am

She's trying for a data-mining tool tailored for blogs that separates "useful, thoughtful" information from all the mindless dreck in the blogosphere. Lotsa luck with that!

As an aside, I've been on the Carnegie Mellon campus, toured the Computer Science department, and met with CS faculty. It's a gorgeous campus. The school clearly has big bucks. CMU holds numerous contracts with various government agencies related to the information technology aspects of defense, computer security, homeland security, and similar "black ops" topics. At least some people on that campus have intimate access to NSA, DOD, and CIA. It's a spooky place.

CMU is up there with MIT and Cal Tech as a "tech savvy" university.

483 Izzy Dunne  Fri, May 16, 2008 5:33:44am

re: #363 Clemente

I'm naming my next sock "_meta_end_dot_"!

No, you're not - that's MY sock name. I'M FAMOUS!

484 Izzy Dunne  Fri, May 16, 2008 5:35:13am

re: #480 quickjustice

Political text is often indirect, sarcastic, repetitive, hyperbolic, emotional, biased, manipulative, and riddled with unstated assumptions.

Just like real speech!

485 Izzy Dunne  Fri, May 16, 2008 5:37:09am

re: #480 quickjustice

Our aim is to automatically separate useful, thoughtful information from redundant "spin," using statistical natural language processing techniques and a data-driven methodology that makes use of the insights of political scientists.
The broader impact of this work will consist of a renewed emphasis exploiting domain knowledge together with text data for more powerful natural language understanding technology, as well as software tools that will promote more informed decision-making among American voters."

We're from academia, and we're here to help.

486 Roger  Fri, May 16, 2008 5:37:57am

re: #181 itellu3times

Carnegie-Mellon, I expect it's simply natural language processing.

Yep that's what it is. They do a lot of it there.

487 Roger  Fri, May 16, 2008 5:41:38am

re: #482 quickjustice

She's trying for a data-mining tool tailored for blogs that separates "useful, thoughtful" information from all the mindless dreck in the blogosphere. Lotsa luck with that!

As an aside, I've been on the Carnegie Mellon campus, toured the Computer Science department, and met with CS faculty. It's a gorgeous campus. The school clearly has big bucks. CMU holds numerous contracts with various government agencies related to the information technology aspects of defense, computer security, homeland security, and similar "black ops" topics. At least some people on that campus have intimate access to NSA, DOD, and CIA. It's a spooky place.

CMU is up there with MIT and Cal Tech as a "tech savvy" university.

No they don't have access to NSA, DOD, and CIA. NSA stays mum and doesn't give them anything in return.

488 doriangrey  Fri, May 16, 2008 5:43:33am

re: #485 Izzy Dunne

We're from academia, and we're here to help.

Now that is a truly oxymoronic statement if ever there was one...

489 Syrah  Fri, May 16, 2008 5:47:50am

We should, for amusements sake, keep an eye open for a KOS diary about this. There may be some entertaining histrionics and conniption fits over their being the unwitting subject of DARPA funded research.

490 tgrossner  Fri, May 16, 2008 5:49:25am

So I just emailed her and asked her what this is for. Maybe I will get a response, maybe not, but worth a shot.

491 Roger  Fri, May 16, 2008 5:49:29am

re: #489 Syrah

yep, lol!

492 doriangrey  Fri, May 16, 2008 5:50:32am

re: #489 Syrah

We should, for amusements sake, keep an eye open for a KOS diary about this. There may be some entertaining histrionics and conniption fits over their being the unwitting subject of DARPA funded research.

But but but... Markos is a CIA agent... dKos is a DARPA funded research project...

493 Thanos  Fri, May 16, 2008 5:50:58am

re: #484 Izzy Dunne

Just like real speech!


She probably also doesn't understand the concept of socks, moby's, trolls, and thread-punking.

494 Izzy Dunne  Fri, May 16, 2008 5:53:36am

re: #490 tgrossner

So I just emailed her and asked her what this is for.

Professor, I think that this subject is self-aware - what do we do now?

495 Roger  Fri, May 16, 2008 5:54:47am

re: #494 Izzy Dunne

lol!

re: #493 Thanos

Oh I'll eat my shorts if the lgf dictionary wasn't applied.

496 Syrah  Fri, May 16, 2008 5:56:01am

re: #492 doriangrey

But but but... Markos is a CIA agent... dKos is a DARPA funded research project...

Don't spill the beans. The Koslings haven't figured any of that out yet. Agent Markos will have a rough time of it when he is exposed as being a double secret agent of the Zionist conspiracy. Don't blow his cover.

497 Izzy Dunne  Fri, May 16, 2008 5:57:56am

dk_ct1 := { 0.469072820883,
0.402917000567,
0.125140657881,
0.192217215807,
0.109802831418,
0.265367746874,
0.471913445441,
0.111597441468,
0.0840505921946,
}


lgf_ct1 := { 0.6901055333,
0.700371724223,
0.855324151661,
0.642811712371,
0.565627987034,
0.510338018682,
0.488326831601,
0.591053141423,
0.72730225358,
0.499085813356,
0.665689731523,
0.424181443399,
0.388326562854,
0.708542529981,
0.816460766851,
0.386040095365,
0.545539785374,
0.560502934137,
0.443370021332,
0.42877166625,
0.498025999338,
}

From this, we conclude that LGF not only has MORE numbers than DKOS, but BIGGER numbers as well.
If YOU TOO want bigger numbers, choose LGF brand blogs.

498 Roger  Fri, May 16, 2008 5:59:38am

Look in LGF_ctf_dtf.txt and find your nic. See the number of hits. BabbaZee is up there:-)

499 Shr_Nfr  Fri, May 16, 2008 6:01:07am

Give him a call [Link: people.cs.cmu.edu...] and tell him that your username is copyrighted. His use of it in any form constitutes a violation of copyright and demand that he remove it from his database. That ought to mess up his mind for a while.

500 quickjustice  Fri, May 16, 2008 6:03:17am

re: #487 Roger

Roger: I didn't mean to state or imply that CMU has "access" to the spooks' databases, for example (I don't know), but I'm certain that CMU has many contracts with the national security establishment, including DARPA. Boo! ;-)

501 Syrah  Fri, May 16, 2008 6:04:19am

re: #494 Izzy Dunne

Professor, I think that this subject is self-aware - what do we do now?

Probably won't have to do anything.

Our awareness of this study will not have any significant long term impact on her research with th possible exception of having to accommodate for nics that vary on the "meta_dot" theme.

We will do and say what we have been doing and saying.

502 Izzy Dunne  Fri, May 16, 2008 6:05:26am

re: #498 Roger

Look in LGF_ctf_dtf.txt and find your nic. See the number of hits. BabbaZee is up there:-)

Alright, alright, who has the nic of "_meta_beep_ref_lithium_meta_beep_ref_"?

503 Roger  Fri, May 16, 2008 6:07:02am

re: #500 quickjustice

There's a joke amongst them about how simple and bland the NSA folks presentations are at symposiums:-)

504 lawhawk  Fri, May 16, 2008 6:07:13am

I'm late to this party, but Carnegie Mellon did run a project to determine which blogs are the most informative and chose the top 100 blogs based on the efficiency of determining news. This could be another project along those lines - to see how information gets distributed along networked groups.

In that project, mine scored number 11 overall. Beating me out? Instapundit, Malkin, Surber, and a couple other big name blogs (LGF was not included surprisingly enough).

505 Roger  Fri, May 16, 2008 6:10:57am

re: #504 lawhawk

Cool!

506 Roger  Fri, May 16, 2008 6:13:10am

Oops. The directory index is now gone.

507 Roger  Fri, May 16, 2008 6:14:34am

I was going to search it for Gramscian Whore.

508 abolitionist  Fri, May 16, 2008 6:16:46am

re: #470 Golem14

re: #470 Golem14

The word-count list in the directory above it ('LGF_ctf_dtf.txt') is a hoot; it has a pretty big list of variations of "bwa ha ha":

[snip]

--and that's just part of it.

/bwa ha ha

That file appears to be part of a report on the frequency of occurrence of various strings, as pairs of numbers.
[Link: www.cs.cmu.edu...]

Note the phrase "experiments/post_similarity" as part of that subdir name.

509 Roger  Fri, May 16, 2008 6:19:04am

gramscian 59 37
whore 178 108

Hopefully by now they know the definition:-)

510 abolitionist  Fri, May 16, 2008 6:20:43am

re: #508 abolitionist

Obliviously, I didn't read the whole thread.

511 Izzy Dunne  Fri, May 16, 2008 6:23:15am

re: #506 Roger

Oops. The directory index is now gone.

Professor, the mice are not staying in their cages!
EEEK!

Document not found

The requested URL [Link: www.cs.cmu.edu...] does not exist. Please check your spelling and capitalization and try again.

If you believe there may be a problem with this web server, send a full copy of this error message and a description of what you were expecting to webmaster@cs.cmu.edu.

Do not send mail before checking the web tips page listed above!
You may also want to notify the maintainer of the referring page (use your browser's "Back" button).

512 chinesearithmetic  Fri, May 16, 2008 6:29:16am

Damn, and to think I made an eight-hour each-way overnight bus trip to see the Tartans beat Bethany.

513 soccerdad  Fri, May 16, 2008 6:29:57am

why did they shut it down?

514 mollyshark  Fri, May 16, 2008 6:31:26am

Was just thinking that all you people have too much time on your hands...but then it occurred to me I'm sitting here reading all this. Pot calls kettle black, wins award. More coffee needed.

515 abolitionist  Fri, May 16, 2008 6:38:44am

I went up-directory a little, manually, to [Link: www.cs.cmu.edu...]
and found this:

Talking_Points

Blog Posts
TP categories ... 1st take.
TP categories ... 2nd take.

Political blog post - parmlink collection
Political blog post - flat text collection for the 1st round
Political blog post - flat text collection for the 1st round - unzipped

Political blog post - flat text collection for the 2nd round
Political blog post - flat text collection for the 2nd round - unzipped

Political blog post - flat text collection for the 2nd round - CAT comp xml

For developers (blog corpus sample)


These items are/were links, but now seem to be dead.

516 Izzy Dunne  Fri, May 16, 2008 6:45:03am

re: #490 tgrossner

So I just emailed her and asked her what this is for..


You tipped them off, and now they cut off access!

517 ploome hineni[deleted]  Fri, May 16, 2008 6:45:29am
518 clgood  Fri, May 16, 2008 6:45:35am

Looks like the whole thing has gone 404. My guess is that she just wanted a corpus of data for some programming project and, now that the object under study is aware of her, it's no longer useful. I don't see any dark purpose here.

How evil can someone be who writes knitting software?

519 Izzy Dunne  Fri, May 16, 2008 6:47:35am

re: #513 soccerdad

why did they shut it down?

I suspect that if the mice KNOW they are in an experiment _meta_end_dot_, they will not produce the same results _meta_end_dot_ as they would otherwise, _meta_end_dot_ thus invalidating the experiment _meta_end_dot_.

520 Dad O' Blondes  Fri, May 16, 2008 6:55:05am

re: #21 zombie

Talk about pointless.

People get PhDs for this crap.

Not really pointless. The "value" of this may be questionable though, at this point in time.

As the web and bogosphere has grown exponentially in influence, there has been great interest in determining if real life outcomes can be affected by influential opinions posted on blogs -- and then re-created in numerous other places to mimick majority viewpoints.

There are numerous companies invloved in this research-like activity which can be tailored for marketing, business intelligence financial trading, political campaign managment uses (for example, Umbria of Denver, CO was just sold to JD Power).

They use primarily data visualization tools (like those used now by lawyers engaged in electronic data discovery) --

These are similar to the software tools used by intelligence services to monitor, track, analyze, for example, wireless (web/phone) transmissions and discussions originating in the US, and destined for overseas delivery in places like, say, Iran.

.

521 Kenneth  Fri, May 16, 2008 6:55:59am

Here's another paper at the website:

The Political Blogosphere and the 2004 U.S. Election:
Divided They Blog

4 March 2005
Abstract

In this paper, we study the linking patterns and discussion topics of political bloggers. Our aim is to measure the degree of interaction between liberal and conservative blogs, and to uncover any differences in the structure of the two communities. Specifically, we analyze the posts of 40 “A-list” blogs over the period of two months preceding the U.S. Presidential Election of 2004, to study how often they referred to one another and to quantify the overlap in the topics they discussed, both within the liberal and conservative communities, and also across communities. We also study a single day snapshot of over 1,000 political blogs. This snapshot captures blogrolls (the list of links to other blogs frequently found in sidebars), and presents a more static picture of a broader blogosphere. Most significantly, we find differences in the behavior of liberal and conservative blogs, with conservative blogs linking to each other more frequently and in a denser pattern.

Kind of interesting study on the interactions of political blogs. not malevelant, but possibly useful and powerful. On a related note, I read somewhere that the Obama campaign now owns the largest database of Democratic voters ever compiled. This is a product of their significant use of the web in fund raising. This database will give Obama huge power within the Democratic party for years to come, no matter what happens in the election.

522 abolitionist  Fri, May 16, 2008 6:59:10am

re: #515 abolitionist

The "Talking_Points" item there points to another site (princeton.edu):
[Link: wordnet.princeton.edu...]

This appears to be a research tool, involving a Perl script.

523 Roger  Fri, May 16, 2008 7:09:30am

I think we were in temporary folder land.

524 Kosh's Shadow  Fri, May 16, 2008 7:09:50am

The've shut it. Now we'll never know the question whose answer is 42

525 Izzy Dunne  Fri, May 16, 2008 7:17:33am

re: #524 Kosh's Shadow

The've shut it. Now we'll never know the question whose answer is 42

What is 21 times 2?


I'll take "Arithmetic" for $400, Alex.

526 Pass The Moonbaticide  Fri, May 16, 2008 7:23:49am

Dang ... They've pulled it. So I'll never know what experiments
they've been performing on my comments. Double Dang !
Nor do I really care, Frankly. There are far more interesting Lizards than me. Some are certifiable, but we love them.

Yoo Hoo , Over Here , Investigators ...
Your test subjects are aware of your existence so the results are void.
Right ?

527 katemaclaren  Fri, May 16, 2008 7:27:38am

re: #524 Kosh's Shadow

The've shut it. Now we'll never know the question whose answer is 42

Oh yes, Hitchhiker capable of Deep Thought.
I do. What is the meaning of life? Or should I reverse that jeopardy-style?

528 Syrah  Fri, May 16, 2008 7:28:56am

re: #526 Pass The Moonbaticide

Yoo Hoo , Over Here , Investigators ...
Your test subjects are aware of your existence so the results are void.
Right ?

Not necessarily.

What behavior would you change?

529 Shug  Fri, May 16, 2008 7:31:37am

Shut down.

I have no idea what this was. I hate being completely in the dark. I feel like one of those undecided voters who has yet to make up his mind between McCain and Obama

completely clueless

530 katemaclaren  Fri, May 16, 2008 7:34:17am

re: #528 Syrah

Not necessarily.

What behavior would you change?

well, for one thing, we're talking about THEM.

531 ChipDWood  Fri, May 16, 2008 7:39:15am

Yea- shut down. I didn't get to it in time to even see what it was all about.

So, now I'm depressed. And stuff.

532 Pass The Moonbaticide  Fri, May 16, 2008 7:41:43am

re: #528 Syrah
True , that. We're not going to be any less the Lizard now that we know we're under the Investigator's lens.
PS Didn't note my comment hit count whilst the site was up, did you ?

533 Syrah  Fri, May 16, 2008 7:46:35am

re: #530 katemaclaren

well, for one thing, we're talking about THEM.

We talk about a wide range of topics.

A conversation about them will last only a couple of days.

If they were working on data mining strategies and techniques that were limited to only that narrow window in which we are talking about them, it might put a wrinkle in the study. For a study over a longer period of time, this moment of subject awareness will have very little if any measurable effect.

That which drives us to post on a public forum, and to add commentary that already can be read by anyone with a net connection will not be changed by the awareness that someone is looking. If we lived in a culture where our commentary could result in an invitation to a gulag, things might have changed significantly. Right now, I don't think anything will change except their database security features.

534 zmdavid  Fri, May 16, 2008 7:48:06am

Don't be too angry with the lizard who tipped them off, they would have cut off access anyway once they saw this thread. I'm disappointed I didn't see it before they cut it off.

535 Syrah  Fri, May 16, 2008 7:50:13am

re: #532 Pass The Moonbaticide


True , that. We're not going to be any less the Lizard now that we know we're under the Investigator's lens.
PS Didn't note my comment hit count whilst the site was up, did you ?

No I didn't notice.

I will see if I can find that for you and post it later this evening or tomorrow.

536 katemaclaren  Fri, May 16, 2008 7:50:53am

re: #533 Syrah

An intelligent response to my silly comment! Everyone but me seems brighter in the morning. ;-)
I was thinking of mice, too, how the mice must also look up occasionally and see the scientists eyeing them.

537 Raydog  Fri, May 16, 2008 7:51:56am

Big brother is watching.

538 Syrah  Fri, May 16, 2008 7:55:58am

re: #536 katemaclaren

An intelligent response to my silly comment! Everyone but me seems brighter in the morning. ;-)
I was thinking of mice, too, how the mice must also look up occasionally and see the scientists eyeing them.

I think that as long as the mice don't confuse us with Arther Dent, we will be safe.

Off to work.

BBL

539 Yankee Zionist  Fri, May 16, 2008 8:02:36am

re: #294 cargocultist


I agree.

540 Yankee Zionist  Fri, May 16, 2008 8:05:50am

We need William Gibson to get us out of this mess.

It's just too meta.

541 Joan  Fri, May 16, 2008 8:14:00am

re: #21 zombie

Talk about pointless.

People get PhDs for this crap.

Not pointless if properly exploited. "Carnegie Mellon" for status, with "Cross Checked Analysis" and other credible methodological phrases to charm and bedazzled the laity, fodder for feeble minded media propagandists.

Remember, there was that psychobabble study proving that conservatives are stooopiddder than liberals. Or some such.

Studies from places like CMU can serve our adversaries. Cherry-picking and tendentious research can be used to demonize and marginalize normality, distortion and ridicule can be bolstered with factoids. Yes, scholarship can be useful in the long march to serfdom that retrograde "progressives" have envisioned for the great unwashed proletariat. I cling to my guns, to my stunted creed and shun the outlanders and unbelievers...just as Leader Obama suspected all along.

542 coquimbojoe  Fri, May 16, 2008 8:14:37am

Has it been removed?

543 haakondahl  Fri, May 16, 2008 8:30:12am

re: #365 ghost707

Uh no.
The DOS was your idea. Not usually what I think of while in diags.
Sense of humor required.

If you run diags with the intent to bring service to a crawl, that is a DOS attack. As opposed to data destruction, data retrieval, or malware propogation attacks.

544 Steve  Fri, May 16, 2008 8:35:20am

I believe that it has been removed. I got this:

Document not found
The requested URL [Link: www.cs.cmu.edu...] does not exist. Please check your spelling and capitalization and try again.

If you believe there may be a problem with this web server, send a full copy of this error message and a description of what you were expecting to webmaster@cs.cmu.edu.


Do not send mail before checking the web tips page listed above!
You may also want to notify the maintainer of the referring page.

Carnegie Mellon University School of Computer Science


---

webmaster@cs.cmu.edu

545 debutaunt  Fri, May 16, 2008 8:37:35am

re: #531 ChipDWood

Yea- shut down. I didn't get to it in time to even see what it was all about.

So, now I'm depressed. And stuff.



Poor baby, and I say that with full knowledge that someone may connect the phrase 'poor baby' to another comment made about some completely different whine.

546 Dark_Falcon  Fri, May 16, 2008 8:47:45am

re: #542 coquimbojoe

Has it been removed?

Looks like it. If they are reading this site, it was only a matter of time till they caught on. That's only problem with what Charles and Zombie did: they taught the people in question a lesson in internet security. They'll not likely make the same mistake. Still, at least we made them look silly. :)

547 debutaunt  Fri, May 16, 2008 8:50:46am

re: #546 Dark_Falcon

Looks like it. If they are reading this site, it was only a matter of time till they caught on. That's only problem with what Charles and Zombie did: they taught the people in question a lesson in internet security. They'll not likely make the same mistake. Still, at least we made them look silly. :)

Yes! Our work is done - take the rest of the day off and have a great weekend!

548 Charles  Fri, May 16, 2008 8:55:39am

I would love to have seen the administrator's face when he/she saw thousands of hits coming from LGF...

549 Athos  Fri, May 16, 2008 8:58:13am

re: #546 Dark_Falcon

Looks like it. If they are reading this site, it was only a matter of time till they caught on. That's only problem with what Charles and Zombie did: they taught the people in question a lesson in internet security. They'll not likely make the same mistake. Still, at least we made them look silly. :)

There is a certain level of irony that those at Carnegie Mellon's Computer Science department working on this had to be taught a lesson in internet security. Carnegie Mellon is the home to
CERT As they say on their home page - 'We study internet security vulnerabilities, research long-term changes in networked system, and develop information and training to help you improve security."

This goes beyond looking silly. It's called looking pathetic.

550 Athos  Fri, May 16, 2008 8:59:42am

re: #548 Charles

I would love to have seen the administrator's face when he/she saw thousands of hits coming from LGF...

I would like to have seen a few faces when they realized just how far you browsed through their system...gatekeepers of internet security..buhhahahahaha

551 tonysdca  Fri, May 16, 2008 9:01:27am

re: #520 Dad O' Blondes

There are many legit commercial uses for both structured data mining, and unstructured textual analysis technology. About 8 years ago I ran a co. with a total of 1,100 employees, and over 150 of them were Ph.D. mathematicians. We had software that mined structured data and also analyzed unstructured data (email, CS comments, etc.). Put it all in a back-prop neural network, stir it up. Bingo. For one major telecom company, we could predict 60% of their customers that were going to churn (leave) within the next 90 days with

552 heretic  Fri, May 16, 2008 9:07:45am

So... then. How did Charles trip across the site in the first place, if it's supposed to be secure and everything? Is there an Enigma machine set up to surf the internet for all mentions everywhere of LGF?

553 Catttt  Fri, May 16, 2008 9:15:52am

Darn. I didn't get to look at it. Humpf.

554 yma o hyd  Fri, May 16, 2008 9:25:37am

re: #553 Cattt

Darn. I didn't get to look at it. Humpf.

Me neither, likewise, also, as well.
Now I'll never know.
Humpf indeed.

555 non-croyant  Fri, May 16, 2008 9:30:37am

My mental picture of most of the people commenting on this article:

[Link: www.thetravelrag.com...]

556 aaron  Fri, May 16, 2008 9:33:29am

re: #548 Charles

I would love to have seen the administrator's face when he/she saw thousands of hits coming from LGF...

I'd love to see their face when someone inquires with their Institutional Review Board regarding using the Lizardoid Minions as human research subjects.

Anyone here recall signing a consent form?

/...just sayin'

557 Yankee Division Son  Fri, May 16, 2008 9:35:39am

re: #149 Yankee Division Son

You're assuming they actually check the logs.. with directory browsing enabled and permissions open I tend to doubt it..

You were right again Charles!

/not shocked.. at all :)

558 non-croyant  Fri, May 16, 2008 9:44:52am

re: #549 Athos

There is a certain level of irony that those at Carnegie Mellon's Computer Science department working on this had to be taught a lesson in internet security.

I think it's sad that so many in these comments are making so much out of minor security issues. You're all chuffed because you taught yourselves something about simpleminded Web server administration when most of you wouldn't know what an NP-complete problem was if it slapped you in the face.

Get over yourselves. Most of you would be a damn lucky to have the opportunity to matriculate at Carnegie Mellon, especially in the CS dept there.

If Tae Yano is reading this she's laughing at you.

559 taxfreekiller[deleted]  Fri, May 16, 2008 9:58:33am
560 Athos  Fri, May 16, 2008 10:06:51am

re: #558 non-croyant

In InfoSec best practices, there is no reason why anyone from outside the network should be able to browse in the manner that was done. A system is either secure or it is insecure. The use of the term 'minor security issue' is just an effort to marginalize the embarrassment that they should feel. If it is only 'minor' its because the intent was benign rather than malicious.

As for choice of schools to matriculate from - that's nothing more than a non-sequitor ad hominen and immaterial. You don't know from what schools most of us matriculated from or our backgrounds. But, please, continue to add to your 4 or 5 posts per month and tell us once again how intellectually inferior we are supposed to be.

561 tonysdca  Fri, May 16, 2008 10:07:14am

re: #558 non-croyant

You make valid points, but Carnegie Mellon IS a US (and UK) defense contractor in computer science and technology, health care and other areas. Let's hope those areas are partitioned and much better managed for security.

562 Electron Shuffler  Fri, May 16, 2008 10:07:30am

re: #552 heretic

Charles was probably using his Lizard Overlord Tool to look at who was hitting LGF over time. Probably he noticed that something was acting like a indexing bot, (google, yahoo, msn etc.) but not doing bot like things like following random links. It was just following the links to the comments section for each main post.

563 DistantThunder  Fri, May 16, 2008 10:13:13am

Looks like I missed the unauthorized foray.

564 taxfreekiller[deleted]  Fri, May 16, 2008 10:26:48am
565 Electron Shuffler  Fri, May 16, 2008 10:42:45am

re: #564 taxfreekiller

Huh?

You mean other than the other gobs of money that are given to institutions of Carnegie Mellon's size as grants?
For how can I say... politically motivated research?

Please explain.

Work calls, I will read later.

566 taxfreekiller[deleted]  Fri, May 16, 2008 10:51:34am
567 soccerdad  Fri, May 16, 2008 11:23:42am

re: #548 Charles

I would love to have seen the administrator's face when he/she saw thousands of hits coming from LGF...

I would love to see how many hits came from LGF vs. KosKiddies.

Think they figured it out?

568 tgrossner  Fri, May 16, 2008 12:41:41pm

Everyone should take a look at the response I got from the email sent to Tae Yano in regards to their project. His response is pretty straightforward and I personally find nothing insidious about this project. We as a community put our thoughts and opinions out there in public, and we shouldn't fear the use of such information.

[Link: www.cs.cmu.edu...]

Tim

569 bosforus  Fri, May 16, 2008 1:03:37pm

re: #568 tgrossner

Everyone should take a look at the response I got from the email sent to Tae Yano in regards to their project. His response is pretty straightforward and I personally find nothing insidious about this project. We as a community put our thoughts and opinions out there in public, and we shouldn't fear the use of such information.

[Link: www.cs.cmu.edu...]

Tim

That is an awesome email response. Kudos to them for their work. They seem like decent enough people.

570 bosforus  Fri, May 16, 2008 1:05:24pm
We did look at "comment prediction" - trying to predict
which particular posts individuals will comment to (as a proxy for
predicting what they'd actually be interested in reading).

Sweet! I'd love to see those results!

571 PloniAlmoni  Fri, May 16, 2008 1:36:14pm

re: #568 tgrossner
Nice! The mystery is solved and a good time was had by all. Hooray!

572 haakondahl  Fri, May 16, 2008 1:46:34pm

re: #296 JustMyView

Nicely done! Will be curious about the response.

May 16, 2008

Dear Little Green Footballers,

We are Noah Smith and William Cohen, two faculty members at Carnegie
Mellon University. In collaboration with our Ph.D. student Tae Yano,
we have been studying text on the political blogosphere. Tae has been
posting various intermediate results of her research on her web page
for discussion among ourselves, and forgot to modify the read
permissions on the directory to make it invisible outside CMU.
There's nothing sensitive here - just research code and some
statistics we've calculated on blog data, including Little Green
Footballs and many other political blogs. We took the data down this
morning while deciding how to respond to you, but will probably put it
back up soon (perhaps with more understandable comments).

We are researchers in the fields of Machine Learning and Natural
Language Processing (that's what "NLP" stands for in this context, as
itellu3times guessed, not Neurolinguistic Programming). In a
nutshell, this means we are computer scientists who develop algorithms
that use text data in order to automate tasks involving text in
languages like English. As many of you guessed, this often involves
statistical analysis of text data. If the idea that computer programs
are "watching" your posts and comments and counting words is
disconcerting, consider that this sort of automated statistical
analysis is what makes search engines work (and probably what led you
to find our files).

Read on. There's more!

573 Electron Shuffler  Fri, May 16, 2008 1:49:39pm

re: #566 taxfreekiller

Thanks,

I always thought that G. Soros was left fuming that Kerry lost.
What they are up to, I bet that they are holding it until the election really gets rolling. It's still too early to show what they have. What part this has as step x in their planning. I don't really know. I have some tin foil hat ideas, but more likely what we had a peek at is some sort of macro analysis tool for evaluating
our opinions, and what effects their actions have on those opinions.

Soros, great at running corporations, a real stinker politically.
-
Time to go home and mow the lawn and then get myself a beer.

574 Roger  Fri, May 16, 2008 1:55:45pm

re: #568 tgrossner

Poor bastards are lib er als! :-)

It's cool and their reply and response is cool.

Say Noah, Tae and William, some questions it would be easier to just ask us! :-) We'd most likely speak our minds. Notice how heterogeneous the lgf community is where we have many things on which lgf'ers disagree.

/Bet you're glad browsers have added spell check :-)

//We'll try to work on the grammar:-)

///The lgf dictionary and some of the historical context might be helpful.

575 quickjustice  Fri, May 16, 2008 1:56:04pm

re: #568 tgrossner

Good work, Grossner! I found the people I dealt with at CMU to be very bright and very pleasant-- even the spooky ones, and the liberals! ;-)

576 Electron Shuffler  Fri, May 16, 2008 1:59:26pm

Nut's, the stuff that's found while you are typing.

Good Job!

Nice of them to let us know what was up.
Other researchers might have thrown a attitude and said that we have no right to know what they were doing, and to get lost.

I'ts great that they were straight with those of you who contacted them.

577 Hopefulone  Fri, May 16, 2008 2:39:42pm

re: #568 tgrossner

It's a great response but *my*concern is not privacy of what commenters say but how data can be used. "Comment prediction" has to be understood as a slice of the picture--and a picture which varies a lot from group to group. THAT would be interesting to see in their analysis. We can predict how some online characters will post, in general (trolls, flamers, whatever), but we certainly don't *know* them. How will the analysis be used?

578 tgrossner  Fri, May 16, 2008 3:26:30pm

re:#577 Hopefulone

Agreed, but behind every bush and tree is not a sniper. Until something proves itself to be a bad thing I give it the benefit of the doubt.

Its a given that anytime you contribute postings to the Interwebtubes, you lose "control" over it. Thats just a fact, and is part and parcel with why the Interwebtubes is a free and open model for the exchange of information.

Tim

579 taxfreekiller[deleted]  Fri, May 16, 2008 3:38:25pm
580 Celtic Templar  Fri, May 16, 2008 4:47:32pm

re: #558 non-croyant

I think it's sad that so many in these comments are making so much out of minor security issues. You're all chuffed because you taught yourselves something about simpleminded Web server administration when most of you wouldn't know what an NP-complete problem was if it slapped you in the face.

Get over yourselves. Most of you would be a damn lucky to have the opportunity to matriculate at Carnegie Mellon, especially in the CS dept there.

If Tae Yano is reading this she's laughing at you.

Hey, give me a week and I'll have amassed a boat load of information about these researchers that would scare them into never coming out of their basements.

And that is without violating any ethics clauses that I'm bound by. I'm not chuffed, I find it fun to track down those tracking me.

It's all a game and I play it better than you.

581 Hopefulone  Fri, May 16, 2008 6:03:44pm

re: #578 tgrossner

It certainly depends on how this analysis -> tool -> product -> practical application is used. Can it be exploited for deception which is very difficult to detect? Open algorithms certainly reduce that concern. I don't want the next generation of cooked-up journalism to pollute the network of knowledge with false knowledge derived from statistics. Give me hints of sources of information that I may want to access but don't present the statistical analysis as the new reality.

582 non-croyant  Fri, May 16, 2008 6:53:28pm

re: #580 Celtic Templar


It's all a game and I play it better than you.

Maybe and maybe not (I just happened upon this reminiscence while looking up my old posts in connection to another matter):

[Link: searchforjesus.beliefnet.com...]


ElCid22
8/26/2003 1:22 PM 14 out of 185

My personal threat level is at an all-time high right now. Actually, it sorta scares me when I take time to ponder its potential. My real-life, would-be enemies include:

1. My internet enemies Non-Croyant provided with my name, address and phone number.
2. The many people I helped send to prison or confinement therapy.
3. The gang-bangers up the street whom I confronted for flipping me off.
4. The father of one of the gang-bangers, who just happens to be a Juvenile Court Judge.
5. Somebody at work who hates my stinking guts and wants to put me down, hard.

Whenever the phone rings and I answer it, only to discover the slug on the other end chooses to remain to silent, I wonder which category he falls into. It's a good thing I pack heat. It might save my crimson redneck someday.

Fun times . I had even posted a satellite photo of his house.

Anyway, cybersleuthing is not a big deal when you have been online since 1989 and know what tools are available. It's a best a script-kiddie level of skill - not that it isn't fun.

I'd be far more impressed by these students who can prove whether or not a problem is NP-complete and can then offer a FPA for handling it.

583 WildBill  Fri, May 16, 2008 7:37:02pm

My complements to the researchers on their response. It seems open and doesn't ring hollow.

Look if you said they had queried for the underlying identity of everyone making comments I'd be upset/concerned but these comments are public; they and the rest of the world are welcome to them, and you can compile and discect etc. to your hearts content as long as the only reference is to my account name here on LGF :-)

584 Charles  Fri, May 16, 2008 7:47:49pm

re: #558 non-croyant

I think it's sad that so many in these comments are making so much out of minor security issues. You're all chuffed because you taught yourselves something about simpleminded Web server administration when most of you wouldn't know what an NP-complete problem was if it slapped you in the face.

Get over yourselves. Most of you would be a damn lucky to have the opportunity to matriculate at Carnegie Mellon, especially in the CS dept there.

If Tae Yano is reading this she's laughing at you.

This is one of the most ignorant comments ever posted at LGF on a tech subject, obviously from a naive idiot who knows nothing about web security.

There are no minor security issues, and anyone foolish enough to believe this is an exploit waiting to happen.

I hope you're not in charge of anything related to security at the University of Tennessee, Knoxville.

585 Charles  Fri, May 16, 2008 8:10:05pm

And after reviewing the other comments posted by "non-croyant," who needs this insulting, arrogant college kid around?

Banned.

586 Repj  Fri, May 16, 2008 8:13:18pm

Excellent ban, Charles. With all the privacy issues via the computer and cameras in RL, I think I should be allowed to walk around naked all the time. It's not like my clothes are actually hiding anything anymore. Think my kid will mind?

587 Render  Fri, May 16, 2008 9:04:34pm

William, Noah, and especially Tae,

It's been a pleasure to make your acquaintances. Very well done.

LIZARD
TOOL
#64,
R

588 freedombilly  Fri, May 16, 2008 10:12:28pm

Academics who are liberals?

I am shocked!

589 Thanos  Sat, May 17, 2008 5:58:25am

Here's the Vernor Vinge book I referenced above, I couldn't remember the title at the time. The book is weak in literary terms of conflict and character but strong on conceptualizing possible futures, I recommend it to all Lizards.

590 joegosox  Sat, May 17, 2008 6:38:34am

re: #585 Charles

Thank You.

591 Syrah  Sat, May 17, 2008 6:48:37am

re: #535 Syrah

No I didn't notice.

I will see if I can find that for you and post it later this evening or tomorrow.


I am not sure if I am posting the right thing here. This might be what some have called their "hitcount. When the research team puts this all back up, You might want to contact Tae directly for an explanation of how the data is collected and used. "hitcount" may be the wrong term.

At the former location of
[Link: www.cs.cmu.edu...]

About halfway down the page was:


moonbat 687 314
moonbatachusetts 2 2
moonbatacide 1 1
moonbatattack 1 1
moonbatbane 15 8
592 Skinless Frank  Sat, May 17, 2008 8:12:13am

@ #588: After reading their response, and going through the paper they cite, I find I don't care whether they describe themselves as liberals or not. They seem like reasonable, intelligent, curious, and open-minded people. I'm certain that if we were to disagree about something, it would be for reasons I could understand and respect, and I could leave the clue bat in the closet. The contrast with the true-believers on either side (Daily Kos or Brussels Journal, for example) is refreshing.

And Charles, the ban-hammer was used most appropriately.

593 efaust93  Sat, May 17, 2008 1:20:52pm

Security is rarely a concern in an academic environment. Where I work, we have to deal with academics a lot of the time (even with CMU sometimes) and security is rarely their top concern.

I do find it odd since CMU is one of the leaders in the Computer Security field. I guess, like most large entities, the departments don't talk to each other.

594 zerodamage  Sat, May 17, 2008 4:02:28pm

This is late a coming and probably no one will see this, but they have some serious issues with their web server. Now maybe this is coming from a router with a single IP forwarding ports to different machines, but still, this looks really bad. There should at least be some firewall rules keeping out the rest of the world from accessing ports that belong to the MYSQL server. I am no coder but I am a security guy and this scares me. I would never set up my web server with this much open to the world.

Every port you can imagine is open on this machine. TELNET! Even MYSQL is open to the world. Scary stuff. If someone is talking to these people, tell them to secure this thing. Wow!

Host michelangelo.srv.cs.cmu.edu (128.2.203.164)

Not shown: 1681 closed ports
PORT STATE SERVICE VERSION
7/tcp open echo
9/tcp open discard?
13/tcp open daytime Sun Solaris daytime
19/tcp open chargen
22/tcp open ssh OpenSSH 2.5.1p1 (protocol 1.5)
23/tcp open telnet BSD-derived telnetd
25/tcp open smtp
37/tcp open time (32 bits)
42/tcp filtered nameserver
53/tcp open domain
79/tcp open finger Sun Solaris fingerd
80/tcp open http Apache httpd 1.3.27 ((Unix) mod_pubcookie/3.0.0 beta3 mod_ssl/2.8.12 OpenSSL/0.9.6g)
111/tcp open rpcbind 2-4 (rpc #100000)
119/tcp open nntp?
135/tcp filtered msrpc
137/tcp filtered netbios-ns
138/tcp filtered netbios-dgm
139/tcp filtered netbios-ssn
443/tcp open ssl/http Apache httpd 1.3.27 ((Unix) mod_pubcookie/3.0.0 beta3 mod_ssl/2.8.12 OpenSSL/0.9.6g)
445/tcp filtered microsoft-ds
515/tcp filtered printer
544/tcp open kshell?
593/tcp filtered http-rpc-epmap
1433/tcp filtered ms-sql-s
1434/tcp filtered ms-sql-m
1439/tcp open eicon-x25?
1443/tcp open ies-lm?
1720/tcp filtered H.323/Q.931
1993/tcp filtered snmp-tcp-port
3389/tcp filtered ms-term-serv
4045/tcp open nlockmgr 1-4 (rpc #100021)
6050/tcp filtered arcserve
32776/tcp open status 1 (rpc #100024)

595 KCrary  Sat, May 17, 2008 7:21:24pm

I'm late to the party, but for what it's worth: CMU doesn't use a firewall. We made a decision to leave the network open and make security an individual responsibility. It's much more convenient that way; plus, with all the wireless access, a firewall would only give a false sense of security.

In this case, a student left some data on a publicly accessible web server. I'm sure she didn't expect the Lizard army to get ahold of it, but if she felt that data were actually sensitive, I'm sure she wouldn't have done so.

Most of the servers around here run the usual assortment of commercial and open-source software, so there's almost certainly attacks that could be run against them. I'd appreciate it if you didn't.

596 zerodamage  Sat, May 17, 2008 8:14:56pm

re: #595 KCrary

It is just a matter of time before that system gets hit some Russian or Chinese hacker. Look at the open services on that machine. Even if security is the individual's responsibility, it is still the University's responsibility to educate them. The same goes for the University I work at. It's sad enough that it is a computer science server.


This entry has been archived.
Comments are closed.

^ back to top ^

log in
Name:
Pass:

Register Forgot Your Password? My Account Re-send Confirmation (To log in, cookies must be enabled in your browser!)

► LGF Headlines

  • Loading...

► Top 10 Comments

  • Loading...

► Bottom Comments

  • Loading...

► Recent Comments

  • Loading...

► Tools/Info

► LGF Hits

► Slideshows

► Resources

► Never Forget

► Statistics

► Tag Cloud

► Contact

You must have Javascript enabled to use the contact form.
Your email:

Subject:

Message:


Messages may be published in our weblog, unless you request otherwise.
Tech Note:
Using the Contact Form

► News/Opinion

  • Loading...

More Partners

Compare Electricity Prices in your area. Texas Electricity is deregulated; you have the right to choose Texas Electric Rates from among many Texas Electric Companies.

Hell hath no fury like an accountant scorned.

Follow Lizardoid on Twitter

 Frank says:

Consider for a moment any beauty in the name Ralph. -- In an interview with Joan Rivers who had just asked him why he gave his children such odd names, Frank gave the reply above.