LGF
LGF Search: (advanced)
 
News Search:

  
log in
Name:
Pass:

Register (closed) Forgot Your Password? My Account Re-send Confirmation (To log in, cookies must be enabled in your browser!)
You must have Javascript enabled to use the contact form.
Your email:

Subject:

Message:


Messages may be published in our weblog, unless you request otherwise.
Tech Note:
Using the Contact Form

now playing

No CD playing.

Recently Played

Advertisement

Reading links...

  

Link address:
Link title:
Description: 
Remaining:

An LGF Experiment at Carnegie Mellon?

Thu, May 15, 2008 at 10:44:31 pm PST

Anyone know what this page at Carnegie Mellon means? It’s some kind of experiment that involves comments posted at LGF, and I have a feeling it’s not friendly.

Here’s a directory index.

UPDATE at 5/16/08 8:40:05 am:

They finally got around to closing their directory indexes; some administrator had quite a surprise this morning upon checking the server logs...

UPDATE at 5/16/08 2:49:52 pm:

A response from the researchers...

596 comments

  • Comments are open and unmoderated, and do not necessarily reflect the views of Little Green Footballs.
  • Obscene, abusive, silly, or annoying remarks may be deleted, but the fact that particular comments remain on the site in no way constitutes an endorsement of their views by Little Green Footballs.
  • Posts that contain phone numbers, addresses, or other personal information will also be deleted, as will posts that consist only of a variation on the word, "First!"
  • Comments that advocate violence will be cause for immediate banning with no appeal.
  • REMEMBER: posting comments at LGF is a privilege, not a right. Abuse that privilege, and your account will be blocked.

Hide comments

#1 pat 5/15/08 10:46:46 pm 0

No, but I don't think I should. Pat Benatar. Hmmmm.

#2 song_and_dance_man 5/15/08 10:47:18 pm 0

Jealously research?

#3 DesertSage 5/15/08 10:47:42 pm 1

Are they searching under the Denver airport yet?

#4 RTLM 5/15/08 10:48:41 pm 0

Gee, you guys are all famous !

#5 zombie 5/15/08 10:48:46 pm 0

Oh, that is weird.

Maybe it's post-modern poetry, academia-style? Using LGF comments as gibberish to transcend interpretation?

#6 Render 5/15/08 10:49:03 pm 0

[Link: www.cs.cmu.edu...]

A comparison of "post similarity" between LGF and Kos?

WEIRDNESS,
R

#7 song_and_dance_man 5/15/08 10:50:16 pm 0

song_and_dance 4 3
song_and_dance_man 2150 260
song_and_danceman 1 1
song_and_wrath_man 2 2
songanddanceman 2 1
songberd 1 1


I so want a sock called song and wrath man, but it's not my style. Not the wrath thing, but posting under a new name.

#8 Dan G. 5/15/08 10:50:17 pm 0

"/experiments/post_similarity/" (from the URL)

Suggests that it is an algoritm to detect people posting with multiple handle, maybe? To see if the posts' contents can be used to ID the poster w/o regard to the handle? Just guessing.

More info on this particular project...

#9 zombie 5/15/08 10:51:56 pm 0

Oooh, I'm in there too, several times. Here I am talking about "Anthem," the Ayn Rand novella (formatted Carnegie-Mellon stylee):

re _meta_number_ref_ zombie anthem is the only book of hers i ever recommend

you can tell you re over the target by the amount of flak you are getting _meta_end_dot_

re _meta_number_ref_ mandymanners amen

re _meta_number_ref_ zombie

re _meta_number_ref_ zombie jwm

re _meta_number_ref_ zombie anthem and it s more like a novella or long short story you can read it in a day or two _meta_end_dot_ highly recommended a completely devastating critique of collectivism and communistic society _meta_end_dot_ that s the only rand book that i have read i read it in _meta_number_ref_th grade along with animal farm

#10 Dan G. 5/15/08 10:51:58 pm 6

Ta Da!

Here is the paper.

#11 DesertSage 5/15/08 10:52:00 pm 0

That's some weird shit they got going on there.

#12 morganm 5/15/08 10:52:00 pm 0

You can download the python scripts one directory up and there's a Kos directory too. Probably just too what they consider to be partisan sites to test some kind of language algorithms or something, but it's kind of fun to see the actual code.

[Link: www.cs.cmu.edu...]

#13 pat 5/15/08 10:52:38 pm 0

Monitors
In
The
Classroom

P

#14 Opilio 5/15/08 10:52:56 pm 0

Hmmmm. URL includes "post similarity". And a bunch of lizard names: zombie, babbazee, alouette, cattt, doriangray, savage, mandy...

Most of the snippets on the page you linked to seem to originate from lionheart/BNP period.

#15 zombie 5/15/08 10:52:56 pm 0

Maybe this page explains something?

#16 Charles 5/15/08 10:53:46 pm 1

re: #10 Dan G.

Ta Da!

Here is the paper.

But that's an old paper from 2004. The comments in the first link above are much more recent.

#17 fluffy 5/15/08 10:53:48 pm 0

They use python, which is more lizard oriented than perl.

A quick look made me wonder if they were grabbing comments from the lounge.

#18 nextstopmars 5/15/08 10:53:55 pm 0

from readme on the /experiments page

Keep only the staff need to be on the web site.
most of the real experiment results should be kept in:
malbec.ark.cs.cmu.edu/~polit/


Obviously it's a KGB plot.

#19 Dan G. 5/15/08 10:54:38 pm 0

The Political Blogosphere and the 2004 U.S. Election:
Divided They Blog
4 March 2005

Abstract
In this paper, we study the linking patterns and discussion topics of political bloggers. Our aim is to measure the degree of interaction between liberal and conservative blogs, and to uncover any differences in the structure of the two communities. Specifically, we analyze the posts of 40 “A-list” blogs over the period of two months preceding the U.S. Presidential Election of 2004, to study how often they referred to one another and to quantify the overlap in the topics they discussed, both within the liberal and conservative communities, and also across communities. We also study a single day snapshot of over 1,000 political blogs. This snapshot captures blogrolls (the list of links to other blogs frequently found in sidebars), and presents a more static picture of a broader blogosphere. Most significantly, we find differences in the
behavior of liberal and conservative blogs, with conservative blogs linking to each other more frequently and in a denser pattern.

#20 stevieray 5/15/08 10:55:32 pm 0

re: #6 Render

[Link: www.cs.cmu.edu...]

A comparison of "post similarity" between LGF and Kos?

WEIRDNESS,
R

Close. I think they are doing a text analysis looking for Charles' sock puppets -- like the way some trolls accuse Robert Spencer of posting as Hugh on his website... a way to broadcast controversial opinions yet keep your hands clean.

#21 zombie 5/15/08 10:55:39 pm 1

re: #10 Dan G.

Ta Da!

Here is the paper.

OK, here's the Abstract:

Abstract
In this paper, we study the linking patterns and discussion topics of political bloggers.
Our aim is to measure the degree of interaction between liberal and conservative blogs, and
to uncover any differences in the structure of the two communities. Specifically, we analyze
the posts of 40 “A-list” blogs over the period of two months preceding the U.S. Presidential
Election of 2004, to study how often they referred to one another and to quantify the overlap in
the topics they discussed, both within the liberal and conservative communities, and also across
communities. We also study a single day snapshot of over 1,000 political blogs. This snapshot
captures blogrolls (the list of links to other blogs frequently found in sidebars), and presents
a more static picture of a broader blogosphere. Most significantly, we find differences in the
behavior of liberal and conservative blogs, with conservative blogs linking to each other more
frequently and in a denser pattern.

Talk about pointless.

People get PhDs for this crap.

#22 pat 5/15/08 10:56:03 pm 0

Mandy will not be pleased. May give rise to foul language. Warning.

#23 JustMyView 5/15/08 10:56:11 pm 0

re: #16 Charles

But that's an old paper from 2004. The comments in the first link above are much more recent.

Looks like they are doing another election-year study looking at patterns of argument or other aspects of communication.

#24 karmic_inquisitor 5/15/08 10:56:17 pm 0

It is a correlation engine.

Here is the dictionary - frequencies posted after each word.

[Link: www.cs.cmu.edu...]

Based on the use of the term "sim" in the naming structures and the structure of the input files, I guess the engine is being run through multiple simulations.

For what purpose I don't know.

#25 laZardo 5/15/08 10:56:31 pm 0

I'm not in that experiment?

DISCRMIANTOIN!111

/lunch tiem, bbl

#26 Charles 5/15/08 10:56:39 pm 0

I don't think that paper is for the current experiment.

It's years older than the comments at the first link above.

#27 Render 5/15/08 10:56:41 pm 0

Following Zombie...

===

[Link: www.cs.cmu.edu...]

"Keep only the staff need to be on the web site.
most of the real experiment results should be kept in:"
malbec.ark.cs.cmu.edu/~polit/

===

EH?,
R

#28 Dan G. 5/15/08 10:57:17 pm 0

re: #16 Charles

But that's an old paper from 2004. The comments in the first link above are much more recent.

Perhaps they are running the experiment again for this election season, or for some other event...

#29 justacanuck 5/15/08 10:57:17 pm 0

Smells like they are trolling the comments looking for sockpuppets - either sockpuppets here, or socks that post here and elsewhere.

Perhaps it's a project to gather information on the posters and compare for similarities to other posting identities elsewhere, with maybe an end-goal to reveal actual the identities of the commentators here.

Nothing like using software to do the job many of us have done by hand when we suspect a sock in amongst a forum we frequent.

In light of the language in their file names and directory structure, it seems like a formal request for them to declare their intent would be reasonable, IMHO, of course.

#30 The Other Les 5/15/08 10:57:43 pm 0

re: #21 zombie

Piled Higher and Deeper?

#31 song_and_dance_man 5/15/08 10:58:36 pm 0

re: #26 Charles

I don't think that paper is for the current experiment.

It's years older than the comments at the first link above.

True. I mused about getting a sock called song and wrath man in the past few months.

#32 Dan G. 5/15/08 10:59:01 pm 0

re: #21 zombie

Talk about pointless.

People get PhDs for this crap.

It's mostly about determining communications/relationships amongst people with similar politics... I'm sure it would find application in the advertising industry (read spam industry)

#33 Killian Bundy 5/15/08 10:59:13 pm 0

re: #10 Dan G.

Ta Da!

Here is the paper.

The Political Blogosphere and the 2004 U.S. Election:
Divided They Blog

/only problem is that the comments on that page are recent

#34 zombie 5/15/08 10:59:32 pm 0

Oooooh, look at this page.

#35 JustMyView 5/15/08 10:59:47 pm 0

re: #29 justacanuck

Smells like they are trolling the comments looking for sockpuppets - either sockpuppets here, or socks that post here and elsewhere.

Perhaps it's a project to gather information on the posters and compare for similarities to other posting identities elsewhere, with maybe an end-goal to reveal actual the identities of the commentators here.

Nothing like using software to do the job many of us have done by hand when we suspect a sock in amongst a forum we frequent.

In light of the language in their file names and directory structure, it seems like a formal request for them to declare their intent would be reasonable, IMHO, of course.

I doubt they're interested in individual identities. Sounds like they're interested in things like the nature of online networks (e.g., which blogs link to which other blogs). I'm sure they would respond to an inquiry about the purpose of their study.

#36 Opilio 5/15/08 10:59:47 pm 0

They're using pretty recent stuff. One of the input files references about 60 threads with numbers mostly in the 28000-28600 range.

#37 pat 5/15/08 11:00:44 pm 0

Experiment will not work. Different blogs have different rules. Michelle deletes me with unusual vigor. Others do not care. the subject matter and initial commentary are different, as are the posters. E.G. Michelle's are boring, it is much more stimulating here.

#38 zombie 5/15/08 11:01:16 pm 0

re: #34 zombie

Oooooh, look at this page.

Interesting quote:

Did some experiment on comment prediction with link-plsa-lda. Withheld the comment information of 204 users from the training set. Then computed log-likelihoods of their comments on each blog post (985 post * 204 user = 201105 data points) according to the trained model. 5678 positive (i.e., there in fact is a comment by the user on the post) and 195427 negative examples. Below is the results.

Comment prediction?

/Get out of my head!

#39 karmic_inquisitor 5/15/08 11:01:46 pm 0

Here is the python script to run the correlation engine

[Link: www.cs.cmu.edu...]

Time to go looking at the other blogs they are running....

#40 Dan G. 5/15/08 11:02:15 pm 0

Yes, they are using NEW comments, but likely doing the SAME thing since the paper etc.. are all in the same project directory "TP" ("Talking Points"). It could be the collection of additional data for the data set.

#42 The Other Les 5/15/08 11:02:59 pm 0

re: #38 zombie

Is this an attempt to pass the Turing Test?

#43 Charles 5/15/08 11:03:02 pm 3

Here's the paper:

[Link: www.cs.cmu.edu...]

#44 Yankee Division Son 5/15/08 11:03:19 pm 0

I'm no expert, but.. port sniffing?

#45 pat 5/15/08 11:03:20 pm 0

re: #40 Dan G.

That is the way I see it.

#46 stevieray 5/15/08 11:03:39 pm 0
#47 gymnast 5/15/08 11:03:40 pm 2

Who ever thought that garbage labeled as "research" could be so sophisticated and at the same time be so worthless. Shit like this gives PhDs a bad name for good reason. Polishing farts to get Piled higher and Deeper has become, for the most part, an ephemeral exercise in self deception.

#48 karmic_inquisitor 5/15/08 11:03:42 pm 1

re: #38 zombie

This isn't that advanced of stuff - we are working on a simillar concept for internal corporate communications. This has been around a while.

#49 pat 5/15/08 11:05:35 pm 1

What a bunch of BS. .....Fail

#50 slokat 5/15/08 11:05:50 pm 0

They are testing for similarity in posts? As in do we group think?

#51 karmic_inquisitor 5/15/08 11:05:53 pm 2

re: #43 Charles

Hey Charles - you have f'd up their model by introducing comment rating and spinoff links.

#52 justacanuck 5/15/08 11:06:42 pm 0

re: #21 zombie

Talk about pointless.

People get PhDs for this crap.

Perhaps its a repeat of his 2004 masterwork, (cough) now refined for the 2008 election season.

#53 Sharmuta 5/15/08 11:06:55 pm 0

Creepy!

#54 pat 5/15/08 11:07:02 pm 0

re: #51 karmic_inquisitor

Likely true. I spend most of my time on links researching now. So do others.

#55 The Other Les 5/15/08 11:07:12 pm 0

re: #49 pat

What a bunch of BS. .....Fail

Are they trying to figure out how we think?

#56 Dan G. 5/15/08 11:07:25 pm 1

re: #47 gymnast

Who ever thought that garbage labeled as "research" could be so sophisticated and at the same time be so worthless. Shit like this gives PhDs a bad name for good reason. Polishing farts to get Piled higher and Deeper has become, for the most part, an ephemeral exercise in self deception.

I don't think that this stuff is useless, it may be dated as #48 suggests, but automated compiliation of hot topics is rougly what google news is...

#57 slokat 5/15/08 11:07:26 pm 0

(reuter run the same story, LGF respond twice in the like mnnaer.)

#58 karmic_inquisitor 5/15/08 11:07:41 pm 0

re: #50 slokat

They are testing for similarity in posts? As in do we group think?

Every group does group think. The patterns are a bit disconcerting at first glance but then make sense when you think about them.

Most businesses use email to destroy original thought. I can prove that.

#59 song_and_dance_man 5/15/08 11:07:59 pm 0
#60 Killian Bundy 5/15/08 11:08:00 pm 0

EXPERIMENTATION WITHOUT REPRESENTATION!

/I demand compensation for being used as an unwilling test subject!

#61 Charles 5/15/08 11:08:08 pm 19

I wonder if linking to their experiment will introduce fatal feedback loops, leading to a disturbance in the Krell mind-field and possible space-time discontinuity?

#62 The Other Les 5/15/08 11:08:28 pm 0

re: #58 karmic_inquisitor

Every group does group think. The patterns are a bit disconcerting at first glance but then make sense when you think about them.

Most businesses use email to destroy original thought. I can prove that.

Sounds like a future Dilbert strip.

#63 pat 5/15/08 11:08:30 pm 0

re: #55 The Other Les

Yes. Exactly. And to identify cross posters. All they had to do was ask.

#64 JustMyView 5/15/08 11:09:03 pm 0

re: #44 Yankee Division Son

I'm no expert, but.. port sniffing?

What is port sniffing?

#65 zombie 5/15/08 11:09:17 pm 3

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

#66 laZardo 5/15/08 11:09:25 pm 0

re: #61 Charles

Maybe it'll start up the Large Hadron Collider and cause division by zero.

/OH SHI-

//actually goes to lunch nao

#67 Charles 5/15/08 11:09:42 pm 1

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

I know. It's wide open. Amazing.

#68 Dan G. 5/15/08 11:09:45 pm 1

re: #51 karmic_inquisitor

Hey Charles - you have f'd up their model by introducing comment rating and spinoff links.

I wonder if this thread screws up his project, if the subjects (i.e. us) know they're being experimented on it kind of screws up many experiements...

#69 karmic_inquisitor 5/15/08 11:09:49 pm 0

re: #64 JustMyView

What is port sniffing?

A technique to see what traffic is going over a network port. Used by hackers and network administrators alike.

#70 pat 5/15/08 11:10:04 pm 0

This program could identify sock puppets, but won't. Because it is a dead end. As Karmic says.

#71 Killian Bundy 5/15/08 11:10:26 pm 1

re: #64 JustMyView

What is port sniffing?

/turn your head to the left and breathe through your nose

#72 Dan G. 5/15/08 11:11:06 pm 0

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?


Computer Science

#73 The Other Les 5/15/08 11:11:14 pm 0

re: #61 Charles

I wonder if linking to their experiment will introduce fatal feedback loops, leading to a disturbance in the Krell mind-field and possible space-time discontinuity?

Um... didn't they blow up a planet that way?

#74 gymnast 5/15/08 11:11:24 pm 0

re: #56 Dan G.

I don't think that this stuff is useless, it may be dated as #48 suggests, but automated compiliation of hot topics is rougly what google news is...

And Google news has what qualities that you are willing to allow it to do your thinking for you? Are you contextually illiterate?

#75 haakondahl 5/15/08 11:11:25 pm 0

Keep only the staff need to be on the web site.
most of the real experiment results should be kept in:
malbec.ark.cs.cmu.edu/~polit/

#76 zombie 5/15/08 11:11:50 pm 5

re: #43 Charles

Here's the paper:

[Link: www.cs.cmu.edu...]

Abstract
In this work, we address the twin problems of unsupervised
topic discovery and estimation of topic specific influence of
blogs. We propose a new model that can be used to provide a
user with highly influential blog postings on the topic of the
user’s interest.

Idiots. Using computer algorithms to determine how to made a good blog post?

How about this for a clue-byte, you bozos: Have political acumen!

#77 pat 5/15/08 11:12:09 pm 0

Did pat say any thing snappy?

#78 JustMyView 5/15/08 11:12:16 pm 0

re: #56 Dan G.

I don't think that this stuff is useless, it may be dated as #48 suggests, but automated compiliation of hot topics is rougly what google news is...

Right. There are all sorts of reasons to be developing systems and tools to analyze large text databases. I have no idea what they're searching for, but it doesn't appear to be anything having to do w/ the content of the posts here.

#79 victor_yugo 5/15/08 11:12:19 pm 1

[Link: www.cs.cmu.edu...]

(This link WILL become invalid soon!)

It looks to my un-Pythonesque eye like a programmer is using the cosine function to create some kind of mapping on how comments refer to one another based on comment location in the thread; and looking at different types of postings from Charles (open thread, breaking news, news outlet criticisms) to see if there are any trends in the discussions that vary according to the posting type.

That has to be one of the worst run-on sentences I've ever written.

#80 Opilio 5/15/08 11:12:25 pm 0

From one of the files:

topic 003
3253 mccain
2526 huckabe
4497 romnei
4527 rudi
2229 gop
2194 giuliani
3368 mitt
2525 huck
5310 think
3323 mike

topic 006
2212 god
4350 religion
864 christian
445 believ
4352 religi
875 church
3412 mormon
1882 faith
4384 republican
302 atheist

#81 The Other Les 5/15/08 11:12:27 pm 0

re: #65 zombie

Man, I'm digging through their entire directory system, including trashed/deleted files. No restrictions.

Don't these people know how to protect their directories? Amateurish.

I mean, they're doing statistical analysis of LGF comments, and they don't know thing one about Internet security? I thought these people were PhD candidates in computer science or something. Maybe statistics? Political science?

At the level of dumb I'd have to say it was Political Science.

#82 bosforus 5/15/08 11:12:56 pm 0

who is lionheart?

#83 Dan G. 5/15/08 11:13:05 pm 1

re: #74 gymnast

And Google news has what qualities that you are willing to allow it to do your thinking for you? Are you contextually illiterate

Fetching news for me isn't thinking for me... "contextually illiterate"... define that please.