Welcome to the Invelos forums. Please read the forum rules before posting.

Read access to our public forums is open to everyone. To post messages, a free registration is required.

If you have an Invelos account, sign in to post.

    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1... 11 12 13  Previous   Next
Can we ever stop copying from IMDb?
Author Message
DVD Profiler Unlimited RegistrantStar ContributorT!M
Profiling since Dec. 2000
Registered: March 13, 2007
Reputation: Highest Rating
Netherlands Posts: 8,736
Posted:
PM this userDirect link to this postReply with quote
Quoting surfeur51:
Quote:
Quoting T!M:
Quote:

It does not. Not at all. Virtually every name that consists of more than two parts have separate, non-linking entries for the same person in the database.


And virtually every accented name have separate, non-linking entries for the same person in the database. For you, one (parsing) is a big problem, and the second (accents) is not, as you state in the Contribtion Rule Committee forum. I hardly see the difference.

Let me then, for the umpteenth time, point out that difference. The difference is that for accents, we've got a crystal clear rule with which such differences can be cleaned up. For parsing, we have no rules at all: it's anything goes. Even when we discuss a practical example in the forums, the results pretty much invariably lead to something of a tie. Differences in the use of accents can be easily fixed, because there's a rule to refer to that tells us how to handle them, while differences in parsing cannot be "fixed", because there's no rule to refer to that tells us what is "correct". That's the difference.
DVD Profiler Unlimited RegistrantStar ContributorT!M
Profiling since Dec. 2000
Registered: March 13, 2007
Reputation: Highest Rating
Netherlands Posts: 8,736
Posted:
PM this userDirect link to this postReply with quote
Quoting Kathy:
Quote:
I've read these discussions many times over the years. I understand and can appreciate both sides of the issue.

Same here.

Quote:
I clearly see what each side has to say - I don't think those points to be brought to the table any more.

True.

Quote:
But, and please correct me if I'm wrong, it seems that the solution is not in our hands.

That depends what solution you're looking for. Within the constraints of the program as it stands today, Ken already provided a solution. Keep in mind that this discussion is not about a new problem that needs to be fixed, it's about a problem that Ken already provided a solution for - a solution that someone isn't happy with. That doesn't make the discussion any less worth having, I'm not saying that, but we do need to see it for what it is.

Quote:
Because the only solutions presented here that has the ability to satisfy both sides of this issue seems to require an overhaul of Ken's database.

True. Within the constraints of the program we have now, I feel that Ken has already made the one and only right decision. Wat Yves wants, can never work within the current system, which is why his request boils down to the wish for a complete program overhaul. I could support that as well, but it's something entirely different than tinkering with the rules for the current system. That, IMHO, can never work, while I do see possibilities with a complete program overhaul.

Quote:
Until that happens, the continual rehashing of the same arguments seems to me to be a waste of time and energy.

It is indeed.

Quote:
Is there a solution that satisfies both sides?

No. Certainly not without a complete program overhaul.

Quote:
Try and come up with one looking at it from others point of view.

I've been trying to look at the "bigger picture" from day one - years ago. Others, unfortunately, seem to be stuck on a few specific examples in their own profiles that they'd like to be able to handle differently, and tend to forget about the ramifications for the half a million other profiles in the database.

Quote:
I know what everyone wants - please work on the solution.

Again: there isn't one - except the solution that Ken already gave us, of course. Within the constraints of the current program, it can't be done - not succesfully, that is - any other way, IMHO.
 Last edited: by T!M
DVD Profiler Desktop and Mobile RegistrantStar ContributorTaro
Registered: February 23, 2009
Reputation: High Rating
Belgium Posts: 1,580
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Kathy:
Quote:
Why do I say this? Because the only solutions presented here that has the ability to satisfy both sides of this issue seems to require an overhaul of Ken's database.

Until that happens, the continual rehashing of the same arguments seems to me to be a waste of time and energy.

Agreed. If everyone wants a solution and a solution that not only works for them but also for other users, then I think the sooner we as a community plead Invelos with one voice, the better
Blu-ray collection
DVD collection
My Games
My Trophies
DVD Profiler Desktop and Mobile RegistrantStar ContributorTheMadMartian
Alien with an attitude
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 13,202
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting RHo:
Quote:
Quoting CharlieM:
Quote:
And exactly how do you propose to issue a rule that can be consistently applied by all people across  "50 different languages"?

The same way we do it for title capitalisation or name parsing? Use your knowledge and document changes.

The problem with this solution is, parsing and capitalization are ignored by the CLT.  The CLT uses the 'credited as' name, which has no parsing.  FRANCOIS, Francois, francois and FrAnCoIs are all treated as the same name by the CLT.  Francois and François, for reasons only Ken knows, are not.
No dictator, no invader can hold an imprisoned population by force of arms forever.
There is no greater power in the universe than the need for freedom.
Against this power, governments and tyrants and armies cannot stand.
The Centauri learned this lesson once.
We will teach it to them again.
Though it take a thousand years, we will be free.
- Citizen G'Kar
DVD Profiler Desktop and Mobile RegistrantStar ContributorDJ Doena
Registered: May 1, 2002
Registered: March 14, 2007
Reputation: Highest Rating
Germany Posts: 6,744
Posted:
PM this userEmail this userVisit this user's homepageView this user's DVD collectionDirect link to this postReply with quote
Quoting TheMadMartian:
Quote:
Francois and François, for reasons only Ken knows, are not.


A lower-case upper-case conversion functions comes with every current programming language, just as a comparer that can ignore casing.

But to recognize that C = ç needs a comparer based on a specific culture (French, Spanish). For example, in Russian what looks like a c and a p are actually an s and an r, hence CCCP = SSSR (Soyuz Sovetskikh Sotsialisticheskikh Respublik) -> USSR (Union of Soviet Socialist Republics).

It's just very complicated and nearly impossible to program when you have to consider all languages at the same time.
Karsten
DVD Collectors Online

 Last edited: by DJ Doena
DVD Profiler Unlimited RegistrantStar Contributorsurfeur51
Since July 3, 2003
Registered: March 29, 2007
Reputation: Great Rating
France Posts: 4,479
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Kathy:
Quote:

I know what everyone wants - please work on the solution.


Antolod proposed two texts in the Rules Committee forum, that allow people non familiar to other languages to contribute without any problem, and also allow the correct spelling of names. I can agree with both of them, but nobody, except me, gave his opinion on those texts.
Images from movies
DVD Profiler Unlimited RegistrantStar ContributorKathy
Registered: May 29, 2007
Reputation: Highest Rating
United States Posts: 3,475
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting surfeur51:
Quote:
Quoting Kathy:
Quote:

I know what everyone wants - please work on the solution.


Antolod proposed two texts in the Rules Committee forum, that allow people non familiar to other languages to contribute without any problem, and also allow the correct spelling of names. I can agree with both of them, but nobody, except me, gave his opinion on those texts.


I do not understand texts or anything to do with programing and therefore can not address those issues.

But, if these suggestions solve the problem, give them to Ken to assess and make a decision.

Either he will implement them or he will ignore them. In either case the community will have the answer to the question.
DVD Profiler Unlimited RegistrantStar ContributorRHo
Registered: March 13, 2007
Posts: 2,759
Posted:
PM this userDirect link to this postReply with quote
Quoting TheMadMartian:
Quote:
Quoting RHo:
Quote:
Quoting CharlieM:
Quote:
And exactly how do you propose to issue a rule that can be consistently applied by all people across  "50 different languages"?

The same way we do it for title capitalisation or name parsing? Use your knowledge and document changes.

The problem with this solution is, parsing and capitalization are ignored by the CLT.  The CLT uses the 'credited as' name, which has no parsing.  FRANCOIS, Francois, francois and FrAnCoIs are all treated as the same name by the CLT.  Francois and François, for reasons only Ken knows, are not.

The CLT could be changed to ignore accents. But IMO the linking system is broken beyond the CLT.
DVD Profiler Unlimited RegistrantStar ContributorTracer
Registered: March 13, 2007
United States Posts: 951
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting RHo:
Quote:
The CLT could be changed to ignore accents. But IMO the linking system is broken beyond the CLT.


I mentioned this earlier but, the more I think of it the more it may work with the way Invelos currently does linking. 

When a profile is uploaded with the name of François it is automatically given the common name of Francois with the credited as François.  That way it wouldn't matter what country the profile is from or where the actor is from.  The credited as name would still display however it is either entered by the language standards from that region.  The common name would just be there to handle linking.

This would in no way solve any of the other linking issues we have now but, given I don't think Ken is going to change his position on the existing linking system it may be the best all around solution.
Are you local?
This is a local shop the strangers you would bring would not understand us, our customs, our local ways.
DVD Profiler Unlimited RegistrantStar ContributorRHo
Registered: March 13, 2007
Posts: 2,759
Posted:
PM this userDirect link to this postReply with quote
Quoting DJ Doena:
Quote:
Quoting TheMadMartian:
Quote:
Francois and François, for reasons only Ken knows, are not.


A lower-case upper-case conversion functions comes with every current programming language, just as a comparer that can ignore casing.

But to recognize that C = ç needs a comparer based on a specific culture (French, Spanish). For example, in Russian what looks like a c and a p are actually an s and an r, hence CCCP = SSSR (Soyuz Sovetskikh Sotsialisticheskikh Respublik) -> USSR (Union of Soviet Socialist Republics).

It's just very complicated and nearly impossible to program when you have to consider all languages at the same time.

The Cyrillic alphabet is beyond the scope of profiler. But for Latin characters a comparison function which ignores accents is easy to implement.

P.S. Even if profiler would support Cyrillic characters via Unicode, your example would be wrong, because the Cyrillic characters are different to the Latin ones even if they have the same look.
DVD Profiler Desktop and Mobile RegistrantStar ContributorTheMadMartian
Alien with an attitude
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 13,202
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Kathy:
Quote:
Quoting surfeur51:
Quote:
Quoting Kathy:
Quote:

I know what everyone wants - please work on the solution.


Antolod proposed two texts in the Rules Committee forum, that allow people non familiar to other languages to contribute without any problem, and also allow the correct spelling of names. I can agree with both of them, but nobody, except me, gave his opinion on those texts.


I do not understand texts or anything to do with programing and therefore can not address those issues.

But, if these suggestions solve the problem, give them to Ken to assess and make a decision.

Either he will implement them or he will ignore them. In either case the community will have the answer to the question.

Ken was given these options.  More than a few of us agreed with them.  In fact, as surfeur often points out, Gerri agreed with them.  Ken, however, did not and, rather quickly, came in and went the other way.  That is the main reason, at least in my opinion, that people don't bother discussing a rule change...because Ken has already made the decision.
No dictator, no invader can hold an imprisoned population by force of arms forever.
There is no greater power in the universe than the need for freedom.
Against this power, governments and tyrants and armies cannot stand.
The Centauri learned this lesson once.
We will teach it to them again.
Though it take a thousand years, we will be free.
- Citizen G'Kar
DVD Profiler Unlimited RegistrantStar ContributorKathy
Registered: May 29, 2007
Reputation: Highest Rating
United States Posts: 3,475
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting TheMadMartian:
Quote:

Ken was given these options.  More than a few of us agreed with them.  In fact, as surfeur often points out, Gerri agreed with them.  Ken, however, did not and, rather quickly, came in and went the other way.  That is the main reason, at least in my opinion, that people don't bother discussing a rule change...because Ken has already made the decision.


Then, in my opinion, the matter is settled.
DVD Profiler Desktop and Mobile RegistrantStar ContributorTheMadMartian
Alien with an attitude
Registered: March 13, 2007
Reputation: Highest Rating
United States Posts: 13,202
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Quoting RHo:
Quote:
The CLT could be changed to ignore accents. But IMO the linking system is broken beyond the CLT.

As I am not a programmer, I will have to take your word for it.  As for the linking system being broken beyond the CLT, you are correct.  While I applaud the 20 or so people that are trying to get it fixed, unless they are the only ones contributing, they are doom to the same fate as Sisyphus.
No dictator, no invader can hold an imprisoned population by force of arms forever.
There is no greater power in the universe than the need for freedom.
Against this power, governments and tyrants and armies cannot stand.
The Centauri learned this lesson once.
We will teach it to them again.
Though it take a thousand years, we will be free.
- Citizen G'Kar
DVD Profiler Unlimited RegistrantStar ContributorVirusPil
uncredited
Registered: January 1, 2009
Reputation: Highest Rating
Germany Posts: 3,087
Posted:
PM this userDirect link to this postReply with quote
Quoting surfeur51:
Quote:
Quoting T!M:
Quote:

It does not. Not at all. Virtually every name that consists of more than two parts have separate, non-linking entries for the same person in the database.


And virtually every accented name have separate, non-linking entries for the same person in the database. ...


Just to be complete, the problem is not just for accented names, it's the same with:
- names with different used spellings
- names with middle initials
- names with Nickname parts
...

Consequence is for all the same.
    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1... 11 12 13  Previous   Next