|
|
Welcome to the Invelos forums. Please read the forum
rules before posting.
Read access to our public forums is open to everyone. To post messages, a free
registration is required.
If you have an Invelos account, sign in to post.
|
|
|
|
Invelos Forums->DVD Profiler: Contribution Discussion |
Page:
1... 11 12 13 Previous Next
|
Can we ever stop copying from IMDb? |
|
|
|
Author |
Message |
| T!M | Profiling since Dec. 2000 |
Registered: March 13, 2007 | Reputation: | Posts: 8,736 |
| Posted: | | | | Quoting surfeur51: Quote: Quoting T!M:
Quote:
It does not. Not at all. Virtually every name that consists of more than two parts have separate, non-linking entries for the same person in the database.
And virtually every accented name have separate, non-linking entries for the same person in the database. For you, one (parsing) is a big problem, and the second (accents) is not, as you state in the Contribtion Rule Committee forum. I hardly see the difference. Let me then, for the umpteenth time, point out that difference. The difference is that for accents, we've got a crystal clear rule with which such differences can be cleaned up. For parsing, we have no rules at all: it's anything goes. Even when we discuss a practical example in the forums, the results pretty much invariably lead to something of a tie. Differences in the use of accents can be easily fixed, because there's a rule to refer to that tells us how to handle them, while differences in parsing cannot be "fixed", because there's no rule to refer to that tells us what is "correct". That's the difference. |
| | T!M | Profiling since Dec. 2000 |
Registered: March 13, 2007 | Reputation: | Posts: 8,736 |
| Posted: | | | | Quoting Kathy: Quote: I've read these discussions many times over the years. I understand and can appreciate both sides of the issue. Same here. Quote: I clearly see what each side has to say - I don't think those points to be brought to the table any more. True. Quote: But, and please correct me if I'm wrong, it seems that the solution is not in our hands. That depends what solution you're looking for. Within the constraints of the program as it stands today, Ken already provided a solution. Keep in mind that this discussion is not about a new problem that needs to be fixed, it's about a problem that Ken already provided a solution for - a solution that someone isn't happy with. That doesn't make the discussion any less worth having, I'm not saying that, but we do need to see it for what it is. Quote: Because the only solutions presented here that has the ability to satisfy both sides of this issue seems to require an overhaul of Ken's database. True. Within the constraints of the program we have now, I feel that Ken has already made the one and only right decision. Wat Yves wants, can never work within the current system, which is why his request boils down to the wish for a complete program overhaul. I could support that as well, but it's something entirely different than tinkering with the rules for the current system. That, IMHO, can never work, while I do see possibilities with a complete program overhaul. Quote: Until that happens, the continual rehashing of the same arguments seems to me to be a waste of time and energy. It is indeed. Quote: Is there a solution that satisfies both sides? No. Certainly not without a complete program overhaul. Quote: Try and come up with one looking at it from others point of view. I've been trying to look at the "bigger picture" from day one - years ago. Others, unfortunately, seem to be stuck on a few specific examples in their own profiles that they'd like to be able to handle differently, and tend to forget about the ramifications for the half a million other profiles in the database. Quote: I know what everyone wants - please work on the solution. Again: there isn't one - except the solution that Ken already gave us, of course. Within the constraints of the current program, it can't be done - not succesfully, that is - any other way, IMHO. | | | Last edited: by T!M |
| Registered: February 23, 2009 | Reputation: | Posts: 1,580 |
| Posted: | | | | Quoting Kathy: Quote: Why do I say this? Because the only solutions presented here that has the ability to satisfy both sides of this issue seems to require an overhaul of Ken's database.
Until that happens, the continual rehashing of the same arguments seems to me to be a waste of time and energy. Agreed. If everyone wants a solution and a solution that not only works for them but also for other users, then I think the sooner we as a community plead Invelos with one voice, the better | | | Blu-ray collection DVD collection My Games My Trophies |
| Registered: March 13, 2007 | Reputation: | Posts: 13,202 |
| Posted: | | | | Quoting RHo: Quote: Quoting CharlieM:
Quote: And exactly how do you propose to issue a rule that can be consistently applied by all people across "50 different languages"? The same way we do it for title capitalisation or name parsing? Use your knowledge and document changes. The problem with this solution is, parsing and capitalization are ignored by the CLT. The CLT uses the 'credited as' name, which has no parsing. FRANCOIS, Francois, francois and FrAnCoIs are all treated as the same name by the CLT. Francois and François, for reasons only Ken knows, are not. | | | No dictator, no invader can hold an imprisoned population by force of arms forever. There is no greater power in the universe than the need for freedom. Against this power, governments and tyrants and armies cannot stand. The Centauri learned this lesson once. We will teach it to them again. Though it take a thousand years, we will be free. - Citizen G'Kar |
| Registered: March 14, 2007 | Reputation: | Posts: 6,744 |
| Posted: | | | | Quoting TheMadMartian: Quote: Francois and François, for reasons only Ken knows, are not. A lower-case upper-case conversion functions comes with every current programming language, just as a comparer that can ignore casing. But to recognize that C = ç needs a comparer based on a specific culture (French, Spanish). For example, in Russian what looks like a c and a p are actually an s and an r, hence CCCP = SSSR (Soyuz Sovetskikh Sotsialisticheskikh Respublik) -> USSR (Union of Soviet Socialist Republics). It's just very complicated and nearly impossible to program when you have to consider all languages at the same time. | | | Karsten DVD Collectors Online
| | | Last edited: by DJ Doena |
| Registered: March 29, 2007 | Reputation: | Posts: 4,479 |
| Posted: | | | | Quoting Kathy: Quote:
I know what everyone wants - please work on the solution. Antolod proposed two texts in the Rules Committee forum, that allow people non familiar to other languages to contribute without any problem, and also allow the correct spelling of names. I can agree with both of them, but nobody, except me, gave his opinion on those texts. | | | Images from movies |
| Registered: May 29, 2007 | Reputation: | Posts: 3,475 |
| Posted: | | | | Quoting surfeur51: Quote: Quoting Kathy:
Quote:
I know what everyone wants - please work on the solution.
Antolod proposed two texts in the Rules Committee forum, that allow people non familiar to other languages to contribute without any problem, and also allow the correct spelling of names. I can agree with both of them, but nobody, except me, gave his opinion on those texts. I do not understand texts or anything to do with programing and therefore can not address those issues. But, if these suggestions solve the problem, give them to Ken to assess and make a decision. Either he will implement them or he will ignore them. In either case the community will have the answer to the question. |
| Registered: March 13, 2007 | Posts: 2,759 |
| Posted: | | | | Quoting TheMadMartian: Quote: Quoting RHo:
Quote: Quoting CharlieM:
Quote: And exactly how do you propose to issue a rule that can be consistently applied by all people across "50 different languages"? The same way we do it for title capitalisation or name parsing? Use your knowledge and document changes. The problem with this solution is, parsing and capitalization are ignored by the CLT. The CLT uses the 'credited as' name, which has no parsing. FRANCOIS, Francois, francois and FrAnCoIs are all treated as the same name by the CLT. Francois and François, for reasons only Ken knows, are not. The CLT could be changed to ignore accents. But IMO the linking system is broken beyond the CLT. |
| Registered: March 13, 2007 | Posts: 951 |
| Posted: | | | | Quoting RHo: Quote: The CLT could be changed to ignore accents. But IMO the linking system is broken beyond the CLT. I mentioned this earlier but, the more I think of it the more it may work with the way Invelos currently does linking. When a profile is uploaded with the name of François it is automatically given the common name of Francois with the credited as François. That way it wouldn't matter what country the profile is from or where the actor is from. The credited as name would still display however it is either entered by the language standards from that region. The common name would just be there to handle linking. This would in no way solve any of the other linking issues we have now but, given I don't think Ken is going to change his position on the existing linking system it may be the best all around solution. | | | Are you local? This is a local shop the strangers you would bring would not understand us, our customs, our local ways. |
| Registered: March 13, 2007 | Posts: 2,759 |
| Posted: | | | | Quoting DJ Doena: Quote: Quoting TheMadMartian:
Quote: Francois and François, for reasons only Ken knows, are not.
A lower-case upper-case conversion functions comes with every current programming language, just as a comparer that can ignore casing.
But to recognize that C = ç needs a comparer based on a specific culture (French, Spanish). For example, in Russian what looks like a c and a p are actually an s and an r, hence CCCP = SSSR (Soyuz Sovetskikh Sotsialisticheskikh Respublik) -> USSR (Union of Soviet Socialist Republics).
It's just very complicated and nearly impossible to program when you have to consider all languages at the same time. The Cyrillic alphabet is beyond the scope of profiler. But for Latin characters a comparison function which ignores accents is easy to implement. P.S. Even if profiler would support Cyrillic characters via Unicode, your example would be wrong, because the Cyrillic characters are different to the Latin ones even if they have the same look. |
| Registered: March 13, 2007 | Reputation: | Posts: 13,202 |
| Posted: | | | | Quoting Kathy: Quote: Quoting surfeur51:
Quote: Quoting Kathy:
Quote:
I know what everyone wants - please work on the solution.
Antolod proposed two texts in the Rules Committee forum, that allow people non familiar to other languages to contribute without any problem, and also allow the correct spelling of names. I can agree with both of them, but nobody, except me, gave his opinion on those texts.
I do not understand texts or anything to do with programing and therefore can not address those issues.
But, if these suggestions solve the problem, give them to Ken to assess and make a decision.
Either he will implement them or he will ignore them. In either case the community will have the answer to the question. Ken was given these options. More than a few of us agreed with them. In fact, as surfeur often points out, Gerri agreed with them. Ken, however, did not and, rather quickly, came in and went the other way. That is the main reason, at least in my opinion, that people don't bother discussing a rule change...because Ken has already made the decision. | | | No dictator, no invader can hold an imprisoned population by force of arms forever. There is no greater power in the universe than the need for freedom. Against this power, governments and tyrants and armies cannot stand. The Centauri learned this lesson once. We will teach it to them again. Though it take a thousand years, we will be free. - Citizen G'Kar |
| Registered: May 29, 2007 | Reputation: | Posts: 3,475 |
| Posted: | | | | Quoting TheMadMartian: Quote:
Ken was given these options. More than a few of us agreed with them. In fact, as surfeur often points out, Gerri agreed with them. Ken, however, did not and, rather quickly, came in and went the other way. That is the main reason, at least in my opinion, that people don't bother discussing a rule change...because Ken has already made the decision. Then, in my opinion, the matter is settled. |
| Registered: March 13, 2007 | Reputation: | Posts: 13,202 |
| Posted: | | | | Quoting RHo: Quote: The CLT could be changed to ignore accents. But IMO the linking system is broken beyond the CLT. As I am not a programmer, I will have to take your word for it. As for the linking system being broken beyond the CLT, you are correct. While I applaud the 20 or so people that are trying to get it fixed, unless they are the only ones contributing, they are doom to the same fate as Sisyphus. | | | No dictator, no invader can hold an imprisoned population by force of arms forever. There is no greater power in the universe than the need for freedom. Against this power, governments and tyrants and armies cannot stand. The Centauri learned this lesson once. We will teach it to them again. Though it take a thousand years, we will be free. - Citizen G'Kar |
| Registered: January 1, 2009 | Reputation: | Posts: 3,087 |
| Posted: | | | | Quoting surfeur51: Quote: Quoting T!M:
Quote:
It does not. Not at all. Virtually every name that consists of more than two parts have separate, non-linking entries for the same person in the database.
And virtually every accented name have separate, non-linking entries for the same person in the database. ... Just to be complete, the problem is not just for accented names, it's the same with: - names with different used spellings - names with middle initials - names with Nickname parts ... Consequence is for all the same. |
|
|
Invelos Forums->DVD Profiler: Contribution Discussion |
Page:
1... 11 12 13 Previous Next
|
|
|
|
|
|
|
|
|