Invelos Forums->General: Website Discussion |
goodguy's Credit Lookup Plus |
Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Quoting GSyren: Quote: Quoting mediadogg:
Quote: I was hoping that you would offer to sanitize on the way in, then never have to worry about it for subsequent queries. That requires that the output is run through CltInfo before being used for other purposes. But maybe that's a limitation that we can live with. I think for now, CLTInfo should be viewed as the "go to" successor to CLTPlus. Anybody else who wants to use the raw XML will undoubtedly also have the skills to do the search. Maybe I can also add a checkbox (here we go again). The user can decide whether to suffer my search or leave it to the experts. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Out of the box: this would be helpful for me as well. Maybe I could "taint" the Profiler XML again (hey, once the virginity is gone ...) with a flag indicating a likely false candidate, such as:
- not a singleton (if it is a singleton, it is 100% sure to be good)
- different title from the original UPC (this still requires a simple XML parse and compare operation)
- different production date (I forgot - that is actually in the scrape data! So, another 100%: if the production date is bad, the profile is not good). This one, maybe I should do. No need to parse the XML, just a "GetProductionDate" method off the profile object that I get back from the online database. I do not know if the method makes another call to the database. I would hope that once the object is downloaded, the associated data has been cached.
Any other thoughts? The sanitizer would then only have to scan the flagged profiles to check if credits actually exist. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Had an idea that is bearing fruit. Since I must make at least one database call per scraped UPC, I am using as much as possible from that one call, trying to avoid any time spent in parsing the xml or making additional database calls.
(1) I get a UPC from the CLT scrape, but there is a small % chance it is not valid (explained in detail earlier)
(2) I make the database call to retrieve the profiles associated with the UPC
(3) In lieu of scanning the XML, I do the following quick comparisons with data already collected:
- if there is only one profile, by definition it's a match
- otherwise try a simple match on ( (title or title:edition) and production year ), which indicates a match
- if neither of those applies, then whether credits exist can only be determined by XML scan or database calls
I am finding only a very small percentage of profiles that need scanning, so maybe it is better if I just go ahead and do it by default, but allow the user to disable the validation step and just deliver all the profiles found by the database calls to the XML, which can result in a small percentage of profiles that actually have no credits for the variants in question. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
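The quick comparisons in step (3) could be sketched roughly as follows. This is only an illustration under assumptions: `Profile`, `Scraped`, and `quick_check` are hypothetical names, not the plugin's actual classes, and the comparison ignores case and whitespace, which the post does not specify.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Profile:
    # Hypothetical stand-in for a profile returned by the database call.
    title: str
    edition: str
    production_year: int


@dataclass
class Scraped:
    # Hypothetical stand-in for the data already collected from the CLT scrape.
    title: str
    production_year: int


def norm(text: str) -> str:
    # Assumption: compare titles case-insensitively and ignore all whitespace.
    return "".join(text.lower().split())


def quick_check(scraped: Scraped, profiles: List[Profile]) -> List[Optional[bool]]:
    """Return True for a quick match, None when an XML scan is still needed."""
    if len(profiles) == 1:
        # Only one profile for the UPC: a match by definition.
        return [True]
    results: List[Optional[bool]] = []
    for p in profiles:
        candidates = {norm(p.title)}
        if p.edition:
            candidates.add(norm(f"{p.title}:{p.edition}"))
        same_title = norm(scraped.title) in candidates
        same_year = p.production_year == scraped.production_year
        results.append(True if (same_title and same_year) else None)
    return results
```

Profiles that come back as `None` are the small remainder that would still need the XML scan or further database calls.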
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Actually XPath seems pretty fast. Just working out the details. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
| Registered: March 14, 2007 | Reputation: | Posts: 4,678 |
| Posted: | | | | I've been playing around a bit with XPath to see how easy it would be to create the common name statistics from the Profiler XML format. It turns out it works quite well. I realized that I could quite easily use it for either format, but it would be a whole lot easier if the CLTBoss output had Actors/Actor and Credits/Credit instead of CLTCredits/CLTCredit.
I don't suppose you would consider changing that, this late in the game? | | | My freeware tools for DVD Profiler users. Gunnar |
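As a rough illustration of counting names with path queries over `Actors/Actor` and `Credits/Credit`, here is a minimal Python sketch. The `Collection`/`DVD` wrapper elements and the sample data are assumptions made for the example; the attribute names follow the `Actor` element quoted later in this thread. Python's `xml.etree.ElementTree` supports only a subset of XPath, which is enough here.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Made-up sample; the Collection/DVD wrapper is an assumption.
SAMPLE = """
<Collection>
  <DVD>
    <Actors>
      <Actor FirstName="Melissa" MiddleName="" LastName="Locsin"/>
    </Actors>
  </DVD>
  <DVD>
    <Actors>
      <Actor FirstName="Melissa" MiddleName="" LastName="Locsin"/>
    </Actors>
    <Credits>
      <Credit FirstName="M." MiddleName="" LastName="Locsin"/>
    </Credits>
  </DVD>
</Collection>
"""


def name_counts(xml_text: str) -> Counter:
    """Count how often each (first, middle, last) name is credited."""
    root = ET.fromstring(xml_text)
    counts: Counter = Counter()
    # The same two paths serve cast and crew, which is why matching
    # element names in both XML formats keeps the query code shared.
    for path in (".//Actors/Actor", ".//Credits/Credit"):
        for node in root.findall(path):
            key = (
                node.get("FirstName", ""),
                node.get("MiddleName", ""),
                node.get("LastName", ""),
            )
            counts[key] += 1
    return counts
```

Calling `name_counts(SAMPLE).most_common(1)` gives the most frequently credited spelling in the sample.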
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Quoting GSyren: Quote: I've been playing around a bit with XPath to see how easy it would be to create the common name statistics from the Profiler XML format. It turns out it works quite well. I realized that I could quite easily use it for either format, but it would be a whole lot easier if the CLTBoss output had Actors/Actor and Credits/Credit instead of CLTCredits/CLTCredit.
I don't suppose you would consider changing that, this late in the game?
Of course. I had kind of changed my priorities to the Invelos format output, but I can make the CLTBoss format anything you want, as you are the primary customer. So OK, done. I am still perfecting the XPath statement that I use simply to cull out profiles with no credits for the Invelos output. I get a "go/no-go" decision and if go, I just grab the Invelos XML for output. No further formatting or parsing of the XML is required. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
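A go/no-go check of this kind might look like the sketch below. It is not the actual XPath statement from the plugin, which is not shown in the thread. The function name `has_credit_for` is hypothetical, names are assumed to match on first and last name regardless of case, and a plain loop stands in for an XPath attribute predicate so that names containing quotes need no escaping.

```python
import xml.etree.ElementTree as ET
from typing import Iterable, Tuple


def has_credit_for(profile_xml: str, variants: Iterable[Tuple[str, str]]) -> bool:
    """Return True ("go") if any cast or crew credit matches a variant name."""
    root = ET.fromstring(profile_xml)
    # Assumption: a variant is a (first name, last name) pair.
    wanted = {(first.lower(), last.lower()) for first, last in variants}
    for path in (".//Actors/Actor", ".//Credits/Credit"):
        for node in root.findall(path):
            name = (
                node.get("FirstName", "").lower(),
                node.get("LastName", "").lower(),
            )
            if name in wanted:
                return True
    return False
```

On a "go", the profile's XML can be passed through untouched, since the check only reads it.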
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Oh and just to be clear, Variants/Variant goes to Actors/Actor in both forms of XML output?
Sample that I will delete to reduce clutter, after you take a look.
Content deleted.
| | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: May 29, 2007 | Reputation: | Posts: 3,475 |
| | Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Quoting Kathy: Quote: . Hey Kathy, glad to see you monitoring this thread. Any thoughts? | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
| Registered: March 14, 2007 | Reputation: | Posts: 4,678 |
| Posted: | | | | OK, a couple of thoughts.
I would prefer if the variants went into a node with a name that isn't also used later, so perhaps Arguments/Argument, or something similar. And there seems no point in having FirstName, MiddleName and LastName there if they are not used as such (unless you have something else in mind for them). If you actually have them properly parsed, that would be really helpful.
And what I had in mind instead of CLTCredits/CLTCredit was Actors/Actor for cast credits and Credits/Credit for crew credits, just like in the Profiler output. Is that doable? | | | My freeware tools for DVD Profiler users. Gunnar |
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Quoting GSyren: Quote: OK, a couple of thoughts.
I would prefer if the variants went into a node with a name that isn't also used later, so perhaps Arguments/Argument, or something similar. And there seems no point in having FirstName, MiddleName and LastName there if they are not used as such (unless you have something else in mind for them). If you actually have them properly parsed, that would be really helpful.
And what I had in mind instead of CLTCredits/CLTCredit was Actors/Actor for cast credits and Credits/Credit for crew credits, just like in the Profiler output. Is that doable?
Oh, of course it is doable; actually I had forgotten they were different. I already made the change, but it was the same for both. Easy to fix. I agree about the variants, but I thought I was doing what you wanted. I'll leave it as variants then. If it needs to be different, then it already is (was - now I'll change it back). The names are properly parsed, and I have a class coded with methods that I can tweak if anybody alerts me to a problem. Edit: I should say that I attempt to properly parse them. If it doesn't happen, it is a bug. For example, if the user types "Queen Elizabeth" into the firstname field, the XML will have fn="Queen", mn="" and ln="Elizabeth". Have I tested all the ways a user could break my code? Probably not. My focus is on getting the CLT results correct at the moment. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
| Registered: March 14, 2007 | Reputation: | Posts: 4,678 |
| Posted: | | | | Just to be clear what I meant:
Quote: <Actor FirstName="melissa locsin" MiddleName="" LastName="" BirthYear="0" HashName="locsin_melissa_">melissa locsin</Actor>
In this example you put the whole name in FirstName. I would have expected:
<Actor FirstName="melissa" MiddleName="" LastName="locsin" BirthYear="0" HashName="locsin_melissa_">melissa locsin</Actor>
(And Variant instead of Actor, of course) | | | My freeware tools for DVD Profiler users. Gunnar |
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Quoting GSyren: Quote: Just to be clear what I meant:
Quote: <Actor FirstName="melissa locsin" MiddleName="" LastName="" BirthYear="0" HashName="locsin_melissa_">melissa locsin</Actor> In this example you put the whole name in FirstName. I would have expected:
<Actor FirstName="melissa" MiddleName="" LastName="locsin" BirthYear="0" HashName="locsin_melissa_">melissa locsin</Actor>
(And Variant instead of Actor, of course) That was not intended. I thought I had that covered, as my prior post indicates. Consider it a bug. My code has been through so many iterations, there was probably a time that it worked. I think my error is that for the XML, I am pulling the fields from the grid. Instead, all I need to do is populate an instance of my Actor class, and it will "automatically" fix the parsing. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
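The idea of letting a name class do the parsing, rather than copying raw grid fields into the XML, could be sketched as below. This `Actor` class is a hypothetical stand-in, not the plugin's real class. The splitting rule (re-split only when the whole name was typed into the first-name field) is an assumption based on the "Queen Elizabeth" and "melissa locsin" examples, and `BirthYear` and `HashName` are left out because their rules are not described in the thread.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class Actor:
    first: str
    middle: str
    last: str

    @classmethod
    def parse(cls, first: str = "", middle: str = "", last: str = "") -> "Actor":
        first, middle, last = first.strip(), middle.strip(), last.strip()
        words = first.split()
        # Assumption: re-split only when everything was typed into the
        # first-name field; otherwise trust the fields as entered.
        if not middle and not last and len(words) > 1:
            return cls(words[0], " ".join(words[1:-1]), words[-1])
        return cls(first, middle, last)

    def to_element(self, tag: str = "Actor") -> ET.Element:
        # Build the element from the parsed fields, never from raw input.
        el = ET.Element(
            tag, FirstName=self.first, MiddleName=self.middle, LastName=self.last
        )
        el.text = " ".join(part for part in (self.first, self.middle, self.last) if part)
        return el
```

For example, `ET.tostring(Actor.parse("melissa locsin").to_element("Variant"), encoding="unicode")` serializes a `Variant` element with the name split across `FirstName` and `LastName`.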
| Registered: May 29, 2007 | Reputation: | Posts: 3,475 |
| Posted: | | | | Quoting mediadogg: Quote: Quoting Kathy:
Quote: . Hey Kathy, glad to see you monitoring this thread. Any thoughts? None. I have no idea what any of this is about. I just posted the period because the data took up so much of my screen. |
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | Quoting Kathy: Quote: Quoting mediadogg:
Quote: Quoting Kathy:
Quote: . Hey Kathy, glad to see you monitoring this thread. Any thoughts?
None. I have no idea what any of this is about.
I just posted the period because the data took up so much of my screen. Yeah, I don't usually do that. I will be deleting it soon. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
| Registered: March 18, 2007 | Reputation: | Posts: 6,461 |
| Posted: | | | | OK, I replaced the sample with what I hope is the "final" CLTBoss XML format. I remember a prior discussion where we had the choice between a different element name for Cast vs. Crew and a distinguishing attribute. I think at one time I had both implemented, and I believe at that time you preferred the attribute. So I didn't change that, giving us a pause to reconsider why and whether we want to change the prior choice. (If it matters, I prefer the single element type with a distinguishing attribute.) | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |