![]() |
.. (לתיקייה המכילה) | |
Should the path to the dblp.xml document be a file or a url?
|
You should download the file manually to your computer. The path given to the program is a path in the local file system. No internet access is required for this program. |
When parsing the large.xml document I get an error from the
parsing framework which does not seem to be related to my code in anyway.
|
Pass the following parameter to the VM: -DentityExpansionLimit=1000000 |
Can we assume the xml file is valid?
| Yes, you may. |
What should we do with publications that have no year field or multiple year fields?
| Skip those with no year field and when there are many, just take the first one. |
How should we format the output?
|
Each line should correspond to an author and each field of a line should be separated by a space and they should appear in the same order as specified in the assignment -- i.e., author name, total number of papers, average number of papers per year, etc. |
how should we calculate the average number of papers per year
(since the first publication of this author) field?
|
Yes, the average refers to the years since the first listed publication to the last. That is, if an author's first publication was in 1975 and the last was 1990, the average should be calculated between these years. |
should years with no publication between the first and last years of
publication be treated as years with 0 publication?
| Yes, that's correct. |
How many decimal digits do we need for average and other double values?
| 1 is enough. |
Does the regular expression also refer to which coauthors we should include?
|
No -- the regex only refers to the author lines you will display -- not the information each line contains. |
Can we increase the memory of the JVM?
| Yes, you may -- just, please add these instructions in your readme file |
How should we round the floating point values?
|
As in the following examples: 1.65 --> 1.7 1.649 --> 1.6 10 --> 10.0 |
What should be the order of authors we output?
| Same order as they appear in the XML document. |

