<?xml version="1.0" encoding="utf-8"?>
<!-- generator="FeedCreator 1.7.2-ppt DokuWiki" -->
<?xml-stylesheet href="http://tomasrohr.org/wiki/lib/exe/css.php?s=feed" type="text/css"?>
<rdf:RDF
    xmlns="http://purl.org/rss/1.0/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
    xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel rdf:about="http://tomasrohr.org/wiki/feed.php">
        <title>Tomasrohr.org blog</title>
        <description></description>
        <link>http://tomasrohr.org/wiki/</link>
        <image rdf:resource="http://tomasrohr.org/wiki/lib/tpl/mnml-blog/images/favicon.ico" />
       <dc:date>2026-05-03T11:19:10+02:00</dc:date>
        <items>
            <rdf:Seq>
                <rdf:li rdf:resource="http://tomasrohr.org/wiki/doku.php?id=blog:apache_kafka_tips_1_-_topics&amp;rev=1511352716&amp;do=diff"/>
                <rdf:li rdf:resource="http://tomasrohr.org/wiki/doku.php?id=blog:incremental_insert_in_hive_and_impala&amp;rev=1553864597&amp;do=diff"/>
                <rdf:li rdf:resource="http://tomasrohr.org/wiki/doku.php?id=blog:merge_in_hive&amp;rev=1553864560&amp;do=diff"/>
            </rdf:Seq>
        </items>
    </channel>
    <image rdf:about="http://tomasrohr.org/wiki/lib/tpl/mnml-blog/images/favicon.ico">
        <title>Tomasrohr.org</title>
        <link>http://tomasrohr.org/wiki/</link>
        <url>http://tomasrohr.org/wiki/lib/tpl/mnml-blog/images/favicon.ico</url>
    </image>
    <item rdf:about="http://tomasrohr.org/wiki/doku.php?id=blog:apache_kafka_tips_1_-_topics&amp;rev=1511352716&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2017-11-22T13:11:56+02:00</dc:date>
        <title>blog:apache_kafka_tips_1_-_topics</title>
        <link>http://tomasrohr.org/wiki/doku.php?id=blog:apache_kafka_tips_1_-_topics&amp;rev=1511352716&amp;do=diff</link>
        <description>Apache Kafka Tips #1 - Topics

1) Pojmenování topiků
Název musí popisovat data uvnitř. Jeden topik = jedna entita datového modelu (zákazník,  metriky z měřidel z daného okamžiku, zařízení, lokace) Též topik = jeden druh logických událostí (naměřené hodnoty z měřidel, události čidel (otevřené okno, atd), alerty (např. čidlo mimo provoz, otevření dveří po půlnoci) )</description>
    </item>
    <item rdf:about="http://tomasrohr.org/wiki/doku.php?id=blog:incremental_insert_in_hive_and_impala&amp;rev=1553864597&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2019-03-29T14:03:17+02:00</dc:date>
        <title>blog:incremental_insert_in_hive_and_impala</title>
        <link>http://tomasrohr.org/wiki/doku.php?id=blog:incremental_insert_in_hive_and_impala&amp;rev=1553864597&amp;do=diff</link>
        <description>Incremental insert in Hive and Impala

Task: To insert data set into a table, but if a record already exists in the target table, it must not be inserted. And you must overwrite whole partitions (which is a good practice on Hive).
The source data set may be a table or a query result.</description>
    </item>
    <item rdf:about="http://tomasrohr.org/wiki/doku.php?id=blog:merge_in_hive&amp;rev=1553864560&amp;do=diff">
        <dc:format>text/html</dc:format>
        <dc:date>2019-03-29T14:02:40+02:00</dc:date>
        <title>blog:merge_in_hive</title>
        <link>http://tomasrohr.org/wiki/doku.php?id=blog:merge_in_hive&amp;rev=1553864560&amp;do=diff</link>
        <description>Merge in Hive and Impala

Suppose you want to load data into an already populated Hive table and you want to apply data as update of existing records and insert of new records. Neither Hive nor Impala query language includes MERGE command (or UPSERT known from Teradata). Also it is a good practice to ovewrite a whole table partition at once. Here is the command which performs what you would achieve in RDBMS using</description>
    </item>
</rdf:RDF>
